I’m looking for a self-hosted solution to this problem:

I want to create a full-text search index from a collection of PDF manuals (text, not images; I don’t care about OCR here). There should be a UI to search for text matches in the documents, and clicking a search hit should open the PDF scrolled to where the hit is (bonus points if the hit is highlighted).

  • doeknius_gloek@feddit.de
    1 year ago

    That’s a very specific problem and I don’t know if there is an existing solution that does exactly what you want.

    paperless-ngx does a lot of what you ask for: it lets you upload PDFs, does OCR, and gives you full-text search via a web UI. It’s just not made specifically for manuals, and it does not highlight the search hits or scroll to them.
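
If nothing off the shelf fits, the "open the PDF at the hit" part can be approximated by indexing per page instead of per document and linking to the PDF with a `#page=N` fragment, which common browser PDF viewers honor. Below is a rough Python sketch of that idea, not a finished tool. It assumes the page texts have already been extracted by some PDF text library, and the names `build_index`, `search`, and `hit_url` are made up for illustration. A real setup would likely swap the toy index for a proper search engine, and highlighting is not covered.

```python
import re
from collections import defaultdict
from urllib.parse import quote


def build_index(docs):
    """Build a word -> {(path, page)} index.

    `docs` maps a PDF path to a list of page texts, page 1 first.
    Extracting those texts from the PDFs is assumed to happen elsewhere.
    """
    index = defaultdict(set)
    for path, pages in docs.items():
        for page_no, text in enumerate(pages, start=1):
            for word in re.findall(r"\w+", text.lower()):
                index[word].add((path, page_no))
    return index


def search(index, query):
    """Return sorted (path, page) pairs whose page contains every query word."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)


def hit_url(path, page_no):
    """Build a link that asks the PDF viewer to open at the given page."""
    return f"{quote(path)}#page={page_no}"


# Hypothetical sample data standing in for extracted manual pages.
docs = {
    "manuals/router.pdf": ["Safety notes", "Press the reset button for ten seconds"],
    "manuals/printer.pdf": ["Replace the toner cartridge", "Reset the page counter"],
}
index = build_index(docs)
for path, page_no in search(index, "reset button"):
    print(hit_url(path, page_no))
```

Indexing by page keeps the link target simple: each hit already knows which page to open, so the UI only has to render the result list and the links.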