Context: my father is a lawyer and therefore has a bajillion pdf files that were digitised, stored in a server. I’ve gotten an idea on how to do OCR in all of them.

But after that, how can I make them easily searchable? (Keep in mind that unfortunately, the directory structure is important information to classify the files, aka you may have a path like clientABC/caseAV1/d.pdf

  • purplemonkeymad@programming.dev
    link
    fedilink
    arrow-up
    2
    ·
    1 day ago

    What is the server are they on?

    If they are just on a windows server, then the indexing service is actually good for fast results on a network share. If it’s a windows 10/11 pc, I think you need to enable classic search for it to provide results to clients over the network.

    Alternatively I believe everything (the program) supports indexing network locations.