First_Thunder@lemmy.zip to

Self Hosted - Self-hosting your services.@lemmy.ml · 2 days ago

Best way to search files on remote server?

Context: my father is a lawyer and therefore has a bajillion PDF files that were digitised and stored on a server. I’ve got an idea for how to run OCR on all of them.

But after that, how can I make them easily searchable? Keep in mind that, unfortunately, the directory structure is important information for classifying the files; for example, you may have a path like clientABC/caseAV1/d.pdf.

VoxAliorum@lemmy.ml
8 hours ago
Search them for words? Try pdfgrep with its recursive option (-r); it is very easy to set up and try. If that turns out to be too slow, you will probably need to accept some simplifications or helper structures.
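
For what it's worth, here is a rough Python sketch of how the pdfgrep route could be wrapped so that each hit keeps its folder path (client, case, and so on). It assumes pdfgrep is installed, that the PDFs already have a text layer from the OCR step, that no file path contains a colon, and that pdfgrep prints matches as path:page:text when run with -n and -H. The function names (build_command, parse_hit, search) and the archive folder name are made up for illustration.

```python
import subprocess
from pathlib import Path


def build_command(pattern, root):
    """Build a recursive, case-insensitive pdfgrep call."""
    # -r: recurse into subdirectories, -i: ignore case,
    # -n: print page numbers, -H: always print the file name
    return ["pdfgrep", "-r", "-i", "-n", "-H", pattern, str(root)]


def parse_hit(line, root):
    """Split one 'path:page:text' output line into its parts.

    Assumption: file paths contain no ':' characters.
    Returns None for any line that does not fit that shape.
    """
    fields = line.split(":", 2)
    if len(fields) != 3 or not fields[1].isdigit():
        return None
    path_str, page, text = fields
    try:
        # Keep the path relative to the search root, so the
        # folder names can be used to classify the hit.
        rel = Path(path_str).relative_to(root)
    except ValueError:
        return None
    return {
        "folders": list(rel.parts[:-1]),
        "file": rel.name,
        "page": int(page),
        "text": text.strip(),
    }


def search(pattern, root):
    """Run pdfgrep and return the parsed hits as a list of dicts."""
    # check=False: pdfgrep exits non-zero when nothing matches,
    # which is not an error for our purposes.
    result = subprocess.run(
        build_command(pattern, root),
        capture_output=True,
        text=True,
        check=False,
    )
    hits = (parse_hit(line, root) for line in result.stdout.splitlines())
    return [hit for hit in hits if hit is not None]
```

Calling search("contract", "archive") would then return one dict per matching line, with the folder names in "folders", which can be filtered or grouped as needed. This still rescans every PDF on each query, so it does not address the speed concern; it only shows how the directory structure can be carried along with the results.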
