14
submitted 1 year ago* (last edited 1 year ago) by aGeN@sh.itjust.works to c/selfhosted@lemmy.world

I'm having a play about with Yacy today. I have had the thoughts that it would be great to have the possibility to search through my paperless-ngx documents. Is it doable?? Has anyone got it working??

top 10 comments
sorted by: hot top controversial new old
[-] aGeN@sh.itjust.works 3 points 1 year ago

You can do a search on paperless and get a json output with the following http://192.168.1.34:8001/api/documents/?query=test but Yacy cant crawl paperless?

[-] johntash@eviltoast.org 1 points 1 year ago

Maybe you could add it to SearXNG as it's own engine?

[-] aGeN@sh.itjust.works 2 points 1 year ago* (last edited 1 year ago)

That seems like it could work. I seen SearXNG has a template engine to use. Just need to figure what how to use it with paperless :)

[-] johntash@eviltoast.org 1 points 1 year ago

I'd be curious how well it works if you try it. I kind of want to, but I'm not sure how I feel about letting something unauthenticated (SearXNG) access my paperless instance with some personal docs in

[-] aGeN@sh.itjust.works 2 points 1 year ago

I have it working!! Once I get the configs tidied up I'll share them :)

[-] aGeN@sh.itjust.works 1 points 1 year ago

OH!! with searxng not yacy

[-] rubenix_bcn@lemmy.world 1 points 11 months ago

i know this is quite old, anyways how did u manage to work @aGen@sh.itjust.works? can u share configs pls?

[-] eleitl@lemmy.ml 2 points 1 year ago

YaCy indexes http content, so if your documents are all reachable via a http interface they can be indexed.

[-] troy@programming.dev 1 points 1 year ago

Paperless will store documents in plain text. Maybe one could write a small webserver or extend Paperless to serve an RSS feed that could be consumed by Yacy.

load more comments
view more: next ›
this post was submitted on 23 Jul 2023
14 points (100.0% liked)

Selfhosted

40247 readers
591 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 1 year ago
MODERATORS