37
submitted 6 months ago by Meuzzin@lemmy.world to c/selfhosted@lemmy.world

Heyas, wondering if there's an open sourced piece of software or the like, that could scrape media platforms for a specific topic. Platforms like YT, X, Lemmy, News Media, etc., perhaps using RSS? But, a program I can host on my server, that only I have access too, via webpage, CLI, whatever...

Thanks for any info...

you are viewing a single comment's thread
view the rest of the comments
[-] november@iusearchlinux.fyi 25 points 6 months ago* (last edited 6 months ago)

FreshRSS has been working great for me! It even has the ability for web scraping if you need it.

[-] Meuzzin@lemmy.world 3 points 6 months ago

Right when I saw you reply, I saw a post about it. Digging in to it now. Thanks!

[-] charles@lemmy.ca 1 points 6 months ago* (last edited 6 months ago)

Seconding the recommendation for FreshRSS, it's the one I ended up hosting when I looked into this a while back and it's been really great. Takes a minute to get everything setup, especially if you want to have different settings for different types of feeds, but once it's all set it's perfect (for my needs at least).

I've also got it setup with my domain so I can access the feed from anywhere and that's been one of my favourite features.

this post was submitted on 21 Dec 2023
37 points (100.0% liked)

Selfhosted

37765 readers
265 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 1 year ago
MODERATORS