Nice list. I tried at some point to analyze html using a tree-sitter grammar and generate a list of articles, index them, and be on alert every so often for new entries.
RSS feed could be generated automatically with some AI code generator (or tree-sitter query generator), and just parsing the elements of the page.
Eventually i failed, but also i didn't try hard enough.
1. https://github.com/0x2E/fusion - A lightweight, self-hosted friendly RSS aggregator and reader
2. https://rawweb.org/ - A search engine for indie websites (the crawler collects data from RSS feeds)
3. https://github.com/0x2E/rss-finder - A tool for finding the RSS link of a website