Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Won't machine learning largely solve this problem? Fine, don't provide an API, but I can extract a useful JSON document from your HTML representation.


If you're referring to reddit you can just add .json to the URL. eg: https://www.reddit.com/r/blog.json or https://www.reddit.com/r/teslamotors/comments/149ad64/teslas...


As far as I'm aware, Reddit still allows you to append .json to any of their pages and you get the results as a nicely formatted json document.

No LLM required.


The question is: will that still be available after the API is paywalled?


I imagine there's ways to curtail that (like detecting non-human users).




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: