Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Where does one find a good robots.txt? Are there any well maintained out there?



Cloudflare actually has this as a free tier feature so even if you don't want to use it for your site you can just setup a throwaway domain on Cloudflare and periodically copy the robots.txt they generate from your scraper allow/block preferences, since they'll be keeping up to date with all the latest.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: