Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Any scraper is also a “user agent doing work for users”. Which ones should respect robots.tx?


Does the user agent fit the definition of a web crawler? If so, then observe robots.txt. This one does not, see https://en.m.wikipedia.org/wiki/Web_crawler




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: