
I wish every post/tool out there on scraping covered obeying robots.txt. It's a crap standard, but it's what we've got.

"Just ignore it" is a great way to identify yourself as a crappy netizen.



I'm glad all the sites you target want your scraper to access them. In many cases the whole point of using a scraper is to access information that isn't provided through an API and is otherwise encased in HTML. Most of those sites' robots.txt files read:

    User-Agent: *
    Disallow: /


Then we have no right to scrape that content.

Why is there an implied right to scrape?



