Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Even more important than that for me (possibly for you too) is that you make sure that none of these pages make it into googles index.

The duplication of content (potentially sending the original pages down in search ranks) and the fact that you are polluting the organic search results for the sites you mirror could be a big issue for the owners of the pages.



Good point! There is a robots.txt that prevents the site from getting indexed now: http://hn.getpageback.com/robots.txt




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: