
'seo' appears only twice in this but deserves way more blame imo

The life cycle of an article now seems to be 1) legit author in legit place, possibly paywalled, 2) worse copies elsewhere, with or without sources

(I can't say how prevalent this pattern is, but you've seen it too, and if google has any nlp chops at all they are able to measure it in their index)

even bad content doesn't come out of nowhere. an indexing platform that tried to tease out the 'copy of a copy' structure of the web would have a much easier time filtering out the blurry copies.
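rough sketch of what i mean by teasing out the 'copy of a copy' structure. everything here is an assumption for illustration: the `Page` record, the `first_seen` crawl timestamp, and word shingles + jaccard overlap as a stand-in for whatever similarity measure a real index would use. each page gets linked to the most similar page that was seen earlier; pages with no earlier match look like originals.

```python
from dataclasses import dataclass


@dataclass
class Page:
    url: str
    first_seen: int  # hypothetical crawl timestamp (smaller = earlier)
    text: str


def shingles(text: str, k: int = 3) -> set:
    """Return the set of k-word shingles in the text, lowercased."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard overlap of two shingle sets (0.0 if either is empty)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def likely_sources(pages, threshold=0.3):
    """Map each url to the most similar earlier-seen url, or None if it looks original.

    The threshold is an arbitrary illustrative value, not a tuned one.
    """
    ordered = sorted(pages, key=lambda p: p.first_seen)
    sigs = {p.url: shingles(p.text) for p in ordered}
    parent = {}
    for i, page in enumerate(ordered):
        parent[page.url] = None
        best_score = 0.0
        for earlier in ordered[:i]:
            score = jaccard(sigs[page.url], sigs[earlier.url])
            # strict '>' means ties go to the earliest-seen candidate
            if score >= threshold and score > best_score:
                best_score = score
                parent[page.url] = earlier.url
    return parent
```

following the parent links back gives you the chain of copies, and anything with a non-None parent is a candidate blurry copy. this is quadratic in the number of pages, so at web scale you'd need something like minhash/LSH instead of all-pairs comparison.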

not saying this is google's job, but they are the architects of the incentive structure here, and are in the best position to fix it if they want to


