Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I'm curious how you wrote the logic for recipe detection? I've know some others who've tried to solve this and like many pattern recognition tasks it turns out to be harder than you'd think, but not impossible of course. Just curious how you did it.


So schema.org has a standard json schema for recipes. Crawlers use this information to parse and figure out what kind of content lives on the page.

Since google encourages this and ranks pages with schema higher, all recipe blogs and sites add this schema to their html.

This is structured data. I just parse this json.

Not all recipes follow this though. So you may occasionally see error messages for old sites that have not implemented schema.org structure.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: