New benchmark for competitive coding dropped yesterday - https://livecodebenchpr... | Hacker News


Snuggly73 79 days ago | on: Generative AI coding tools and agents do not work ...

New benchmark for competitive coding dropped yesterday - https://livecodebenchpro.com/

Apparently models are not doing great for problems out of distribution.

p1dda 79 days ago

It goes to show that LLMs aren't intelligent in the way humans are. LLMs are a really great replacement for googling, though.
