Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

New benchmark for competitive coding dropped yesterday - https://livecodebenchpro.com/

Apparently models are not doing great for problems out of distribution.



It goes to show that the LLMs aren't intelligent in the way humans are. LLMs are a really great replacement for googling though




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: