Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

And nowadays a better known benchmark, so data scientists can overfit their models to it even more, even when LLMs are famous for overfitting. So, I wouldn’t trust any results regarding this specific test nowadays.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: