Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

From my understanding, Google put online the largest RL cluster in the world not so long ago. It's not surprising they do really well on things that are "easy" to RL, like math or SimpleQA


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: