Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

One can forgive the lack of quality results for the 70B model, but apparently they trained 7B and 13B versions of their model, and don't report those either.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: