Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

There are many lists, but I find all of them outdated or containing wrong information or missing the actual benchmarks I'm looking for.

I was thinking, that maybe it's better to make my own benchmarks with the questions/things I'm interested in, and whenever a new model comes out run those tests with that model using open-router.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: