
They responded to my tweet last year and said they didn't quantize the models.


It's very hard to find right now, but I'm sure they said they don't quantize the KV cache, though their weights are in fp8.



