Hacker News
Cerebras runs Llama 3.1 70B at 2,100 tokens per second, live demo available (twitter.com/cerebrassystems)
6 points by modeless on Oct 24, 2024 | past
Cerebras Inference (twitter.com/cerebrassystems)
4 points by montyanderson on Aug 27, 2024 | past
