Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Serving models is currently expensive. I'd argue that some big cloud providers have conspired to make egress bandwidth expensive.

Cloudflare R2 has unlimited egress, and AFAIK, that's what ollama uses for hosting quantized model weights.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: