Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Cloudflare AI and Replicate are great for running off-the-shelf models, but anything custom is going to incur a 10+ minute cold start.

For running custom fine-tuned models on serverless, you could look into https://beam.cloud which is optimized for serving custom models with extremely fast cold start (I'm a little biased since I work there, but the numbers don't lie)




Thanks! Looks promising from the outside. Will surely check out


Why would it incur a cold start of 10 minutes on cloudflare? :O

Any proof?




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: