Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

4-bit quantized 33b runs great on a mp pro with m3 max chip



I'm using the 5-bit quant with llama.cpp and it's excellent on my M2 96GB MacBook! Running this model + Mixtral will be fun.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: