Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Great minds! Thanks, I'll try those model outs. What are Gemma and Qwen? How do they compare to the new deepseek models?


You're not running a deepseek model on your macbook.

Gemma is Google's distillation of their larger Gemini model (at least that's my understanding.) Qwen is alibab's model. Qwen is usually very good at code, gemma tends to be a little better at everything else.

There are Deepseek distills that use either qwen or gemma as a base. I haven't been impressed with them though. TBH I've felt like most of the reasoning models are overhyped.


Cool, I'll try them out and see which I like best. Good to know that deepseek distills are not the move. I'm excited on being able to take pictures of plants/trees/other things and get information.

Any tips or fun ways you used your local model while camping?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: