Yeah, Phi-2 is weird on chat, StableLM beats it on some metrics, Phi-2 does on others but also doesn't really have system integration yet.
The base model of StableLM 3b zephyr is actually under an even more permissive license (we didn't change in retrospect) and is the best base to train on for MacBooks with 8gb RAM, edge devices etc.
With LLM Farm quantised you can run it faster than you can read on a iPhone or whatever.
The base model of StableLM 3b zephyr is actually under an even more permissive license (we didn't change in retrospect) and is the best base to train on for MacBooks with 8gb RAM, edge devices etc.
With LLM Farm quantised you can run it faster than you can read on a iPhone or whatever.
https://huggingface.co/stabilityai/stablelm-3b-4e1t
It's also one of the only models with fully dataset, training and other transparency: https://stability.wandb.io/stability-llm/stable-lm/reports/S...