
Deepseek-coder-6.7B really is a surprisingly capable model. It's easy to give it a spin with ollama via `ollama run deepseek-coder:6.7b`.



Thanks for the ollama tip.


If you do:

1. `ollama run deepseek-coder:6.7b`

2. `pip install litellm`

3. `litellm --model ollama/deepseek-coder:6.7b` (litellm expects the `ollama/` provider prefix)

You will have a local OpenAI-compatible API for it.
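Once the proxy is up, any OpenAI-style client can talk to it. A minimal sketch using only the Python standard library — the port (litellm prints the address it binds to on startup; 8000 is assumed here) and the `/chat/completions` path are assumptions to verify against your litellm version:

```python
import json
import urllib.request

def build_chat_request(prompt, base_url="http://localhost:8000"):
    """Build an OpenAI-style chat completion request for a local litellm proxy.

    The port and path are assumptions -- check litellm's startup output
    for the address it actually binds to.
    """
    payload = {
        "model": "deepseek-coder:6.7b",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

req = build_chat_request("Write fizzbuzz in Python")
print(req.full_url)
# To actually send it (requires the proxy to be running):
#   resp = urllib.request.urlopen(req)
#   print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the request body is plain OpenAI-shaped JSON, the same payload works with the official `openai` client by pointing its base URL at the proxy.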


ollama is actually not a great way to run these models as it makes it difficult to change server parameters and doesn't use `mlock` to keep the models in memory.


What do you suggest?


Vanilla llama.cpp (build and run its `server` example).
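A server started that way (e.g. `./server -m model.gguf --mlock --port 8080`, where the model path is illustrative) exposes an HTTP `/completion` endpoint. A sketch of a client request against it, stdlib only — the host, port, and parameter names are assumptions to check against the server README for your llama.cpp build:

```python
import json
import urllib.request

def build_completion_request(prompt, base_url="http://localhost:8080"):
    """Build a request for llama.cpp's example server.

    "prompt" and "n_predict" (max tokens to generate) are the parameter
    names used by the example server; verify them for your build.
    """
    payload = {"prompt": prompt, "n_predict": 128}
    return urllib.request.Request(
        f"{base_url}/completion",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

req = build_completion_request("// quicksort in C\n")
print(req.full_url)
# With the server running, the generated text comes back in the
# "content" field:  json.load(urllib.request.urlopen(req))["content"]
```

Running the server yourself is what makes flags like `--mlock` (keep the model resident in RAM) and the context size directly controllable, which is the advantage over ollama mentioned above.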



