For self-hosting, there are a few companies that offer per-token pricing for LoR...

delijati · 2025-07-29T22:50:52 1753829452

Do you maybe know if there is a company in the EU that hosts models (DeepSeek, Qwen3, Kimi)?

reissbaker · 2025-07-30T04:27:04 1753849624

Most inference companies (Synthetic included) host in a mix of the U.S. and EU — I don't know of any that promise EU-only hosting, though. Even Mistral doesn't promise EU-only AFAIK, despite being a French company. I think at that point you're probably looking at on-prem hosting, or buying a maxed-out Mac Studio and running the big models quantized to Q4 (although even that couldn't run Kimi: you might be able to get it working over ethernet with two Mac Studios, but the tokens/sec will be pretty rough).