> We believe DeepSeek has access to around 10,000 of these H800s and about 10,000 H100s. Furthermore they have orders for many more H20’s, with Nvidia having produced over 1 million of the China specific GPU in the last 9 months.
that report is lazy. they assume all GPUs owned (openly reported) by the parent company (a hedge fund which claims to use those GPUs to generate trades) were used by the invested company.
that's as dumb as saying coca cola have acccess to all offices of Berkshire Hathaway.
likewise, all comments praising deepseek history are also misleading as the company barely exists for a year.
everything is opaque marketing being repeated. just drop the off topic bla bla bla and focus on the facts and code in front of you.
so instead of trusting the company that just released the most competitive LLM model with clear positive impacts on numerous small countries & a combined billions of population which would otherwise be excluded from having such access, you choose to be believe some online big mouth called semianalysis.com.
Hey, could you please make your points without resorting to the flamewar style? You've done that repeatedly in this thread, as well as in other threads recently (e.g. https://news.ycombinator.com/item?id=43035040). This is not what HN is for, and destroys what it is for.
If you wouldn't mind reviewing https://news.ycombinator.com/newsguidelines.html and taking the intended spirit of the site more to heart, we'd be grateful. The basic idea is to make your substantive points thoughtfully, regardless of how wrong anyone else is or you feel they are.
Parent Highflyer hedgefund only been around for a few years with 8B AUM, aka their single digit % management fees since founding is in low 100s millions total (for all operating expenses), hence fiscally cannot acquire 1B+ of just hardware capex. Deepseek having access to that much hardware doesn't pass basic smell test, and semi analysis has been dodging call outs on socials for this basic math illiteracy.
They claim that a small Chinese hedge fund could acquire $1bln in GPUs, with no state support, including many sanctioned chips, then trained a model optimized for a far smaller server compute size, and that they have a source at this very small fund who is willing to admit to export violations. A 40bln param active model is exactly the size you would expect from a server of the size they claim.
What’s more likely - that semianalysis made it up like they have a bunch of other things, or that all the above is true?