Hacker News new | past | comments | ask | show | jobs | submit login

“Pure garage-energy” with 10,000 A100s, apparently. I’d love to have a garage like that.



From https://semianalysis.com/2025/01/31/deepseek-debates/

> We believe DeepSeek has access to around 10,000 of these H800s and about 10,000 H100s. Furthermore they have orders for many more H20’s, with Nvidia having produced over 1 million of the China specific GPU in the last 9 months.


The paper in the repo says: “ For DL training, we deployed the Fire-Flyer 2 with 10,000 PCIe A100 GPUs“


that report is lazy. they assume all GPUs owned (openly reported) by the parent company (a hedge fund which claims to use those GPUs to generate trades) were used by the invested company.

that's as dumb as saying coca cola have acccess to all offices of Berkshire Hathaway.

likewise, all comments praising deepseek history are also misleading as the company barely exists for a year.

everything is opaque marketing being repeated. just drop the off topic bla bla bla and focus on the facts and code in front of you.

thanks for coming to my ted talk.


so instead of trusting the company that just released the most competitive LLM model with clear positive impacts on numerous small countries & a combined billions of population which would otherwise be excluded from having such access, you choose to be believe some online big mouth called semianalysis.com.

love your joke!


Hey, could you please make your points without resorting to the flamewar style? You've done that repeatedly in this thread, as well as in other threads recently (e.g. https://news.ycombinator.com/item?id=43035040). This is not what HN is for, and destroys what it is for.

If you wouldn't mind reviewing https://news.ycombinator.com/newsguidelines.html and taking the intended spirit of the site more to heart, we'd be grateful. The basic idea is to make your substantive points thoughtfully, regardless of how wrong anyone else is or you feel they are.


Didn't the deepseek paper itself state they trained on 2048 H200s?

Claiming they have access to 5x this amount is not such a bold claim?


Appeals to authority are so totally unconvincing.

What claims from the semianalysis article do you think are false? And based on what evidence?


Parent Highflyer hedgefund only been around for a few years with 8B AUM, aka their single digit % management fees since founding is in low 100s millions total (for all operating expenses), hence fiscally cannot acquire 1B+ of just hardware capex. Deepseek having access to that much hardware doesn't pass basic smell test, and semi analysis has been dodging call outs on socials for this basic math illiteracy.


[flagged]


Semianalysis is a trusted and reputable source that has deep knowledge of the semiconductor industry and supply chains. So yes, they know.

> love your joke!

> Get a real life please.

> Too bad that you didn't learn this back in school.

Read the site guidelines, as you’re repeatedly resorting to personal attacks in your comments on discussions around DeepSeek or China.

https://news.ycombinator.com/newsguidelines.html


SemiAnalysis has made up many things.

They claim that a small Chinese hedge fund could acquire $1bln in GPUs, with no state support, including many sanctioned chips, then trained a model optimized for a far smaller server compute size, and that they have a source at this very small fund who is willing to admit to export violations. A 40bln param active model is exactly the size you would expect from a server of the size they claim.

What’s more likely - that semianalysis made it up like they have a bunch of other things, or that all the above is true?


They had their A100s back in 2021 to early 2022, well before any GPU sanction kicked in.

For a few months H800 wasn't sanctioned and that's when they bought them.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: