_Could_ they have done the same thing with a tiny fraction of the money? Grok 3's benchmarks are SOTA for both the base model and the reasoning model. By definition, nobody has been able to do the same thing with any amount of money (discounting o3, which has been teased but is unreleased). That may change in the future! But as of now this is the case.
So apart from the fact that SOTA doesn't mean anything in the real world (there's no monetisation, there's no moat): please, it's benchmarks. We've all known how to beat those since 2023.
Why don't you do it then? If you're referring to DeepSeek's "$5M", you'd be interested to know that they pay seven-figure salaries and reportedly own $2B worth of H100s[1].
I just wonder if it matters. If Google had spent 10x as much in the first 5 years of its life, would it be a worse company now? Giant TAM, winner takes all (or most?); all that matters is winning.
People like Demis Hassabis and Dario Amodei say that R1's efficiency gains are exaggerated. The $5M training cost appears to be misleading, as sources suggest they own far more GPUs than that figure implies.