It seems like a pointless discussion since DeepSeek uses Nvidia GPUs after all.

jjeaff · 2025-01-26T21:03:24 1737925404

it uses a fractional amount of GPUs though.

breadwinner · 2025-01-26T21:45:55 1737927955

As it says in the article, you are talking about a mere constant of proportionality, a single multiple. When you're dealing with an exponential growth curve, that stuff gets washed out so quickly that it doesn't end up matter all that much.

Keep in mind that the goal everyone is driving towards is AGI, not simply an incremental improvement over the latest model from Open AI.

high_na_euv · 2025-01-27T11:05:18 1737975918

Why do you assume that exponential growth curve is real?

UltraSane · 2025-01-27T00:14:01 1737936841

Jevons Paradox states that increasing efficiency can cause an even larger increase in demand.

cma · 2025-01-26T22:25:35 1737930335

Their loss curve with the RL didn't level off much though, could be taken a lot further and scaled up to more parameters on the big nvidia mega clusters out there. And the architecture is heavily tuned to nvidia optimizations.

ithkuil · 2025-01-26T22:02:15 1737928935

Which due to the Jevons Paradox may ultimately cause more shovels to be sold

dutchbookmaker · 2025-01-27T00:19:52 1737937192

"wait" I suspect we are all in a bit of denial.

When was the last time the US got their lunch ate in technology?

Sputnik might be a bit hyperbolic but after using the model all day and as someone who had been thinking of a pro subscription, it is hard to grasp the ramifications.

There is just no good reference point that I can think of.

blackeyeblitzar · 2025-01-26T23:34:51 1737934491

Yep some CEO said they have 50K GPUs of the prior generation. They probably accumulated them through intermediaries that are basically helping nvidia sell to sanctioned parties by proxy

idonotknowwhy · 2025-01-27T00:41:36 1737938496

Deepseek was there side project. They had a lot of GPUs from their crypto mining project.

Then Ethereum turned off PoW mining, so they looked into other things to do with their GPUs, and started DeepSeek.

saagarjha · 2025-01-27T00:58:55 1737939535

Mining crypto on H100s?