it uses a fractional amount of GPUs though.

breadwinner · 2025-01-26T21:45:55 1737927955

As it says in the article, you are talking about a mere constant of proportionality, a single multiple. When you're dealing with an exponential growth curve, that stuff gets washed out so quickly that it doesn't end up matter all that much.

Keep in mind that the goal everyone is driving towards is AGI, not simply an incremental improvement over the latest model from Open AI.

high_na_euv · 2025-01-27T11:05:18 1737975918

Why do you assume that exponential growth curve is real?

UltraSane · 2025-01-27T00:14:01 1737936841

Jevons Paradox states that increasing efficiency can cause an even larger increase in demand.

cma · 2025-01-26T22:25:35 1737930335

Their loss curve with the RL didn't level off much though, could be taken a lot further and scaled up to more parameters on the big nvidia mega clusters out there. And the architecture is heavily tuned to nvidia optimizations.

ithkuil · 2025-01-26T22:02:15 1737928935

Which due to the Jevons Paradox may ultimately cause more shovels to be sold

dutchbookmaker · 2025-01-27T00:19:52 1737937192

"wait" I suspect we are all in a bit of denial.

When was the last time the US got their lunch ate in technology?

Sputnik might be a bit hyperbolic but after using the model all day and as someone who had been thinking of a pro subscription, it is hard to grasp the ramifications.

There is just no good reference point that I can think of.