If you have 1,000 researchers working for your company and you constantly have dozens of different training runs on the go, overlapping each other, how would you split those salaries between those different runs?
Calculating the cost in terms of GPU-hours is a whole lot easier from an accounting perspective.
The papers I've seen that talk about training cost all do it in terms of GPU-hours. The gpt-oss model card said 2.1 million H100-hours for gpt-oss:120b. The Llama 2 paper said 3.31M GPU-hours on A100-80GB. They rarely give actual dollar costs, and I've never seen any of them include staffing hours.
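You can get a rough sense of scale by multiplying those figures by a rental rate, though the rates in this sketch are my own assumptions about ballpark cloud pricing, not anything from the model cards or papers:

```python
# Back-of-the-envelope conversion from published GPU-hours to dollars.
# The hourly rates are assumptions (rough cloud rental pricing), not
# figures from the gpt-oss model card or the Llama 2 paper.
assumed_rate_per_gpu_hour = {
    "H100": 2.50,       # assumed ~$2.50/hour
    "A100-80GB": 1.50,  # assumed ~$1.50/hour
}

published_runs = {
    "gpt-oss:120b": ("H100", 2_100_000),    # 2.1M H100-hours
    "Llama 2": ("A100-80GB", 3_310_000),    # 3.31M GPU-hours
}

for model, (gpu, hours) in published_runs.items():
    cost = hours * assumed_rate_per_gpu_hour[gpu]
    print(f"{model}: {hours:,} {gpu}-hours ≈ ${cost:,.0f}")
```

And of course that only prices the hardware time they chose to report, with no staffing costs at all.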
No, they don't! That's why the "$5.5 million" DeepSeek V3 number as read by American investors was total bullshit (because investors ignored their asterisk saying "only final training run").
Yeah, that's one of the most frustrating things about these published numbers. Nobody ever wants to share how much money they spent on runs that didn't produce a useful model.
As with staffing costs, though, it's hard to account for these against individual models. If Anthropic run a bunch of training experiments that help them discover a new training optimization, then use that optimization as part of the runs for the next Opus and Sonnet and Haiku (and every subsequent model for the lifetime of the company), how should the cost of that experimental run be divvied up?
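A toy illustration of how arbitrary that split is; every number below is invented for the example:

```python
# Toy illustration: the per-model cost of a shared experimental run depends
# entirely on which allocation rule you pick. All numbers are invented.
experiment_cost = 10_000_000  # hypothetical cost of the experiments

# Rule A: split evenly across the three models that ship next
per_model_next_three = experiment_cost / 3

# Rule B: amortize across every model expected to benefit from it
expected_beneficiary_models = 15  # hypothetical
per_model_amortized = experiment_cost / expected_beneficiary_models

print(f"Charged to next three models: ${per_model_next_three:,.0f} each")
print(f"Amortized across fifteen:     ${per_model_amortized:,.0f} each")
```

Same spend, a 5x difference in what shows up against any single model's "training cost".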