Certainly an interesting result, but remember that a single paper doesn’t prove anything. This will no doubt be studied extensively, and the findings will change over time as the tools develop.
Personally, I find the current tools don’t work great for large existing codebases and complex tasks. But I’ve found they can help me quickly make small scripts to save me time.
I know, it’s not the most glamorous application, but it’s what I find useful today. And I have confidence the tools will continue to improve. They hardly even existed a few years ago.
The AI tooling churn is so fast that by the time a study comes out, people will be able to say "well, they were using an older tool" no matter which tool the study used.
It's the eternal future.
"AI will soon be able to...".
There's an entire class of investment scammers who string their marks along, claiming the big payoff is just around the corner while they fleece the victim with the death of a thousand cuts.
What is the problem with this, exactly? It's a valid criticism of the study (as applied to current agentic coding practices). The pace of progress is rough for researchers, in some sense, but this is the reality right now.
Not really. Chatting with an LLM was cutting edge for three years; it's only in the last 8-10 months, with Claude Code and the Gemini CLI, that we've had the next big change in how we interact with LLMs.
I can't speak to how they're technically different, but in practice, Cursor was basically useless for me, and Claude Code works well. Even with Cursor using Claude's models.
If there are paradigm-shattering improvements every six months, every study that is ever released will be "behind" or "using an older tool." In six months, when a study comes out using Claude Code, people dissatisfied with its results will be able to point to the newest hotness, ad infinitum.
https://metr.org/blog/2025-07-10-early-2025-ai-experienced-o...