Surprised that "controlling cost" isn't a section in this post. Here's my attemp...

sagarpatil · 2025-04-20T05:01:05 1745125265

If I have to be so cautious while using a tool might as well write the code myself lol. I’ve used Claude Code extensively and it is one of the best AI IDE. It just gets things done. The only downside is the cost. I was averaging $35-$40/day. At this cost, I’d rather just use Cursor/Windsurf.

BeetleB · 2025-04-19T16:42:05 1745080925

Oh wow. Reading your comment guarantees I'll never use Claude Code.

I use Aider. It's awesome. You explicitly specify the files. You don't have to do work to limit context.

jjallen · 2025-04-19T23:14:30 1745104470

Not having to specify files is a humongous feature for me. Having to remember which file code is in is half the work once you pass a certain codebase size.

m3kw9 · 2025-04-20T02:33:21 1745116401

That sometimes work sometimes doesn’t and takes 10x time. Same with codex. I would have both and switch between them depending on what you feel will get it right better

LeafItAlone · 2025-04-19T20:48:40 1745095720

Aider is a great tool. I do love it. But I find I have to do more with it to get the same output as Claude Code (no matter what LLM I used with Aider). Sure it may end up being cheaper per run, but not when my time is factored in. The flip side is I find Aider much easier to limit.

Game_Ender · 2025-04-19T21:41:29 1745098889

What are those extra things you have to do more of? I only have experience with Aider so I am curious what I am missing here.

simonw · 2025-04-19T22:33:42 1745102022

With Claude Code you can at least type "/code" at any point to see how much it's spent, and it will show you when you end a session (with Ctrl+C) too.

The output of /cost looks like this:

  > /cost 
    ⎿  Total cost: $0.1331
       Total duration (API): 1m 13.1s
       Total duration (wall): 1m 21.3s

boredtofears · 2025-04-19T16:55:52 1745081752

Yeah, I tried CC out and quickly noticed it was spending $5+ for simple LLM capable tasks. I rarely break $1-2 a session using aider. Aider feels like more of a precision tool. I like having the ability to manually specify.

I do find Claude Code to be really good at exploration though - like checking out a repository I'm unfamiliar with and then asking questions about it.

Jerry2 · 2025-04-19T20:41:50 1745095310

>I use Aider. It's awesome.

What do you use for the model? Claude? Gemini? o3?

m3kw9 · 2025-04-20T02:34:36 1745116476

Gemini 2.5 pro is my choice

kiratp · 2025-04-19T19:17:44 1745090264

The productivity boost can be so massive that this amount of fiddling to control costs is counterproductive.

Developers tend to seriously underestimate the opportunity cost of their own time.

Hint - it’s many multiples of your total compensation broken down to 40 hour work weeks.

Aurornis · 2025-04-20T02:52:13 1745117533

The cost of the task scales with how long it takes, plus or minus.

Substitute “cost” with “time” in the above post and all of the same tips are still valuable.

I don’t do much agentic LLM coding but the speed (or lack thereof) was one of my least favorite parts. Using any tricks that narrow scope, prevent reprocessing files over and over again, or searching through the codebase are all helpful even if you don’t care about the dollar amount.

pizza · 2025-04-19T20:48:29 1745095709

Hard agree. Whether it's 50 cents or 10 dollars per session, I'm using it to get work done for the sake of quickly completing work that aims to unblock many orders of magnitude more value. But in so far as cheaper correct sessions correlate with sessions where the problem solving was more efficient anyhow, they're fairly solid tips.

afiodorov · 2025-04-19T22:35:06 1745102106

I agree but optimisation often reveals implementation details helping to understand limits of current tech more. It might not be worth the time but part of engineering is optimisation and another part is deep understanding of tech. It is sometimes worth optimising anyway if you want to take the engineering discipline to the next level within yourself.

I myself didn’t think about not running linters however it makes obvious sense now and gives me the insight about how Claude Code works allowing me to use this insight in related engineering work.

pclmulqdq · 2025-04-19T19:08:01 1745089681

It's interesting that this is a problem for people because I have never spent more than about $0.50 on a task with Claude Code. I have pretty good code hygiene and I tell Claude what to do with clear instructions and guidelines, and Claude does it. I will usually go through a few revisions and then just change anything myself if I find it not quite working. It's exactly like having an eager intern.

jjmarr · 2025-04-19T19:19:20 1745090360

I don't think about controlling cost because I price my time at US$40/h and virtually all models are cheaper than that (with the exception of o1 or Gemini 2.5 pro).

If I spend $2 instead of $0.50 on a session but I had to spend 6 minutes thinking about context, I haven't gained any money.

jasonjmcghee · 2025-04-19T19:49:29 1745092169

If you do it a bit, it just becomes habit / no extra time or cognitive load.

Correlation or causation aside, the same people I see complain about cost, complain about quality.

It might indicate more tightly controlled sessions may also produce better results.

Or maybe it's just people that tend to complain about one thing, complain about another.

owebmaster · 2025-04-19T19:47:31 1745092051

Important to remind people this is only true if you have a profitable product, otherwise you’re spending money you haven’t earned.

jjmarr · 2025-04-19T23:14:07 1745104447

If what I'm doing doesn't have a positive expected value, the correct move isn't to use inferior dev tooling to save money, it's to stop working on it entirely.

oezi · 2025-04-20T08:35:19 1745138119

There might be value but you might not receive any of it. Most salaried employees won't see returns.

ngruhn · 2025-04-20T07:37:05 1745134625

Come on, every hobby has negative expected value. You're not doing it for the money but it still makes sense to save money.

jasonjmcghee · 2025-04-19T19:51:49 1745092309

If your expectation is to produce the same amount of output, you could argue when paying for AI tools, you're choosing to spend money to gain free time.

4 hours coding project X or 3 hours and a short hike with your partner / friends etc

irthomasthomas · 2025-04-20T00:11:07 1745107867

I assume they use a conversation, so if you compress the prompt immediately you should only break cache once, and still hit cache on subsequent prompts?

So instead of Write Hit Hit Hit

It's Write Write Hit Hit Hit

gundmc · 2025-04-19T21:21:21 1745097681

Never edit files manually during a session (that'll bust cache). THIS INCLUDES LINT

Yesterday I gave up and disabled my format-on-save config within VSCode. It was burning way too many tokens with unnecessary file reads after failed diffs. The LLMs still have a decent number of failed diffs, but it helps a lot.

chewz · 2025-04-19T19:29:18 1745090958

My attempt is - Do not use Claude Code at all, it is terrible tool. It is bad at almost everything starting with making simple edits to files.

And most of all Claude Code is overeager to start messing with your code and run unnecessary $$ instead of making sensible plan.

This isn't problem with Claude Sonnet - it is fundamnetal problem with Claude Code.

winrid · 2025-04-19T19:41:00 1745091660

I pretty much one shot a scraper from an old Joomla site with 200+ articles to a new WP site, including all users and assets, and converting all the PDFs to articles. It cost me like $3 in tokens.

hu3 · 2025-04-19T20:49:29 1745095769

I guess the question the is: can't VScode Copilot do the same for a fixed $20/month? It even has access to all SOTA models like Claude 3.7, Gemini 2.5 Pro and GPT o3

mceachen · 2025-04-19T21:21:47 1745097707

Vscode’s agent mode in copilot (even in the insider’s nightly) is a bit rough in my experience: lots of 500 errors, stalls, and outright failures to follow tasks (as if there’s a mismatch between what the ui says it will include in context vs what gets fed to the LLM).

darksaints · 2025-04-19T21:20:52 1745097652

I would have thought so, but somehow no. I have a cursor subscription with access to all of those models, and I still consistently get better results from claude code.

winrid · 2025-04-19T21:22:16 1745097736

I haven't tried copilot. Mostly because I don't use VSCode, I use jetbrains ides. How do they provide Claude 3.7 for $20/mo with unlimited usage?

oezi · 2025-04-20T08:36:53 1745138213

By providing bad UI that you don't use it so much.

troupo · 2025-04-19T20:51:04 1745095864

was it a wget call feeding into html2pdf?

winrid · 2025-04-19T21:20:43 1745097643

no it's a few hundred lines of python to parse weird and inconsistent HTML into json files and CSV files, and then a sync script that can call the WP API to create all the authors as needed, update the articles, and migrate the images

SoftTalker · 2025-04-20T03:06:21 1745118381

Plumbing to pipe shit from one sewer to another.

winrid · 2025-04-20T04:55:24 1745124924

Yep, don't wanna spend more of my life doing that than I have to!

bugglebeetle · 2025-04-19T15:42:53 1745077373

If I have to spend this much time thinking about any of this, congratulations, you’ve designed a product with a terrible UI.

djtango · 2025-04-20T09:37:42 1745141862

I have been quite skeptical of using AI tools and my experiences using them have been frustrating for developing software but power tools usually come with a learning curve while "good product" with clean simplified interface often results in reduced capability.

VIM, Emacs and Excel are obvious power tools which may require you to think but often produce unrivalled productivity for power users

So I don't think the verdict that the product has a bad UI is fair. Natural language interfaces is such a step up from old school APIs with countless flags and parameters

jasonjmcghee · 2025-04-19T15:53:11 1745077991

Some tools take more effort to hold properly than others. I'm not saying there's not a lot of room for improvement - or that the ux couldn't hold the users hand more to force things like this in some "assisted mode" but at the end of the day, it's a thin, useful wrapper around an llm, and llms require effort to use effectively.

I definitely get value out of it- more than any other tool like it that I've tried.

oxidant · 2025-04-19T19:30:00 1745091000

Think about what you would do in an unfamiliar project with no context and the ticket

"please fix the authorization bug in /api/users/:id".

You'd start by grepping the code base and trying to understand it.

Compare that to, "fix the permission in src/controllers/users.ts in the function `getById`. We need to check the user in the JWT is the same user that is being requested"

troupo · 2025-04-19T20:52:29 1745095949

So, AIs are overeager junior developers at best, and not the magical programmer replacements they are advertised as.

oezi · 2025-04-20T08:38:14 1745138294

As of April 2025. The pace is so fast that it will overtake seniors within years maybe months.

apwell23 · 2025-04-20T09:11:36 1745140296

overtake ceo by 2026

lacker · 2025-04-19T23:44:51 1745106291

Let's split the difference and call them "magical overeager junior developer replacements".

xpe · 2025-04-19T21:09:43 1745096983

> So, AIs are overeager junior developers at best, and not the magical programmer replacements they are advertised as.

This may be a quick quip or a rant. But the things we say have a way of reinforcing how we think. So I suggest refining until what we say cuts to the core of the matter. The claim above is a false dichotomy. Let's put aside advertisements and hype. Trying to map between AI capabilities and human ones is complicated. There is high quality writing on this to be found. I recommend reading literature reviews on evals.

troupo · 2025-04-19T21:17:12 1745097432

[flagged]

drodgers · 2025-04-19T21:57:45 1745099865

Don’t be a dismissive dick; that’s not appropriate for this forum. The above post is clearly trying to engage thoughtfully and offers genuinely good advice.

troupo · 2025-04-20T09:38:04 1745141884

The above post produces some vague philosophical statements, and equally vague "juts google it" claims.

oxidant · 2025-04-20T00:43:53 1745109833

The grandparent is talking about how to control cost by focusing the tool. My response was to a comment about how that takes too much thinking.

If you give a junior an overly broad prompt, they are going to have to do a ton of searching and reading to find out what they need to do. If you give them specific instructions, including files, they are more likely to get it right.

I never said they were replacements. At best, they're tools that are incredibly effective when used on the correct type of problem with the right type of prompt.

tetha · 2025-04-19T19:23:52 1745090632

Mh. Like, I'm deeply impressed what these AI assistants can do by now. But, the list in the parent comment there is very similar to my mental check-list of pair-programming / pair-admin'ing with less experienced people.

I guess "context length" in AIs is what I intuitively tracked with people already. It can be a struggle to connect the Zabbix alert, the ticket and the situation on the system already, even if you don't track down all the zabbix code and scripts. And then we throw in Ansible configuring the thing, and then the business requriements by more, or less controlled dev-teams. And then you realize dev is controlled by impossible sales-terms.

These are scope -- or I guess context -- expansions that cause people to struggle.

sqs · 2025-04-19T16:11:41 1745079101

It's fundamentally hard. If you have an easy solution, you can go make a easy few billion dollars.

datavirtue · 2025-04-19T18:39:16 1745087956

GitHub copilot follows your context perfectly. I don't have to tell it anything about files. I tried this initially and it just screwed up the results.

xpe · 2025-04-19T21:05:30 1745096730

> GitHub copilot follows your context perfectly. I don't have to tell it anything about files. I tried this initially and it just screwed up the results.

Just to make sure we're on the same page. There are two things in play. First, a language model's ability to know what file you are referring to. Second, an assistant's ability to make sure the right file is in the context window. In your experience, how does Claude Code compare to Copilot w.r.t (1) and (2)?