For about a month now I've been paying $20-$30/day to delegate the bulk of my coding to Sonnet.
The agentic loop thats trained into it is just simply not matched by another other model.
I can't admit to myself there's any open question as to if there is any long-term value.
I expect within 2 years, this will seem like a non-controversial idea, and it won't bring in a ton of assumptions about the speaker.
I have invested much time and effort making sure local models are a peer to remote ones in my app, and none, including DeepSeek's local models, are remotely close to the things needed to make that flow work.
EDIT: Reply-throttled, so answering replies here:
- The machine is building the machine: Telosnex, a cross-platform Flutter app
- it can do 90% of the scope, especially after I wrote precanned instructions for doing e.g. property-based testing.
- Things it's done mostly wholesale:
-- secure iframe environment, on all 6 platforms, to: execute JS in, or render react components it wrote.
-- completely refactoring my llama.cpp inference to use non-deprecated APIs.
- Codebase is about 40K real lines of code. (I have to think this helps a lot I doubt that ex. from scratch it would be able to build a Flutter app that used llama.cpp.)
- $30/day!?! -- Yeah, it's crazy, its up an order of magnitude from my most busy days when I just copy-pasted back and forth. It reads as much code as it wants, and you're doing more work literally, so it adds up.
$20 to $30 a day? How?? I have been using Sonnet every day for a month and have spent a total of... $7 on OpenRouter (7/30 = 0.23 a day on average).
Granted, I ask it very specific questions that generate short answers (many of which are incorrect, btw), but still, it's difficult to imagine what kind of tasks, done by a single person, would generate such amounts?
Unfortunately, the current available version doesn't have the agent stuff yet.
Hopefully in a week, realistically two.
I had the existing client app I've released-but-not-released-out-loud. Couple days before Christmas, for fun, I spent a couple hours wiring up the Anthropic Model Context Protocol filesystem server example. Within an hour it was clear this was special and I needed to get it out ASAP. Stunning stuff in action.
The website for your editor (https://telosnex.com/) has some... character. However I do believe its worth a second look at making it look nicer, I know you aren't a designer and probably think you're going for a more "raw" and "friendly" look by not putting that much effort in and using conflicting fonts, colour schemes etc, and I agree there is a lot of value in avoiding corpo-internet styles, but I still think it could stand to look less like a mixture of ai sludge and poor photoshop jobs on the homepage.
I can't admit to myself there's any open question as to if there is any long-term value.
I expect within 2 years, this will seem like a non-controversial idea, and it won't bring in a ton of assumptions about the speaker.
I have invested much time and effort making sure local models are a peer to remote ones in my app, and none, including DeepSeek's local models, are remotely close to the things needed to make that flow work.
EDIT: Reply-throttled, so answering replies here:
- The machine is building the machine: Telosnex, a cross-platform Flutter app
- it can do 90% of the scope, especially after I wrote precanned instructions for doing e.g. property-based testing.
- Things it's done mostly wholesale: -- secure iframe environment, on all 6 platforms, to: execute JS in, or render react components it wrote. -- completely refactoring my llama.cpp inference to use non-deprecated APIs.
- Codebase is about 40K real lines of code. (I have to think this helps a lot I doubt that ex. from scratch it would be able to build a Flutter app that used llama.cpp.)
- $30/day!?! -- Yeah, it's crazy, its up an order of magnitude from my most busy days when I just copy-pasted back and forth. It reads as much code as it wants, and you're doing more work literally, so it adds up.
- $20/day is realistic average
- Lines added per day +55%, lines deleted per day +29%, files changed per day 9 -> 21 https://x.com/jpohhhh/status/1881453489852948561