I agree with you at this point. Even though Google is performing well on benchmarks and releasing impressive models like the Genie 3 world model, the Gemini CLI's suggestions and changes feel overly formulaic. Its priorities are almost those of an OCD coder who cares more about tabs vs. spaces than about building a useful feature. For example, in a recent project, Gemini CLI spent my entire token allotment for the day on trivial tasks like tweaking ESLint configs or modularizing code that didn't need modularization.
In contrast, Claude Code seems to interpret my prompts better and helps me ship real product features for users.
Maybe it's a system prompt issue, or maybe my prompting is causing the problem. Either way, Claude Code seems to understand my intent better.
It's how these models and their harnesses (e.g., the Claude Code JS program) are being trained together in the RL stages.
I think the harness software is now a very important part of the training process, which is why I think only frontier labs are capable of shipping "actual" agents.
Anthropic has figured something out here that others have not.
I'm being downvoted, but I don't have an agenda; I'm simply sharing my experience. If you're getting good results with Gemini CLI as an alternative to Claude Code, please let me know what you're doing to get that performance.
I'm impressed by Gemini 2.5 Pro's NLP capabilities, and I use that model in production on several projects. My comments are directed only at Gemini CLI, which FWIW is better than OpenAI's Codex CLI but much worse (for me) than Claude Code.
Even with Pro, the strict token limits combined with the model's tendency to add unrequested modifications mean I run out of tokens before completing my intended tasks. Others have the same issue:
https://github.com/google-gemini/gemini-cli/issues/4300
Perhaps this is the modern version of "every company ships its own org chart"? Maybe Gemini's priorities are those of a Google engineer, and Claude's are those of an engineer at Anthropic...
Thinking the same. I don't want a GitHub approval process to sit between me and the changes. The killer feature of Claude Code is being able to head it off as it starts down a bad path, and to code myself in between its steps.
Do you let juniors complete full features without asking questions, or do you make them check in when they get flustered?
I do want to try out some background agents, but given how frequently Cursor's frontier-model agents go off the rails despite having rules and context meant to avoid producing slop, I can't see background agents being generally useful yet.
o3 is a great oracle I use as well - in my dumb reddit/theater mode I mention that.
I'm building integrations for both Claude Code and AMP! AMP also provides some really important harness features that others haven't quite caught up on. OpenCode comes close, but it's driven in a bit of a cultish open-source way.
That said, Gemini is very powerful for its high-quality long-context capabilities: https://www.reddit.com/r/ClaudeAI/comments/1miweuv/comment/n...