Ultrathink is a Claude Code a magic word

vunderba · 2025-04-19T23:41:59 1745106119

@dickfickling beat me to it, but ultrathink is already explicitly called out in the public Anthropic documentation:

"Ask Claude to make a plan for how to approach a specific problem. We recommend using the word "think" to trigger extended thinking mode, which gives Claude additional computation time to evaluate alternatives more thoroughly. These specific phrases are mapped directly to increasing levels of thinking budget in the system: "think" < "think hard" < "think harder" < "ultrathink." Each level allocates progressively more thinking budget for Claude to use."

https://www.anthropic.com/engineering/claude-code-best-pract...

I don't know what the max allowable "budget_tokens" is for Claude 3.7 Thinking mode, but the SDK shows an example of 32k which matches up with the article's findings.

simonw · 2025-04-20T00:07:28 1745107648

Looks like that documentation is incorrect. It suggests there are four levels - "think" < "think hard" < "think harder" < "ultrathink." - but if you look in the code there are actually only three.

m1keil · 2025-04-19T23:40:01 1745106001

I hope we will exit this stage of magic spells and incantations sooner rather than later.

Frummy · 2025-04-19T23:57:00 1745107020

I hope we delve deeper into pentacles and rites in candlelit basements to appease black boxes of neural mimicries of canaanite archetypes

4b11b4 · 2025-04-20T00:37:48 1745109468

I thought that earlier on, I don't think we will though

patcon · 2025-04-20T00:05:30 1745107530

Sincerely, I respect your response to how arbitrary it seems in this form.

But... I'd like you to take a moment and think really hard about whether this is truly novel behavior for LLMs, or rather something that has always been part of the interplay between inter-agent communication and intra-agent thought :)

bee_rider · 2025-04-20T00:35:44 1745109344

It sounds like it is a “specific phrase mapped directly” based on another comment here? I guess that means hardcoded? Not completely sure, though.

simonw · 2025-04-20T01:23:19 1745112199

It's hard-coded - this isn't a weird model thing, Claude Code detects the exact string "ultrathink" and sets the thinking token budget to 31999.

I included that de-obfuscated code in my post: https://simonwillison.net/2025/Apr/19/claude-code-best-pract...

zenkey · 2025-04-19T23:38:00 1745105880

It would be cool if these "secret keywords" were more directly exposed in the UI somehow, perhaps as a toggleable developer/experimental mode? I would have a lot of fun tinkering with them.

refulgentis · 2025-04-20T00:01:20 1745107280

It's for Claude Code FWIW, just leaving a sigil here for fellow API implementers who are confused: your general point stands (though I wonder about UI affordances other than text given it's a CLI tool)

sn9 · 2025-04-20T01:06:02 1745111162

Tengu think? As in Japanese Tengu?

layer8 · 2025-04-20T00:36:20 1745109380

I think I'll wait for Hyperthink.

wpollock · 2025-04-20T00:00:34 1745107234

Nice to know, although I was taught that the magic word is "please".

replwoacause · 2025-04-20T00:03:15 1745107395

This would be helpful information if I hadn’t already switched to Gemini 2.5 because it’s 96% cheaper

user3939382 · 2025-04-20T00:12:23 1745107943

After stunts like Amp and Web Integrity (among others) I don’t care what they charge, I want nothing to do with Google.

bn-l · 2025-04-20T00:32:06 1745109126

It does feel like a Faustian bargain using it.

fragmede · 2025-04-19T23:29:52 1745105392

Crazy that it's a key word that's implemented in the code that expands the context window, and that a light touch of reverse engineering was required to find it.

dickfickling · 2025-04-19T23:32:12 1745105532

It’s described here: https://www.anthropic.com/engineering/claude-code-best-pract...

fragmede · 2025-04-19T23:42:55 1745106175

Ah yes, the documentation. If everyone read documentation, we wouldn't need LLMs to read it for us!

canadiantim · 2025-04-19T22:53:37 1745103217

That very very quickly moved from blog to twitter to blog to HN. Gotta love the velocity of information these days

doubled112 · 2025-04-19T22:57:06 1745103426

Link first, ask questions later

sauravt · 2025-04-19T23:03:58 1745103838

megathink sounds better

dghlsakjg · 2025-04-19T23:27:48 1745105268

But paradoxically only allocates 1/3 the tokens according to the code.

Perhaps they should switch to the metric thinking system.

Gigathinking, and Terathinking should be on the menu as well.

benatkin · 2025-04-19T23:07:32 1745104052

And doublemegathink if you want it to do two megathinks in parallel

Terr_ · 2025-04-19T23:22:54 1745104974

Not to be confused with doublethink, a mode that is always active for LLMs.

andrewfromx · 2025-04-19T22:49:21 1745102961

I asked Claude if this was true, and Claude confirmed.