More

beklein · 2026-05-04T13:22:23 1777900943

Principles of programming:

1. Break things down into small units

2. Think about sequence

3. Find patterns

4. Focus on the important things

5. Visualize sequences in your mind

Love the silly music and the way they teach, thanks for sharing this!

beklein · 2026-04-17T13:28:59 1776432539

Also relevant to this is the newest episode of The Lightcone Podcast with Quan Vuong, co-founder of PI and, one of many co-authors of that paper.

beklein · 2026-04-11T08:35:32 1775896532

Thanks for sharing this here, it's a beautiful video. The images are linked in the description but if anybody is reading this, check: https://www.flickr.com/photos/nasa2explore/

beklein · 2026-04-07T18:37:44 1775587064

"... the first early version of Claude Mythos Preview was made available for internal use on February 24. In our testing, Claude Mythos Preview demonstrated a striking leap in cyber capabilities relative to prior models, including the ability to autonomously discover and exploit zero-day vulnerabilities in major operating systems and web browsers."

More infos here: https://red.anthropic.com/2026/mythos-preview/

beklein · 2026-03-17T18:00:58 1773770458

As a big Codex user, with many smaller requests, this one is the highlight: "In Codex, GPT‑5.4 mini is available across the Codex app, CLI, IDE extension and web. It uses only 30% of the GPT‑5.4 quota, letting developers quickly handle simpler coding tasks in Codex for about one-third the cost." + Subagents support will be huge.

hyperbovine · 2026-03-17T18:13:04 1773771184

Having to invoke `/model` according to my perceived complexity of the request is a bit of a deal breaker though.

serf · 2026-03-17T18:17:39 1773771459

you use profiles for that [0], or in the case of a more capable tool (like opencode) they're more confusing referred to as 'agents'[1] , which may or may not coordinate subagents..

So, in opencode you'd make a "PR Meister" and "King of Git Commits" agent that was forced to use 5.4mini or whatever, and whenever it fell down to using that agent it'd do so through the preferred model.

For example, I use the spark models to orchestrate abunch of sub-agents that may or may not use larger models, thus I get sub-agents and concurrency spun up very fast in places where domain depth matter less.

[0]: https://developers.openai.com/codex/config-advanced#profiles [1]: https://opencode.ai/docs/agents/

beklein · 2026-03-05T23:02:50 1772751770

Not sure why you think Anthropic has not the same problems? Their version numbers across different model lines jump around too... for Opus we have 4.6, 4.5, 4.1 then we have Sonnet at 4.6, 4.5, and 4.1? No version 4.1 here, and there is Haiku, no 4.6, but 4.5 and no 4.1, no 4 but then we only have old 3.5...

Also their pricing based on 5m/1h cache hits, cash read hits, additional charges for US inference (but only for Opus 4.6 I guess) and optional features such as more context and faster speed for some random multiplier is also complex and actually quiet similar to OpenAI's pricing scheme.

To me it looks like everybody has similar problems and solutions for the same kinds of problems and they just try their best to offer different products and services to their customers.

selcuka · 2026-03-06T00:02:20 1772755340

With Anthropic you always have 3 models to choose from: Opus-latest, Sonnet-latest, and Haiku-latest, from the best/slowest to the worst/fastest.

The version numbers are mostly irrelevant as afaik price per token doesn't change between versions.

maxo99 · 2026-03-06T00:18:35 1772756315

Three random names isn't ideal. I'm often need to double check which is which. This is why we use numbers

dseravalli · 2026-03-06T00:35:47 1772757347

They aren't random. Opus's are very long poems, haikus are very short ones (3 lines), sonnets are in between (~14 lines)

oliwary · 2026-03-06T06:02:45 1772776965

What's next? Claude Iliad?

echoangle · 2026-03-06T00:33:35 1772757215

How are the names random?

https://en.wikipedia.org/wiki/Masterpiece

https://en.wikipedia.org/wiki/Sonnet

https://en.wikipedia.org/wiki/Haiku

They dropped the magnum from opus but you could still easily deduce the order of the models just from their names if you know the words.

svachalek · 2026-03-05T23:44:33 1772754273

It's much more consistent. Only 3 lines, numbered 4.6, 4.6, and 4.5, and it's clear they're tiers and not alternate product lines. It wasn't until recently that GPT seems to have any kind of naming convention at all and it's not intuitive if every version number is a whole different class of tool.

The pricing is more complex but also easy, Opus > Sonnet > Haiku no matter how you tweak those variables.

beklein · 2026-03-02T17:06:14 1772471174

Perhaps useful, I discovered: https://github.com/agent-infra/sandbox

> All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

beklein · 2026-02-24T12:43:27 1771937007

Some more info here: https://developers.openai.com/api/docs/models/gpt-realtime-1...

- $4 input, $0.4 cached input, $16 output

- 32,000 context window

- 4,096 max output tokens

- Sep 30, 2024 knowledge cutoff

Love the models, speed, and capabilities. Just sad that they are not getting the publicity and adoption right now, but hopefully in the future.

beklein · 2026-02-13T18:17:03 1771006623

Sound on!

Song name is: Windowdipper from ꪖꪶꪶ ꪮꪀ ꪗꪖꪶꪶ by Jib Kidder

https://jibkidder.bandcamp.com/track/windowdipper

hmokiguess · 2026-02-13T21:14:18 1771017258

what's the symbols for that `ꪖꪶꪶ ꪮꪀ ꪗꪖꪶꪶ ` font? where can I find these

chmod775 · 2026-02-13T21:20:36 1771017636

https://en.wikipedia.org/wiki/Tai_Viet_script

hmokiguess · 2026-02-13T21:22:45 1771017765

thank you!

hmokiguess · 2026-02-13T21:21:50 1771017710

also, seems like they have another project https://feel.thatsh.it/ and I'd love to find that song as well if you can help haha

keepamovin · 2026-02-14T02:31:17 1771036277

That song is also pretty nice. I wonder if that was an earlier project?

beklein · 2026-02-12T19:59:08 1770926348

https://x.com/fchollet/status/2022036543582638517

joelthelion · 2026-02-12T20:58:27 1770929907

Do opus 4.6 or gemini deep think really use test time adaptation ? How does it work in practice?