You could run it on a single Mac Studio with M3 Ultra, or on two Mac Studios with M4 Max at even higher performance. And lightly quantizing this could give us modern dense models in the ~80 GB size range, which is a very compelling target.
It still wouldn't matter much. The M3 Ultra has 819 GB/s of unified memory bandwidth. That means the theoretical max token rate is 819/128 ≈ 6.4 t/s. At 80 GB (5-bit quantization), it's still only about 10 t/s ... far from a good coding experience. And those are theoretical maxima; real-world token generation rates would be at least 15-20% lower.
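The arithmetic above can be sketched quickly. This assumes single-stream decoding where every generated token streams the full set of weights from memory once, which is the usual back-of-the-envelope upper bound; the function name and efficiency factor are illustrative, not from any particular benchmark.

```python
# Bandwidth-bound decode rate: each token reads all weights from memory once,
# so the ceiling is bandwidth divided by model size in bytes.

def max_tokens_per_sec(bandwidth_gb_s: float, model_size_gb: float,
                       efficiency: float = 1.0) -> float:
    """Theoretical ceiling on tokens/sec, optionally derated for overhead."""
    return bandwidth_gb_s / model_size_gb * efficiency

M3_ULTRA_BW = 819  # GB/s unified memory bandwidth

print(max_tokens_per_sec(M3_ULTRA_BW, 128))      # ~6.4 t/s at 128 GB
print(max_tokens_per_sec(M3_ULTRA_BW, 80))       # ~10.2 t/s at 80 GB (5-bit)
print(max_tokens_per_sec(M3_ULTRA_BW, 80, 0.8))  # ~8.2 t/s with 20% overhead
```

The 15-20% derating covers KV-cache reads, scheduling, and other real-world losses on top of the pure weight-streaming bound.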
Assuming you logged in with OAuth, I am guessing you are trying to use gpt-5.5?
In my tests it worked with gpt-5.4, and I assumed gpt-5.5 isn't available to me because I'm on the free plan.
Do you have the subscription that allows 5.5? If so, I can look into what changed in the API. Sorry, I rarely use OpenAI, so it's a bit of an untrodden path.
There is no BF16. There is no FP8 for the instruct model. The instruct model at full precision is 160 GB (mixed FP4 and FP8). The base model at full precision is 284 GB (FP8). Almost everyone is going to use instruct. But I do love to see base models released.
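Those sizes are consistent with simple bytes-per-weight arithmetic. The parameter count below is an assumption inferred from the FP8 base size (~284 GB at 1 byte/weight implies roughly 284B parameters), and the ~4.5-bit average for the mixed FP4/FP8 instruct checkpoint is likewise an illustrative estimate, not a published figure.

```python
# Rough checkpoint-size arithmetic: GB ≈ params (billions) * bits_per_weight / 8.
# Parameter count and average bit width are assumptions for illustration.

def checkpoint_gb(params_b: float, bits_per_weight: float) -> float:
    """Approximate checkpoint size in GB for a dense model."""
    return params_b * bits_per_weight / 8

print(checkpoint_gb(284, 8))    # FP8 base: ~284 GB at 1 byte per weight
print(checkpoint_gb(284, 4.5))  # mixed FP4/FP8 averaging ~4.5 bits: ~160 GB
```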
I’ve been using Codex Pro since they lobotomized Opus 4.6. Codex is so much better, GPT 5.4 xhigh fast is definitely the smartest and fastest model available.
For a while there I had both Opus 4.6 and Codex access, and I frequently pitted them against each other; I never once saw Opus come out ahead. Opus was good as a reviewer, but as an implementer it just felt lazy compared to 5.4 xhigh.
One feature that I haven't seen discussed much is how Codex auto-reviews tool runs. No longer are you stuck with all-or-nothing confirmations or endless prompting; that's such a bad pattern.
Even in a week of heavy duty work and personal use I still haven’t been able to exhaust the usage on the $200 plan.
I'll probably change my mind when (not if) OpenAI rug-pulls, but for spring '26, Codex is definitely the better deal.
I also made the switch to OpenAI, the $20 plan, I dunno about "so much better" but it's more or less the same, which is great!
The models and tools levelling out is great for users because the cost of switching is basically nil. I'm reading people ITT saying they signed up for a year - big mistake. A year is a decade right now.
It really depends on what you're trying to do and what your skillset is.
But if you go information architecture first and have it codified in some way (especially if you already have the templates), then you can nudge any agent to go straight into CSS and it will produce something reasonable.