Mixture-of-Experts models benefit from economies of scale: queries can be batched and processed in parallel, and different queries in a batch tend to hit different experts at a given layer, which raises GPU utilization. So unless your application is already getting a lot of traffic, you're probably under-utilizing your hardware.
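To make that concrete, here's a toy top-1 router in Python (numpy only; the gate matrix, shapes, and routing rule are made-up illustrations, not any real model's code). With a batch of one, a single expert fires and the rest of the layer's expert weights sit idle; as the batch grows, tokens spread across experts and more of the loaded parameters do useful work:

    import numpy as np

    rng = np.random.default_rng(0)
    n_experts = 8
    d_model = 16

    # Hypothetical router: one linear gate for the layer, top-1 routing.
    gate = rng.normal(size=(d_model, n_experts))

    def active_experts(batch_size: int) -> int:
        """Route a batch of token embeddings; count distinct experts hit."""
        x = rng.normal(size=(batch_size, d_model))
        logits = x @ gate               # (batch, n_experts) router scores
        choices = logits.argmax(axis=1) # top-1 expert per token
        return len(set(choices.tolist()))

    for bs in (1, 4, 32, 256):
        print(f"batch={bs:4d} -> {active_experts(bs)}/{n_experts} experts active")

At batch size 1 you pay to keep all 8 experts in memory but only 1 does work per layer; at batch size 256 nearly all of them are busy, which is the economy of scale in question.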
Interestingly, I can't get ChatGPT to help me find a video showing me how to disable the cellular modem on my 2024 Subaru Crosstrek. Time to do some old-fashioned research, I guess...
"Process-oriented" verification has been a thing for a while in mathematical reasoning CoT. Google had a paper about it last year [1]. The key term to look for is "Process-reward model." I particularly like RL Tango [2].
I think it has to do with mental models. If you already know what to write and it's reasonably complex, you'll have a mental model ready and can quickly write it down (now even faster, with LLMs autocompleting 3-4 lines at a time). While reading someone else's code, you have to constantly map it against the model in your head, and then also evaluate quality, security, and other issues on top of that.
I just tend to find LLM code output extremely easy to read, I guess. It tends to be verbose and do a lot of unnecessary stuff, but I can always get the point easily and edit accordingly.