The article heavily quotes the "AI Security Institute" as a third-party analysis. It was the first I'd heard of them, so I looked up their about page, and it appears to be primarily people from the AI industry (former DeepMind/OpenAI staff, etc.), with nobody from the security industry mentioned. So while the security landscape is clearly evolving (cf. also Big Sleep and Project Zero), the conclusion that "to harden a system we need to spend more tokens" sounds like yet more AI boosting from a different angle. It raises the question of why no alternatives (like formal verification) are mentioned in the article or the AISI report.
I wouldn't be surprised if NVIDIA picked up this talking point to sell more GPUs.
I would be interested in which notable security researchers you can find to take the other side of this argument. I don't know anything about the "AI Security Institute", but they're saying something broadly mirrored by security researchers. From what I can see, the "debate" in the actual practitioner community is whether frontier models are merely as big a deal as fuzzing was, or something significantly bigger. Fuzzing was a profound shift in vulnerability research.
It's less that I think they would take the other side of the argument than that they would lend some credence to the content of the analysis. For example, I would not particularly trust a bunch of AI researchers to come up with a representative set of CTF tasks, which seems to be the basis of this analysis.
Yeah, you might be right about this particular analysis! The sense I have from talking to people at the labs is that they're really just picking deliberately diverse and high-profile targets to see what the models are capable of.
> but they're saying something broadly mirrored by security researchers.
You might well be right; it is not an area I know much about or work in. But I'm a fan of reliable sources for claims. It is far too easy to make general statements on the internet that appear authoritative.
I work in an esoteric compiler domain (compilers for fancy cryptography) and we've been eyeing e-graphs for a bit. This article is super helpful for seeing how they materialize in a real-world scenario.
An interesting move in this direction is the Tamagoyaki project: https://github.com/jumerckx/Tamagoyaki that supports equality saturation directly in MLIR.
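For anyone curious what this looks like in code: below is a minimal equality-saturation sketch using the Rust `egg` crate, closely mirroring egg's own tutorial example. The toy `SimpleLanguage` and rewrite rules are purely illustrative; they have nothing to do with Tamagoyaki or the article's compiler.

```rust
// Minimal equality-saturation demo with the `egg` crate
// (add `egg = "0.9"` to Cargo.toml).
use egg::{rewrite as rw, *};

// A toy arithmetic language: e-graph nodes are numbers, symbols,
// or binary +/* operators.
define_language! {
    enum SimpleLanguage {
        Num(i32),
        "+" = Add([Id; 2]),
        "*" = Mul([Id; 2]),
        Symbol(Symbol),
    }
}

fn main() {
    // Rewrite rules; ?a and ?b are pattern variables.
    let rules: &[Rewrite<SimpleLanguage, ()>] = &[
        rw!("commute-add"; "(+ ?a ?b)" => "(+ ?b ?a)"),
        rw!("commute-mul"; "(* ?a ?b)" => "(* ?b ?a)"),
        rw!("add-0"; "(+ ?a 0)" => "?a"),
        rw!("mul-1"; "(* ?a 1)" => "?a"),
    ];

    // Saturate: apply all rules until no new equalities appear (or a
    // limit is hit), keeping every equivalent form in one e-graph.
    let start: RecExpr<SimpleLanguage> = "(+ 0 (* 1 foo))".parse().unwrap();
    let runner = Runner::default().with_expr(&start).run(rules);

    // Extract the cheapest equivalent expression (AstSize = fewest nodes).
    let extractor = Extractor::new(&runner.egraph, AstSize);
    let (best_cost, best) = extractor.find_best(runner.roots[0]);
    println!("{} -> {} (cost {})", start, best, best_cost);
    // Expected output: (+ 0 (* 1 foo)) -> foo (cost 1)
}
```

The appeal over a classic destructive rewrite pipeline is that saturation sidesteps phase-ordering: every rule fires against every equivalent form, and the cost function only picks a winner at the end.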
Self-harm (especially when depicting minors) has special standards. The recent court ruling on child safety against Meta probably led directly to this decision.
Completely absurd. If it's not safe for children, just slap an age rating on it.
I don't like this trend of every technology assuming I'm a child who needs to be protected from the world while simultaneously assuming I'm an adult with infinite disposable income who must be shown ads all the time. This is insincere. Children need to be "protected" only when it's convenient and allows the platform to exercise unchecked control. Nobody is protecting children from ads, because that would be inconvenient.
To be clear, I'm not advocating for the behavior here, just explaining that, for most tech companies, the risk of liability is a huge motivator. Liability for poor use of ad targeting would induce similar behavior (and I think that'd be a win for everyone involved).