

taneq 11 months ago | on: Strengthening AI Agent Hijacking Evaluations

Maybe we should work on solving that problem, then? And maybe this is what working on that problem looks like?

Eridrus 11 months ago

Eval sets are not an appropriate tool for evaluating progress on security problems since the bar here is 100% correctness in the face of sustained targeted adversarial effort.

This work largely resembles the Politician's syllogism; it's something, but it's not actually addressing the problem.
