It is also well-established that models internalize values, preferences, and dri...

cebert · 2025-05-23T02:36:12 1747967772

> AI coding agents have a strong drive to make tests green, and anyone who has used these tools has seen them cheat to achieve green tests.

As long as you hit an arbitrary branch coverage %, a lot of MBAs will be happy. No one said the tests have to provide value.

cortesoft · 2025-05-22T20:10:07 1747944607

I've seen a lot of humans cheat for green tests, too

whodatbo1 · 2025-05-22T22:40:44 1747953644

benchmaxing is the expectation ;)