Hacker News new | past | comments | ask | show | jobs | submit login

It isn't available over api yet, as far as I know. So it can't be really tested independently.



The comparisons I saw I think were manual, so it makes sense it can run a whole suite- these were just some basic prompts and showed the difference in how the produced output ran.




Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: