Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Whenever one of these well known gotcha prompts gets "solved" the question is always whether they actually solved the underlying reason it used to fail, or did they just have a bunch of third-world workers tag pictures of horses and astronauts until the model started handling that specific example more reliably. As the saying goes, every measure which becomes a target becomes a bad measure.


Well, what you do is try examples close to the problem space around it. For example a fiddler crab riding a chameleon

https://sora.com/g/gen_01jrbq91wtefjtpb8ceajdh9mt

and then iterate around other combinations to see if it's generalized or not.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: