Hacker News new | past | comments | ask | show | jobs | submit login

Or you could prove it does not reason by adversarially generating correct simple logic puzzles in the same class, with known answers. Rephrasijg the sentence structure, changing words in the thesaurus, slightly modifying the initial conditions should not produce invalid reasoning explanations or results.

Essentially the sufficiently complex text form of sudoku.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: