I give a simple ascii maze and ask it to give me the move to get out.
In 3-4 moves the most advanced models try to go through walls.
An alternative is providing all the tile relation to the other tiles.
This is because LLMs are bad at 2D text visualisation.
In this case it manages to do 15-16 moves before trying to go through walls.
An alternative is providing all the tile relation to the other tiles. This is because LLMs are bad at 2D text visualisation. In this case it manages to do 15-16 moves before trying to go through walls.