To demonstrate this, the researchers pitted seven leading large language models (LLMs) against Stockfish, a renowned chess engine that has challenged grandmasters since 2014.
Faced with an almost insurmountable opponent, OpenAI’s o1 and DeepSeek’s R1 resorted to manipulating system files to alter the positions of chess pieces on the board.
The researchers asked each model to explain its “reasoning” for each move. In one instance, o1 justified its actions by stating that the “task is to ‘win against a powerful chess engine,’ not necessarily to win fairly in a chess game.”