Research papers, repositories, and articles about game theory
The authors use token-level uncertainty to decide when an LLM should think longer in games like tic-tac-toe. Low entropy leads to a short context and brief reasoning, while high entropy triggers more examples and multiple reasoning paths.
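The gating idea can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the model exposes a probability distribution over next tokens at each step, and the function names, the 0.5-nat threshold, and the budget values are all hypothetical choices made for illustration.

```python
import math
from typing import Dict, Sequence


def token_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a single next-token distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def mean_entropy(step_probs: Sequence[Sequence[float]]) -> float:
    """Average token-level entropy over all generated steps."""
    if not step_probs:
        raise ValueError("need at least one token distribution")
    return sum(token_entropy(p) for p in step_probs) / len(step_probs)


def choose_budget(step_probs: Sequence[Sequence[float]],
                  threshold: float = 0.5) -> Dict[str, int]:
    """Pick a reasoning budget from the mean entropy.

    The threshold and the returned values are illustrative assumptions,
    not figures taken from the paper.
    """
    if mean_entropy(step_probs) < threshold:
        # Confident model: short context, a single reasoning path.
        return {"num_examples": 0, "num_paths": 1}
    # Uncertain model: more in-context examples, several reasoning paths.
    return {"num_examples": 4, "num_paths": 3}


# A peaked distribution (confident) versus a uniform one (uncertain).
confident = [[0.97, 0.01, 0.01, 0.01]]
uncertain = [[0.25, 0.25, 0.25, 0.25]]
print(choose_budget(confident))
print(choose_budget(uncertain))
```

In a real system the threshold would likely need tuning per model and per game, since entropy scales with vocabulary size and sampling temperature.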