Tree of Thoughts
Posted: Sun May 25, 2025 4:49 am
Reasoning and Acting (ReAct): This strategy taps into real-world information for reasoning in addition to data the LLM has been trained on. ReAct-based reasoningOpens in a new window is touted as more akin to a human-like task solving that involves interactive decision-making and verbal reasoning, leading to better error handling and lower hallucination rates. It synergizes reasoning and action through user action, which increases interpretability and trustworthiness of responses. This strategy is also referred to as a “stepwise planner” because it approaches problem-solving in a step-by-step manner and also seeks user feedback at every step.
(ToT): This variation of the Chain-of-Thought approach afghanistan phone number list generates multiple thoughts at each intermediate step. Instead of picking just one reasoning path, it explores and evaluates the current status of the environment with each step to actively look ahead or backtrack to make more deliberate decisions.
This strategy has been proven to significantly outperform CoT on complex tasks like math games, creative writing exercises, and mini-crossword puzzles. ToT reasoningOpens in a new window is deemed to be even closer to a human decision-making paradigm that explores multiple options, weighs pros and cons, and then picks the best one.
(ToT): This variation of the Chain-of-Thought approach afghanistan phone number list generates multiple thoughts at each intermediate step. Instead of picking just one reasoning path, it explores and evaluates the current status of the environment with each step to actively look ahead or backtrack to make more deliberate decisions.
This strategy has been proven to significantly outperform CoT on complex tasks like math games, creative writing exercises, and mini-crossword puzzles. ToT reasoningOpens in a new window is deemed to be even closer to a human decision-making paradigm that explores multiple options, weighs pros and cons, and then picks the best one.