Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Augmenting the multi-step reasoning abilities of Large Language Models (LLMs) has been a persistent challenge. Recently, verification has shown promise in improving solution consistency by evaluating generated outputs. However, current verification approaches suffer from sampling inefficiencies, requiring a large number of samples to achieve satisfactory performance. Additionally, training an effective verifier often depends on extensive …
Read more “Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo”