BriefGPT.xyz
Jun, 2024
深度研究逻辑推理与LLM:工具选择的重要性
A Closer Look at Logical Reasoning with LLMs: The Choice of Tool Matters
HTML
PDF
Long Hei Matthew Lam, Ehsan Shareghi
TL;DR
通过将大型语言模型 (LLMs) 与各种符号求解器相结合,我们对 Z3、Pyke 和 Prover9 三个符号求解器的性能进行实验证明,其中与 LLMs 相结合时,Pyke 的性能明显低于 Prover9 和 Z3,Z3 的总体准确性略高于 Prover9,但 Prover9 能够处理更多问题。
Abstract
logical reasoning
serves as a cornerstone for human cognition. Recently, the emergence of Large Language Models (LLMs) has demonstrated promising progress in solving
logical reasoning
tasks effectively. To improv
→