This study introduces a dedicated model for the BRAINTEASER task (SemEval-2024 Task 9), a novel challenge designed to assess models' lateral thinking capabilities through sentence and word puzzles. Our model ranked first in sentence puzzle solving during the test phase, with an overall score of 0.98. We also evaluate ChatGPT on the same task, analyzing how variations in temperature settings affect its lateral thinking and problem-solving ability. Our findings indicate a notable performance gap between the dedicated model and ChatGPT, underscoring the potential of specialized approaches for enhancing creative reasoning in AI.

SemEval-2024 Task 9: Decoding Brainteasers, the Efficacy of Dedicated Models versus ChatGPT