AI holds promise for transforming scientific processes, including hypothesis generation. Prior work on hypothesis generation can be broadly categorized into theory-driven and data-driven approaches. While both have proven effective in generating novel and plausible hypotheses, it remains an open question whether they can complement each other. To address this, we develop the first method that combines literature-based insights with data to perform LLM-powered hypothesis generation. We apply our method on five different datasets and demonstrate that integrating literature and data outperforms other baselines (8.97\% over few-shot, 15.75\% over literature-based alone, and 3.37\% over data-driven alone). Additionally, we conduct the first human evaluation to assess the utility of LLM-generated hypotheses in assisting human decision-making on two challenging tasks: deception detection and AI generated content detection. Our results show that human accuracy improves significantly by 7.44\% and 14.19\% on these tasks, respectively. These findings suggest that integrating literature-based and data-driven approaches provides a comprehensive and nuanced framework for hypothesis generation and could open new avenues for scientific inquiry.

本研究解决了文献驱动与数据驱动的假设生成方法互补性的问题。提出了一种新方法，将文献洞察与数据相结合，利用大型语言模型（LLM）进行假设生成，实验证明此方法在多个数据集上表现优于传统方法。此外，首次通过人类评估验证了LLM生成假设在复杂决策任务中的有效性，显著提高了人类的判断准确率。这一研究为假设生成提供了更全面的框架，潜在推动科学研究的新方向。

文献与数据结合：假设生成的协同方法