BriefGPT.xyz
Apr, 2024
利用大型语言模型和视觉语言模型增强交互式图像检索的查询重写
Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models
HTML
PDF
Hongyi Zhu, Jia-Hong Huang, Stevan Rudinac, Evangelos Kanoulas
TL;DR
我们提出了一种互动式图像检索系统,结合了视觉语言模型和大型语言模型,通过用户反馈迭代改进查询,并利用无噪声的查询扩展提高检索准确性,在评估中获得了10%的召回率改善。
Abstract
image search
stands as a pivotal task in multimedia and computer vision, finding applications across diverse domains, ranging from internet search to medical diagnostics. Conventional
image search
systems operate
→