BriefGPT.xyz
Nov, 2024
大型视觉-语言模型与行人重识别的结合
When Large Vision-Language Models Meet Person Re-Identification
HTML
PDF
Qizao Wang, Bin Li, Xiangyang Xue
TL;DR
本研究旨在解决大型视觉-语言模型(LVLMs)在行人重识别(ReID)中有效应用的挑战。提出的LVLM-ReID框架利用语义引导交互模块生成行人的语义标记,增强了行人身份特征的提取和识别。该方法在多个基准测试中表现优异,表明LVLM生成的语义在促进行人重识别方面具有重要潜力,并为未来的研究指明了方向。
Abstract
Large
Vision-Language Models
(LVLMs) that incorporate visual models and Large Language Models (LLMs) have achieved impressive results across various
Cross-modal
understanding and reasoning tasks. In recent years,
→