Mar, 2025
DeepRAG: Building a Custom Hindi Embedding Model for Retrieval Augmented Generation from Scratch
Nandakishor M
TL;DR
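This paper presents DeepRAG, an embedding model built specifically for Hindi in retrieval-augmented generation systems. By creating Hindi embeddings from scratch, the work addresses the weak performance of existing multilingual models on Hindi retrieval tasks. The result is a 23% improvement in retrieval precision over multilingual models, which demonstrates the importance of language-specific models.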
Abstract
In this paper, I present our work on DeepRAG, a specialized embedding model we built specifically for the Hindi language in RAG systems. While LLMs have gotten really good at generating text, their performance in ret