REPLUG: Retrieval-Augmented Black-Box Language Models
https://papers.cool/arxiv/2301.12652
Authors: Weijia Shi ; Sewon Min ; Michihiro Yasunaga ; Minjoon Seo ; Rich James ; Mike Lewis ; Luke Zettlemoyer ; Wen-tau Yih
Summary: We introduce REPLUG, a retrieval-augmented language modeling framework that treats the language model (LM) as a black box and augments it with a tuneable retrieval model. Unlike prior retrieval-augmented LMs that train language models with special cross attention mechanisms to encode the retrieved text, REPLUG simply prepends retrieved documents to the input for the frozen black-box LM. This simple design can be easily applied to any existing retrieval and language models. Furthermore, we show that the LM can be used to supervise the retrieval model, which can then find documents that help the LM make better predictions. Our experiments demonstrate that REPLUG with the tuned retriever significantly improves the performance of GPT-3 (175B) on language modeling by 6.3%, as well as the performance of Codex on five-shot MMLU by 5.1%.
A: 这篇论文介绍了REPLUG(Retrieve and Plug),这是一个增强黑盒语言模型(LM)的框架,通过将其与可调的检索模型相结合。REPLUG旨在解决以下问题:
总的来说,REPLUG框架试图通过检索增强来克服大型黑盒语言模型的局限性,提高其在语言建模和理解任务中的性能,同时保持模型的灵活性和适用性。
A: 相关研究主要集中在以下几个方面: