Query Rewriting for Retrieval-Augmented Large Language Models
https://papers.cool/arxiv/2305.14283
Authors: Xinbei Ma ; Yeyun Gong ; Pengcheng He ; Hai Zhao ; Nan Duan
Summary: Large Language Models (LLMs) play powerful, black-box readers in the retrieve-then-read pipeline, making remarkable progress in knowledge-intensive tasks. This work introduces a new framework, Rewrite-Retrieve-Read instead of the previous retrieve-then-read for the retrieval-augmented LLMs from the perspective of the query rewriting. Unlike prior studies focusing on adapting either the retriever or the reader, our approach pays attention to the adaptation of the search query itself, for there is inevitably a gap between the input text and the needed knowledge in retrieval. We first prompt an LLM to generate the query, then use a web search engine to retrieve contexts. Furthermore, to better align the query to the frozen modules, we propose a trainable scheme for our pipeline. A small language model is adopted as a trainable rewriter to cater to the black-box LLM reader. The rewriter is trained using the feedback of the LLM reader by reinforcement learning. Evaluation is conducted on downstream tasks, open-domain QA and multiple-choice QA. Experiments results show consistent performance improvement, indicating that our framework is proven effective and scalable, and brings a new framework for retrieval-augmented LLM.
A: 这篇论文试图解决的问题是如何提高检索增强型大型语言模型(LLMs)在知识密集型任务中的性能。具体来说,它关注于以下几个方面:
总的来说,这篇论文旨在通过引入查询重写步骤和可训练的重写器,来解决检索增强型LLMs在知识检索和理解方面的局限性,从而提高其在实际应用中的可靠性和实用性。
A: 这篇论文提到了多个与检索增强型大型语言模型(LLMs)相关的研究领域,主要包括以下几个方面: