Understanding Retrieval Augmentation for Long-Form Question Answering
https://papers.cool/arxiv/2310.12150
Authors: Hung-Ting Chen, Fangyuan Xu, Shane Arora, Eunsol Choi
Summary: We present a study of retrieval-augmented language models (LMs) on long-form question answering. We analyze how retrieval augmentation affects different LMs by comparing answers generated from the same evidence documents, and how the quality of the retrieved document set affects the answers generated by the same LM. We study various attributes of the generated answers (e.g., fluency, length, variance), with an emphasis on the attribution of long-form answers to the in-context evidence documents. We collect human annotations of answer attribution and evaluate methods for automatically judging attribution, identifying attribution patterns in long text generation and analyzing the main culprits of attribution errors. Together, our analysis reveals how retrieval augmentation impacts long, knowledge-rich text generation and provides directions for future work.
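Below is a minimal Python sketch of the two ingredients the study examines: retrieval-augmented prompting, where retrieved evidence documents are placed in-context ahead of the question, and automatic attribution judging, here approximated with an off-the-shelf NLI model. The helper names (`build_prompt`, `attribution_report`), the prompt template, and the choice of `microsoft/deberta-large-mnli` are illustrative assumptions, not the paper's exact setup; the LM call itself is left as a hypothetical stub.

```python
# A minimal sketch, assuming a Hugging Face NLI checkpoint as the
# attribution judge; "generate_answer" below stands in for any LM API.
from transformers import pipeline

# NLI model scoring whether an evidence passage entails an answer sentence.
nli = pipeline("text-classification", model="microsoft/deberta-large-mnli")

def build_prompt(question: str, evidence_docs: list[str]) -> str:
    """Place retrieved evidence documents in-context before the question."""
    docs = "\n\n".join(
        f"Document [{i}]: {d}" for i, d in enumerate(evidence_docs, start=1)
    )
    return f"{docs}\n\nQuestion: {question}\nAnswer:"

def attribution_report(answer: str, evidence_docs: list[str]) -> list[tuple[str, bool]]:
    """Sentence-level attribution: an answer sentence counts as attributable
    if at least one in-context evidence document entails it."""
    report = []
    for sent in (s.strip() for s in answer.split(". ") if s.strip()):
        entailed = any(
            nli({"text": doc, "text_pair": sent}, truncation=True)[0]["label"]
            == "ENTAILMENT"
            for doc in evidence_docs
        )
        report.append((sent, entailed))
    return report

# Usage (the generation step is hypothetical):
# prompt = build_prompt("Why is the sky blue?", retrieved_docs)
# answer = generate_answer(prompt)          # any LM of your choice
# for sentence, ok in attribution_report(answer, retrieved_docs):
#     print("ATTRIBUTED" if ok else "UNSUPPORTED", "|", sentence)
```

Sentence-level entailment against each in-context document is only a proxy for attribution; the paper compares such automatic judges against human attribution annotations.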
A: This paper studies the behavior of retrieval-augmented language models (LMs) on the task of long-form question answering (LFQA). Overall, it aims to build a deeper understanding of how retrieval-augmented LMs perform on LFQA, particularly with respect to answer generation and attribution, and of how improvements to the model and the retrieval system can raise the quality and reliability of the answers.
A: The paper also discusses prior work related to retrieval-augmented language models (LMs) and long-form question answering (LFQA).