Retrieve-and-Sample: Document-level Event Argument Extraction via Hybrid Retrieval Augmentation
https://papers.cool/venue/2023.acl-long.17@ACL
Authors: Yubing Ren ; Yanan Cao ; Ping Guo ; Fang Fang ; Wei Ma ; Zheng Lin
Summary: Recent studies have shown the effectiveness of retrieval augmentation in many generative NLP tasks. These retrieval-augmented methods allow models to explicitly acquire prior external knowledge in a non-parametric manner and regard the retrieved reference instances as cues to augment text generation. These methods use similarity-based retrieval, which is based on a simple hypothesis: the more the retrieved demonstration resembles the original input, the more likely the demonstration label resembles the input label. However, due to the complexity of event labels and sparsity of event arguments, this hypothesis does not always hold in document-level EAE. This raises an interesting question: How do we design the retrieval strategy for document-level EAE? We investigate various retrieval settings from the input and label distribution views in this paper. We further augment document-level EAE with pseudo demonstrations sampled from event semantic regions that can cover adequate alternatives in the same context and event schema. Through extensive experiments on RAMS and WikiEvents, we demonstrate the validity of our newly introduced retrieval-augmented methods and analyze why they work.
A: 这篇论文试图解决的问题是文档级事件论元提取(Document-level Event Argument Extraction, EAE)的挑战。文档级EAE的目标是从整个文档中提取信息丰富的事件核,并从中提取出事件论元,并将它们分配给正确的角色。这项任务对于信息检索、问答系统和事件图推理等下游应用至关重要。然而,由于事件标签的复杂性和事件论元的稀疏性,传统的基于相似性的检索方法在文档级EAE中并不总是有效。论文提出了一种名为“Retrieve-and-Sample”的方法,通过混合检索增强来改进文档级EAE。
具体来说,论文探讨了以下几个关键问题:
论文通过在RAMS和WikiEvents数据集上的广泛实验,验证了所提出的检索增强方法的有效性,并分析了这些方法为何有效。
A: 相关研究主要集中在文档级事件论元提取(Document-level Event Argument Extraction, EAE)和检索增强文本生成(Retrieval-Augmented Text Generation, RAG)两个领域。以下是一些与这篇论文相关的研究工作:
这些研究展示了在不同的自然语言处理(NLP)任务中,如何利用检索增强方法来提高模型的性能,特别是在处理复杂和稀疏数据时。这篇论文通过提出新的检索策略和混合检索增强方法,进一步推动了这一领域的研究。