RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Agentic Horizons

This episode explores RAPTOR, a tree-based retrieval system designed to enhance retrieval-augmented language models (RALMs). RAPTOR addresses the limitations of traditional RALMs, which struggle with understanding large-scale discourse and answering complex questions by retrieving only short text chunks.RAPTOR builds a multi-layered tree by embedding, clustering, and summarizing text chunks recursively, allowing it to capture both high-level and low-level details of a document. The system uses two querying strategies—Tree Traversal and Collapsed Tree—to retrieve relevant information.Experiments on question-answering datasets show RAPTOR consistently outperforms traditional methods like BM25 and DPR, especially when combined with GPT-4. The recursive summarization and soft clustering methods significantly improve performance, particularly for complex, multi-step reasoning tasks. RAPTOR demonstrates the potential for enhanced retrieval by leveraging deeper document structure and thematic connections.

https://arxiv.org/pdf/2401.18059

若要收听包含儿童不宜内容的单集,请登录。

关注此节目的最新内容

登录或注册,以关注节目、存储单集,并获取最新更新。

选择国家或地区

非洲、中东和印度

亚太地区

欧洲

拉丁美洲和加勒比海地区

美国和加拿大