Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

arXiv.org·[Submitted on 3 May 2026]·20 days ago

Authors: Zhuofeng Li, Haoxiang Zhang, Cong Wei, Pan Lu, Ping Nie, Yi Lu, Yuyang Bai, Shangbin Feng, Hangxiao Zhu, Ming Zhong, Yuyu Zhang, Jianwen Xie, Yejin Choi, James Zou, Jiawei Han, Wenhu Chen, Jimmy Lin, Dongfu Jiang, Yu Zhang

Abstract: Modern retrieval systems, whether lexical or semantic, expose a corpus through a fixed similarity interface that compresses access into a single top-k retrieval step before reasoning. This abstraction is efficient, but for agentic search, it becomes a bottleneck: exact lexical constraints, sparse clue conjunctions, local context checks, and multi-step hypothesis refinement are difficult to implement by calling a conventional off-the-shelf retriever, and evidence filtered out early cannot be recovered by stronger downstream reasoning.…
