🚀Shippingscore 86.6May 15, 2026·2605.16217cs.CLcs.AIcs.IR

Argus: Evidence Assembly for Scalable Deep Research Agents

Zhen Zhang, Liangcai Su, Zhuo Chen, Xiang Lin, Haotian Xu, Simon Shaolei Du, Kaiyu Yang, Bo An, Lidong Bing, Xinyu Wang

Narrative

Argus reframes deep research agents around evidence assembly rather than parallel redundancy. Instead of running many independent search rollouts that duplicate answers and bloat context windows, a Navigator agent maintains a shared evidence graph, identifies missing pieces, and dispatches Searcher agents to fill gaps. Both components are built on a 35B-A3B MoE backbone; the Navigator is trained with RL to verify, dispatch, and synthesize. With 8 parallel Searchers it adds 12.7 points averaged across 8 benchmarks, and with 64 Searchers hits 86.2 on BrowseComp — reportedly above all proprietary agents tested — while keeping the Navigator's context under 21.5K tokens.

No production traction yet. The GitHub references are all paper-tracking bots and RSS aggregators, not implementations. Zero citations on Semantic Scholar. Worth watching as a design pattern for multi-agent search pipelines, but nothing is shipping from this work at the moment.

Abstract

Deep research agents have achieved remarkable progress on complex information seeking tasks. Even long ReAct style rollouts explore only a single trajectory, while recent state of the art systems scale inference time compute via parallel search and aggregation. Yet deep research answers are composed of complementary pieces of evidence, which parallel rollouts often duplicate rather than complete, yielding diminishing returns while pushing the aggregation context toward the model's limit. We propose Argus, an agentic system in which a Searcher and a Navigator cooperate to treat deep research as assembling a jigsaw from complementary evidence pieces, rather than brute forcing the whole answer in parallel. The Searcher collects evidence traces for a given sub-query through ReAct-style interaction. The Navigator maintains a shared evidence graph, verifying which pieces are still missing, dispatching Searchers to gather them, and reasoning over the completed graph to produce a source-traced final answer. We train the Navigator with reinforcement learning to verify, dispatch, and synthesize, while independently training the Searcher to remain a standard ReAct agent. The resulting Navigator supports rollouts with a single Searcher or many in parallel without retraining. With both Searcher and Navigator built on a 35B-A3B MoE backbone, Argus gains 5.5 points with a single Searcher and 12.7 points with 8 parallel Searchers, averaged over eight benchmarks. With 64 Searchers it reaches 86.2 on BrowseComp, surpassing every proprietary agent we benchmark, while the Navigator's reasoning context stays under 21.5K tokens.

Citation timeline

Not enough citation snapshots yet to plot a timeline. Come back after a few cron runs.

Signal

Stars: 123
Repos: 23
Citations: 0
Velocity: 0.00/d

GitHub repos (20)

CSQianDong/Awesome-arXiv-Daily-Reporter⭐ 47
“{'arxiv_id': 'arXiv:2605.15299', 'title': 'Fortress: A Case Study in Stabilizing Search Recommendations via Temporal Data Augmentation and Feature Pruning', 'authors': 'Milind Pandurang Jagre, Jia Huang, Dayvid V. R. Oliveira, Zhinan Cheng, Babak Seyed Aghazadeh, Puja Das, Chris ”
wwd29/arxiv-daily⭐ 21
“<ul> <li><strong>Authors: </strong>Zhen Zhang, Liangcai Su, Zhuo Chen, Xiang Lin, Haotian Xu, Simon Shaolei Du, Kaiyu Yang, Bo An, Lidong Bing, Xinyu Wang</a></li> <li><strong>Subjects: </strong>cs.CL, cs.AI, cs.IR</a></li> <li><strong>Abstract URL: </strong><a href="https://arxi”
ZenAlexa/agi-brief-history⭐ 11
“- **Summary**: Aphasias, selective language impairments which can arise from brain damage, reveal the functional organization of human language by providing causal links between affected brain regions and specific symptom profiles. Drawing on this literature, we introduce an apha”
ehijano/rss_fetch⭐ 11
“ </item> <item> <title>Argus: Evidence Assembly for Scalable Deep Research Agents</title> <link>https://arxiv.org/abs/2605.16217</link> <description>arXiv:2605.16217v1 Announce Type: cross Abstract: Deep research agents have achieved remarkable progress ”
lonePatient/lonePatient.github.io⭐ 9
“{% hideToggle 点击查看摘要 %} {% note blue no-icon %} ID-19-Argus: Evidence Assembly for Scalable Deep Research Agents {% endnote %} **链接**: https://arxiv.org/abs/2605.16217 **作者**: Zhen Zhang,Liangcai Su,Zhuo Chen,Xiang Lin,Haotian Xu,Simon Shaolei Du,Kaiyu Yang,Bo An,Lidong Bing,Xin”
jyyang621/DailyArXiv⭐ 8
“| **Title** | **Date** | **Comment** | | --- | --- | --- | | **[FORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast](https://arxiv.org/abs/2605.16233v1)** | 2026-05-15 | | | **[Argus: Evidence Assembly for Scalable Deep Research Agents](https://arxi”
Guesswhat-Studio/Linnet⭐ 8
“ "abstract": "Argus addresses the inefficiency of current deep research agents by treating evidence gathering as a jigsaw puzzle. Instead of parallelizing redundant searches, its Searcher collects evidence for sub-queries, while a Navigator manages a shared graph, identi”
2shin0/arxiv-ai-mailing⭐ 6
“ ## 60. Argus: Evidence Assembly for Scalable Deep Research Agents - **Authors**: Zhen Zhang , Liangcai Su , Zhuo Chen , Xiang Lin , Haotian Xu , Simon Shaolei Du , Kaiyu Yang , Bo An , Lidong Bing , Xinyu Wang - **URL**: [https://arxiv.org/abs/2605.16217](https://arxiv.org/abs/2”
Zhanli-Li/Zhanli-Li.github.io⭐ 2
“作者：Zhen Zhang、Liangcai Su、Zhuo Chen、Xiang Lin、Haotian Xu、Simon Shaolei Du、Kaiyu Yang、Bo An、Lidong Bing、Xinyu Wang 机构：MiroMind AI 日期：2026-05-15 链接：[arXiv](https://arxiv.org/abs/2605.16217)，[arXiv HTML](https://arxiv.org/html/2605.16217) 一句话核心 idea：Argus 把 deep research 看成证据”
Jinsu-L/DailyIR⭐ 0
“- **LLM Score**: 8 - **Keyword Score**: 6 - **Authors**: Zhen Zhang, Liangcai Su, Zhuo Chen, Xiang Lin, Haotian Xu, Simon Shaolei Du, Kaiyu Yang, Bo An, Lidong Bing, Xinyu Wang - **URL**: <http://arxiv.org/abs/2605.16217v1> - **Submitted**: 2026-05-15 17:29:27 - **Topic Keywords*”
jmk9/rnews⭐ 0
“ "id": "arxiv:2605.16217", "source": "arxiv", "title": "Argus: Evidence Assembly for Scalable Deep Research Agents", "url": "http://arxiv.org/abs/2605.16217v1", "summary": "Deep research agents have achieved remarkable progress on complex information seeking ta”
hamishivi/hamish-reads⭐ 0
“ "cs.IR" ], "published": "2026-05-18T00:00:00-04:00", "abs_url": "https://arxiv.org/abs/2605.16217", "pdf_url": "https://arxiv.org/pdf/2605.16217", "is_author_match": false, "relevance_score": 8.0,”
bspiegel27/bst_236_website⭐ 0
“ <entry> <id>http://arxiv.org/abs/2605.16217v1</id> <title>Argus: Evidence Assembly for Scalable Deep Research Agents</title>”
mghnasiri/PORID⭐ 0
“ "authors": "Zhen Zhang, Liangcai Su, Zhuo Chen", "url": "http://arxiv.org/abs/2605.16217v1", "date": "2026-05-15"”
mickdur/tech-watch⭐ 0
“ "https://arxiv.org/abs/2605.16205": "2026-05-18T07:51:44.206446+00:00", "https://arxiv.org/abs/2605.16207": "2026-05-18T07:51:44.206446+00:00", "https://arxiv.org/abs/2605.16215": "2026-05-18T07:51:44.206446+00:00", "https://arxiv.org/abs/2605.16217": "2026-05-18T07:51:44”
Jgraf30/Retrieval-Augmented-Generation-RAG-⭐ 0
“ "title": "Argus: Evidence Assembly for Scalable Deep Research Agents", "source": "arxiv", "date": "2026-05-15T17:29:27Z", "page_url": "https://arxiv.org/abs/2605.16217v1", "pdf_url": "https://arxiv.org/pdf/2605.16217v1", ”
pchaganti/pchaganti.github.io⭐ 0
“ { "title": "Argus: Evidence Assembly for Scalable Deep Research Agents", "summary": "Deep research agents have achieved remarkable progress on complex information seeking tasks. Even long ReAct style rollouts explore only a single trajectory, while recent state of”
sirichen2/sirichen2.github.io⭐ 0
“ <span class="paper-flag">Matched to your radar</span> <h2><a href="https://arxiv.org/abs/2605.16217v1" target="_blank" rel="noopener noreferrer">Argus: Evidence Assembly for Scalable Deep Research Agents</a></h2> </div>”
Varelser/varelser.github.io⭐ 0
“ <article class="digest-article" data-month="2026-05" data-genre="Agents / Reasoning" data-field="cs.CL" data-source="arXiv cs.AI" data-area="AI" data-categories="cs.CL, cs.AI, cs.IR" data-title="Argus: 大規模リサーチエージェントのための証拠収集システム" data-timestamp="1778866167000" data-url="https://a”
brianbaldock/aigregator⭐ 0
“<p><em>5 stories · cred-weighted 🟡 +0.0</em></p> <ul> <li>4 🔬 🟡 ▤×1 🏷️ agents, memory <strong>FORGE: self-evolving agent memory via population broadcast, no weight updates.</strong> Staged population-based protocol evolves prompt-injected natural-language memory; agents impro”