2025.10.29 | 通义深度研究报告；小模型折记忆胜671B巨模型

本期的 10 篇论文如下：

[00:23] 🔍 Tongyi DeepResearch Technical Report（通义深度研究报告：面向长程深度信息检索任务的智能体大模型）

[01:00] 🧠 AgentFold: Long-Horizon Web Agents with Proactive Context Management（AgentFold：面向长程任务的主动式上下文管理智能体）

[01:36] 🤖 RoboOmni: Proactive Robot Manipulation in Omni-modal Context（RoboOmni：全模态上下文下的主动机器人操作）

[02:33] 🎮 Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents（Game-TARS：面向可扩展通才多模态游戏智能体的预训练基础模型）

[03:05] 🎬 Uniform Discrete Diffusion with Metric Path for Video Generation（面向视频生成的度量路径均匀离散扩散模型）

[03:42] 🛠 OSWorld-MCP: Benchmarking MCP Tool Invocation In Computer-Use Agents（OSWorld-MCP：评测计算机代理调用MCP工具能力的基准）

[04:28] 🎨 Group Relative Attention Guidance for Image Editing（基于群组相对注意力引导的图像编辑方法）

[05:14] 🚀 WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking（WebLeaper：通过富信息搜索赋能网络智能体效率与效能）

[06:04] 🧭 Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance（MoE路由关乎成败：显式路由引导扩散Transformer扩容）

[07:01] 🧠 ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking（并行缪斯：面向深度信息搜寻的主体化并行思考）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递