2025.06.27 | 强化学习提升搜索效率;记忆增强生成逼真驾驶场景。

HuggingFace 每日AI论文速递

本期的 15 篇论文如下:

[00:25] 🔍 MMSearch-R1: Incentivizing LMMs to Search(MMSearch-R1:激励大型多模态模型进行搜索)

[00:59] 🚗 MADrive: Memory-Augmented Driving Scene Modeling(MADrive:基于记忆增强的驾驶场景建模)

[01:43] 🤖 WorldVLA: Towards Autoregressive Action World Model(WorldVLA:面向自回归动作世界模型)

[02:23] 💡 Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test(大型语言模型预训练中Grokking现象 কোথায়? 无需测试,监测从记忆到泛化的过程)

[03:14] 🤖 Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge(Mind2Web 2:使用Agent-as-a-Judge评估自主搜索)

[04:00] 🚗 SAM4D: Segment Anything in Camera and LiDAR Streams(SAM4D:相机和激光雷达流中的可分割一切)

[04:40] 🎨 FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing(FaSTA$^*$: 快速-慢速工具路径智能体,通过子程序挖掘实现高效的多轮图像编辑)

[05:16] 🤖 Whole-Body Conditioned Egocentric Video Prediction(全身条件下的自我中心视频预测)

[05:53] 🧠 Arch-Router: Aligning LLM Routing with Human Preferences(Arch-Router:将LLM路由与人类偏好对齐)

[06:35] 🎨 FairyGen: Storied Cartoon Video from a Single Child-Drawn Character(FairyGen:从单张儿童绘画生成故事驱动的卡通视频)

[07:12] 🌐 DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster(DiLoCoX:一种用于去中心化集群的低通信大规模训练框架)

[07:55] 🧬 An Agentic System for Rare Disease Diagnosis with Traceable Reasoning(基于Agent的罕见病诊断系统,具有可追溯的推理能力)

[08:35] 🤖 HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges(HeurAgenix:利用大型语言模型解决复杂组合优化难题)

[09:18] 🦘 Learning to Skip the Middle Layers of Transformers(学习跳过Transformer的中间层)

[09:57] 🎵 MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners(MuseControlLite:基于轻量级调节器的多功能音乐生成)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

To listen to explicit episodes, sign in.

Stay up to date with this show

Sign in or sign up to follow shows, save episodes and get the latest updates.

Select a country or region

Africa, Middle East, and India

Asia Pacific

Europe

Latin America and the Caribbean

The United States and Canada