本期的 15 篇论文如下:
[00:25] 🔍 MMSearch-R1: Incentivizing LMMs to Search(MMSearch-R1:激励大型多模态模型进行搜索)
[00:59] 🚗 MADrive: Memory-Augmented Driving Scene Modeling(MADrive:基于记忆增强的驾驶场景建模)
[01:43] 🤖 WorldVLA: Towards Autoregressive Action World Model(WorldVLA:面向自回归动作世界模型)
[02:23] 💡 Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test(大型语言模型预训练中Grokking现象 কোথায়? 无需测试,监测从记忆到泛化的过程)
[03:14] 🤖 Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge(Mind2Web 2:使用Agent-as-a-Judge评估自主搜索)
[04:00] 🚗 SAM4D: Segment Anything in Camera and LiDAR Streams(SAM4D:相机和激光雷达流中的可分割一切)
[04:40] 🎨 FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing(FaSTA$^*$: 快速-慢速工具路径智能体,通过子程序挖掘实现高效的多轮图像编辑)
[05:16] 🤖 Whole-Body Conditioned Egocentric Video Prediction(全身条件下的自我中心视频预测)
[05:53] 🧠 Arch-Router: Aligning LLM Routing with Human Preferences(Arch-Router:将LLM路由与人类偏好对齐)
[06:35] 🎨 FairyGen: Storied Cartoon Video from a Single Child-Drawn Character(FairyGen:从单张儿童绘画生成故事驱动的卡通视频)
[07:12] 🌐 DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster(DiLoCoX:一种用于去中心化集群的低通信大规模训练框架)
[07:55] 🧬 An Agentic System for Rare Disease Diagnosis with Traceable Reasoning(基于Agent的罕见病诊断系统,具有可追溯的推理能力)
[08:35] 🤖 HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges(HeurAgenix:利用大型语言模型解决复杂组合优化难题)
[09:18] 🦘 Learning to Skip the Middle Layers of Transformers(学习跳过Transformer的中间层)
[09:57] 🎵 MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners(MuseControlLite:基于轻量级调节器的多功能音乐生成)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
정보
- 프로그램
- 주기매일 업데이트
- 발행일2025년 6월 28일 오전 12:00 UTC
- 길이11분
- 등급전체 연령 사용가