本期的 10 篇论文如下:
[00:30] TOP1(🔥257) | 🚀 Group Sequence Policy Optimization(组序列策略优化)
[02:21] TOP2(🔥227) | 🧮 A Survey of Context Engineering for Large Language Models(大型语言模型上下文工程综述)
[03:33] TOP3(🔥207) | 🧠 GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning(GLM-4.1V-Thinking:基于可扩展强化学习的通用多模态推理)
[05:02] TOP4(🔥151) | 🎬 Scaling RL to Long Videos(强化学习驱动视觉语言模型扩展至长视频)
[06:57] TOP5(🔥144) | 🧠 MemOS: A Memory OS for AI System(MemOS:面向人工智能系统的内存操作系统)
[08:47] TOP6(🔥126) | 🎬 Kwai Keye-VL Technical Report(Kwai Keye-VL 技术报告)
[10:41] TOP7(🔥126) | 🎯 GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding(GUI-G$^2$: 基于高斯奖励模型的GUI定位)
[12:38] TOP8(🔥121) | 🤖 Agentic Reinforced Policy Optimization(智能体强化策略优化)
[14:21] TOP9(🔥120) | 🧮 MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization(MiroMind-M1:通过上下文感知多阶段策略优化实现数学推理的开源进展)
[15:53] TOP10(🔥118) | ⚡ $\nabla$NABLA: Neighborhood Adaptive Block-Level Attention(邻域自适应块级注意力)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
Information
- Show
- FrequencyUpdated daily
- Published4 August 2025 at 00:00 UTC
- Length18 min
- RatingClean