HuggingFace 每日AI论文速递

【月末特辑】7月最火AI论文 | GSPO稳训练;序列级裁剪降方差;上下文工程综述,动态拼装信息流

本期的 10 篇论文如下:

[00:30] TOP1(🔥257) | 🚀 Group Sequence Policy Optimization(组序列策略优化)

[02:21] TOP2(🔥227) | 🧮 A Survey of Context Engineering for Large Language Models(大型语言模型上下文工程综述)

[03:33] TOP3(🔥207) | 🧠 GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning(GLM-4.1V-Thinking:基于可扩展强化学习的通用多模态推理)

[05:02] TOP4(🔥151) | 🎬 Scaling RL to Long Videos(强化学习驱动视觉语言模型扩展至长视频)

[06:57] TOP5(🔥144) | 🧠 MemOS: A Memory OS for AI System(MemOS:面向人工智能系统的内存操作系统)

[08:47] TOP6(🔥126) | 🎬 Kwai Keye-VL Technical Report(Kwai Keye-VL 技术报告)

[10:41] TOP7(🔥126) | 🎯 GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding(GUI-G$^2$: 基于高斯奖励模型的GUI定位)

[12:38] TOP8(🔥121) | 🤖 Agentic Reinforced Policy Optimization(智能体强化策略优化)

[14:21] TOP9(🔥120) | 🧮 MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization(MiroMind-M1:通过上下文感知多阶段策略优化实现数学推理的开源进展)

[15:53] TOP10(🔥118) | ⚡ $\nabla$NABLA: Neighborhood Adaptive Block-Level Attention(邻域自适应块级注意力)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递