本期的 11 篇论文如下:
[00:25] 👓 AI for Service: Proactive Assistance with AI Glasses(AI服务:AI眼镜的主动式协助)
[01:06] 🎬 ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints(ImagerySearch:面向超越语义依赖约束的自适应测试时搜索视频生成)
[01:43] 🎯 LaSeR: Reinforcement Learning with Last-Token Self-Rewarding(LaSeR:基于末词元自奖励的强化学习)
[02:33] 🧩 TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar(TokDrift:当大模型用子词而代码用语法时)
[03:35] 🧠 Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents(基于信息增益的策略优化:一种简单有效的多轮LLM智能体训练方法)
[04:04] ⚡ Attention Is All You Need for KV Cache in Diffusion LLMs(扩散式大语言模型只需注意力即可搞定KV缓存)
[04:45] 🤥 When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA(当模型撒谎时我们反而学到东西:用PsiloQA实现跨语言细粒度幻觉检测)
[05:33] 📄 PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model(PaddleOCR-VL:以9亿参数超轻量多模态模型刷新多语言文档解析性能)
[06:13] 🧠 VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning(VR-Thinker:通过“边看边想”推理提升视频奖励模型)
[06:52] 📐 MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning(MathCanvas:面向多模态数学推理的内生视觉思维链)
[07:39] 🧠 COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes(COIG-Writer:高质量中文创意写作数据集,附带思维过程)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
Información
- Programa
- FrecuenciaCada día
- Publicado17 de octubre de 2025, 11:00 p.m. UTC
- Duración9 min
- ClasificaciónApto