HuggingFace 每日AI论文速递

2025.10.17 | AI眼镜预判式服务;视频生成补想象力

本期的 11 篇论文如下:

[00:25] 👓 AI for Service: Proactive Assistance with AI Glasses(AI服务:AI眼镜的主动式协助)

[01:06] 🎬 ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints(ImagerySearch:面向超越语义依赖约束的自适应测试时搜索视频生成)

[01:43] 🎯 LaSeR: Reinforcement Learning with Last-Token Self-Rewarding(LaSeR:基于末词元自奖励的强化学习)

[02:33] 🧩 TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar(TokDrift:当大模型用子词而代码用语法时)

[03:35] 🧠 Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents(基于信息增益的策略优化:一种简单有效的多轮LLM智能体训练方法)

[04:04] ⚡ Attention Is All You Need for KV Cache in Diffusion LLMs(扩散式大语言模型只需注意力即可搞定KV缓存)

[04:45] 🤥 When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA(当模型撒谎时我们反而学到东西:用PsiloQA实现跨语言细粒度幻觉检测)

[05:33] 📄 PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model(PaddleOCR-VL:以9亿参数超轻量多模态模型刷新多语言文档解析性能)

[06:13] 🧠 VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning(VR-Thinker:通过“边看边想”推理提升视频奖励模型)

[06:52] 📐 MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning(MathCanvas:面向多模态数学推理的内生视觉思维链)

[07:39] 🧠 COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes(COIG-Writer:高质量中文创意写作数据集,附带思维过程)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递