
2025.10.21 | Models Don't Understand Light, Shadow, and Refraction; Small Models Can Write Reports Too

HuggingFace Daily AI Paper Digest (HuggingFace 每日AI论文速递)

The 13 papers in this episode are:

[00:21] 🪞 PICABench: How Far Are We from Physically Realistic Image Editing?

[01:04] 🤖 DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

[01:50] 🗜 Glyph: Scaling Context Windows via Visual-Text Compression

[02:23] 🔍 Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation

[03:10] 🔗 When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling

[04:09] 🎯 Annotation-Efficient Universal Honesty Alignment

[04:49] 🖌 Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback

[05:46] 👁 RL makes MLLMs see better than SFT

[06:33] 🚀 Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling

[07:09] 🎨 ConsistEdit: Highly Consistent and Precise Training-free Visual Editing

[07:56] 🔄 Deep Self-Evolving Reasoning

[08:22] 🧠 Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI

[09:07] 🔮 Chronos-2: From Univariate to Universal Forecasting

[Follow Us]

You can also find us on the following platform for more information beyond the podcast content:

Xiaohongshu (小红书): AI速递


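Show: HuggingFace Daily AI Paper Digest (HuggingFace 每日AI论文速递)
Frequency: Updated daily
Published: October 21, 2025, 23:00 UTC
Length: 10 minutes
Rating: Suitable for children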