HuggingFace Daily AI Paper Express

2025.08.06 | High-speed inference diffusion models; compact visual generation models

The 13 papers in this episode are:

[00:17] 🚀 Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

[00:39] 🎨 Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation

[01:05] 🎥 LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

[01:27] 🔍 CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

[01:51] 🚀 CRINN: Contrastive Reinforcement Learning for Approximate Nearest Neighbor Search

[02:13] 🔍 Tool-integrated Reinforcement Learning for Repo Deep Search

[02:36] 👥 Multi-human Interactive Talking Dataset

[03:04] 🧠 Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction

[03:39] 🧭 LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

[04:08] 🧩 LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer

[04:37] 📊 ChartCap: Mitigating Hallucination of Dense Chart Captioning

[05:03] 🛡 AlignGuard-LoRA: Alignment-Preserving Fine-Tuning via Fisher-Guided Decomposition and Riemannian-Geodesic Collision Regularization

[05:35] 🔍 TRACEALIGN -- Tracing the Drift: Attributing Alignment Failures to Training-Time Belief Sources in LLMs

[Follow Us]

You can also find us on the following platforms for more information beyond the podcast:

Xiaohongshu: AI速递 (AI Express)