HuggingFace 每日AI论文速递

2025.08.04 | 扩散语言模型变长去噪,高效省资源;PixNerd图像扩散,高效高质量。

本期的 11 篇论文如下:

[00:22] 🔄 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models(超越固定长度:扩散大语言模型的可变长度去噪)

[00:44] 🎨 PixNerd: Pixel Neural Field Diffusion(PixNerd:像素神经场扩散)

[01:11] 💡 SWE-Exp: Experience-Driven Software Issue Resolution(SWE-Exp:经验驱动的软件问题解决)

[01:38] 🔍 Multimodal Referring Segmentation: A Survey(多模态指代表达分割:一项综述)

[01:59] 🧠 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding(3D-R1:增强3D VLM的推理能力以实现统一场景理解)

[02:40] 🤖 SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution(SWE-Debate:用于软件问题解决的竞争性多智能体辩论)

[03:05] ⚖ Learning an Efficient Multi-Turn Dialogue Evaluator from Multiple Judges(从多个评委中学习高效的多轮对话评估器)

[03:33] 🤯 Investigating Hallucination in Conversations for Low Resource Languages(研究低资源语言对话中的幻觉现象)

[04:00] 🧭 IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation(IGL-Nav:用于图像目标导航的增量式三维高斯定位)

[04:30] 🎧 SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation(SpA2V: 利用空间听觉线索进行音频驱动的空间感知视频生成)

[04:55] 🎮 Multi-Agent Game Generation and Evaluation via Audio-Visual Recordings(多智能体游戏生成与评估基于视听记录)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递