HuggingFace 每日AI论文速递

2025.09.15 | 数据集升级测互动;模型大小非长程瓶颈

本期的 14 篇论文如下:

[00:25] 📚 IntrEx: A Dataset for Modeling Engagement in Educational Conversations(IntrEx:面向教育对话中参与度建模的数据集)

[01:02] 📏 The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs(“收益递减的幻觉”:衡量大语言模型的长时程执行能力)

[01:54] 🧩 X-Part: high fidelity and structure coherent shape decomposition(X-Part:高保真且结构一致的三维形状分解)

[02:33] 🖼 InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis(InfGen:分辨率无关的可扩展图像合成新范式)

[03:04] 🔍 HANRAG: Heuristic Accurate Noise-resistant Retrieval-Augmented Generation for Multi-hop Question Answering(HANRAG:面向多跳问答的启发式精准抗噪检索增强生成方法)

[03:50] 🎙 VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions(VStyle:基于语音指令的语音风格自适应基准)

[04:44] 🌸 FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies(FLOWER:以高效视觉-语言-动作流策略普及通用机器人策略)

[05:20] 🎨 Inpainting-Guided Policy Optimization for Diffusion Large Language Models(面向扩散大语言模型的基于文本补全引导的策略优化方法)

[05:58] 🤖 Virtual Agent Economies(虚拟代理经济)

[06:28] 📈 QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading(QuantAgent:面向高频交易的价格驱动多智能体大语言模型框架)

[07:02] 🧪 MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools(MCP-AgentBench:基于MCP中介工具的通用语言智能体真实性能评测)

[07:41] 🎨 Color Me Correctly: Bridging Perceptual Color Spaces and Text Embeddings for Improved Diffusion Generation(精准上色:连接感知色彩空间与文本嵌入以提升扩散生成质量)

[08:31] 🦎 LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios(LoFT:面向开放世界长尾场景的参数高效半监督微调方法)

[09:13] 🗞 CMHG: A Dataset and Benchmark for Headline Generation of Minority Languages in China(CMHG:中国少数民族语言新闻标题生成数据集与评测基准)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递