21 G. TEMU
11 MIN

2025.10.03 | LongCodeZip删得快准；迈向分钟级高质量视频生成

HuggingFace 每日AI论文速递

本期的 15 篇论文如下：

[00:22] 🗜 LongCodeZip: Compress Long Context for Code Language Models（LongCodeZip：面向代码大模型的长上下文压缩方法）

[00:56] 🎬 Self-Forcing++: Towards Minute-Scale High-Quality Video Generation（自增强++：迈向分钟级高质量视频生成）

[01:38] 🧠 ExGRPO: Learning to Reason from Experience（基于经验的群体相对策略优化：让大模型学会从经验中推理）

[02:32] 🥷 StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions（隐身投毒：基于密度引导幻觉的鲁棒3D高斯溅射攻击）

[03:32] 🎛 Interactive Training: Feedback-Driven Neural Network Optimization（交互式训练：反馈驱动的神经网络优化）

[04:24] 📈 StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?（StockBench：大模型智能体能否在真实股市中稳定盈利？）

[05:07] 🔍 VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning（VOGUE：用视觉不确定性引导探索，提升多模态推理）

[05:44] 🪓 The Rogue Scalpel: Activation Steering Compromises LLM Safety（失控的手术刀：激活向量操控竟瓦解大模型安全锁）

[06:21] 🔍 CLUE: Non-parametric Verification from Experience via Hidden-State Clustering（CLUE：基于隐状态聚类的非参数经验验证）

[07:09] 🔍 ModernVBERT: Towards Smaller Visual Document Retrievers（ModernVBERT：打造更轻量的视觉文档检索器）

[07:54] 🗺 RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning（RewardMap：通过多阶段强化学习解决细粒度视觉推理中的稀疏奖励问题）

[08:37] 🚀 F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data（F2LLM技术报告：仅用600万开源数据即可达到SOTA嵌入性能）

[09:13] 🧠 RLP: Reinforcement as a Pretraining Objective（RLP：将强化学习作为预训练目标）

[09:45] 🖱 DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing（DragFlow：借助区域监督释放DiT先验，实现拖拽式编辑）

[10:19] 🚀 The Unreasonable Effectiveness of Scaling Agents for Computer Use（扩展计算机使用代理的规模带来的不合理有效性）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

Strona internetowa odcinka

Program

HuggingFace 每日AI论文速递
Częstotliwość

Uakt. codziennie
Opublikowano

3 października 2025 23:00 UTC
Czas trwania

11 min
Klasyfikacja

Dla wszystkich

2025.10.03 | LongCodeZip删得快准；迈向分钟级高质量视频生成

Informacje