HuggingFace 每日AI论文速递

2025.10.03 | LongCodeZip删得快准;迈向分钟级高质量视频生成

本期的 15 篇论文如下:

[00:22] 🗜 LongCodeZip: Compress Long Context for Code Language Models(LongCodeZip:面向代码大模型的长上下文压缩方法)

[00:56] 🎬 Self-Forcing++: Towards Minute-Scale High-Quality Video Generation(自增强++:迈向分钟级高质量视频生成)

[01:38] 🧠 ExGRPO: Learning to Reason from Experience(基于经验的群体相对策略优化:让大模型学会从经验中推理)

[02:32] 🥷 StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions(隐身投毒:基于密度引导幻觉的鲁棒3D高斯溅射攻击)

[03:32] 🎛 Interactive Training: Feedback-Driven Neural Network Optimization(交互式训练:反馈驱动的神经网络优化)

[04:24] 📈 StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?(StockBench:大模型智能体能否在真实股市中稳定盈利?)

[05:07] 🔍 VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning(VOGUE:用视觉不确定性引导探索,提升多模态推理)

[05:44] 🪓 The Rogue Scalpel: Activation Steering Compromises LLM Safety(失控的手术刀:激活向量操控竟瓦解大模型安全锁)

[06:21] 🔍 CLUE: Non-parametric Verification from Experience via Hidden-State Clustering(CLUE:基于隐状态聚类的非参数经验验证)

[07:09] 🔍 ModernVBERT: Towards Smaller Visual Document Retrievers(ModernVBERT:打造更轻量的视觉文档检索器)

[07:54] 🗺 RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning(RewardMap:通过多阶段强化学习解决细粒度视觉推理中的稀疏奖励问题)

[08:37] 🚀 F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data(F2LLM技术报告:仅用600万开源数据即可达到SOTA嵌入性能)

[09:13] 🧠 RLP: Reinforcement as a Pretraining Objective(RLP:将强化学习作为预训练目标)

[09:45] 🖱 DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing(DragFlow:借助区域监督释放DiT先验,实现拖拽式编辑)

[10:19] 🚀 The Unreasonable Effectiveness of Scaling Agents for Computer Use(扩展计算机使用代理的规模带来的不合理有效性)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递