HuggingFace 每日AI论文速递

2025.07.31 | ScreenCoder自动化UI转代码;Falcon-H1混合架构,提升长序列效率。

本期的 9 篇论文如下:

[00:22] 💻 ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents(ScreenCoder:模块化多模态智能体赋能前端视觉代码生成)

[01:02] 🚀 Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance(Falcon-H1:重塑效率与性能的混合架构语言模型系列)

[01:33] 💥 BANG: Dividing 3D Assets via Generative Exploded Dynamics(BANG:基于生成式爆炸动态的三维资产分解)

[02:17] 🧠 VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning(VL-Cogito:面向高级多模态推理的渐进式课程强化学习)

[02:51] 🚁 Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision(弱监督下航空影像车辆检测器在未知领域的适配)

[03:34] 🧩 Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation(迈向指代性音视频分割中的全模态表达与推理)

[04:04] 🚀 Efficient Differentially Private Fine-Tuning of LLMs via Reinforcement Learning(基于强化学习的大语言模型高效差分隐私微调)

[04:56] 🛠 Repair-R1: Better Test Before Repair(Repair-R1:修复前先测试,效果更佳)

[05:33] 🌍 MetaCLIP 2: A Worldwide Scaling Recipe(MetaCLIP 2:全球规模化训练方案)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递