The 9 papers in this episode:
[00:22] 💻 ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
[01:02] 🚀 Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance
[01:33] 💥 BANG: Dividing 3D Assets via Generative Exploded Dynamics
[02:17] 🧠 VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
[02:51] 🚁 Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision
[03:34] 🧩 Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
[04:04] 🚀 Efficient Differentially Private Fine-Tuning of LLMs via Reinforcement Learning
[04:56] 🛠 Repair-R1: Better Test Before Repair
[05:33] 🌍 MetaCLIP 2: A Worldwide Scaling Recipe
[Follow Us]
You can also find us on the following platform for more beyond the podcast:
Xiaohongshu: AI速递
Information
- Podcast
- Frequency: Daily
- Published: August 1, 2025 at 00:00 UTC
- Duration: 6 min
- Rating: Clean (no explicit language)