HuggingFace 每日AI论文速递

2025.09.18 | FP8压缩+翻译微调低成本炼阿语大模型;2B-8B小模型洗数据硬刚GPT-4o

本期的 14 篇论文如下:

[00:19] 🐪 Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale(Hala技术报告:规模化构建阿拉伯语为中心的指令与翻译模型)

[00:56] 🚀 SAIL-VL2 Technical Report(SAIL-VL2技术报告)

[01:42] 🌐 PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era(全景视界:具身AI时代的360°视觉崛起)

[02:33] 🎓 GenExam: A Multidisciplinary Text-to-Image Exam(GenExam:多学科文本到图像生成考试基准)

[03:25] 🧹 Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning(擦除敏感记忆!用机器遗忘技术为代码大模型“去隐私”)

[03:59] 🩺 MedResearcher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework(MedResearcher-R1:基于知识引导轨迹合成的专家级医学深度研究智能体)

[04:37] 🔍 MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook(MARS2 2025多模态推理挑战赛:数据集、方法、结果、讨论与展望)

[05:22] 🎭 Wan-Animate: Unified Character Animation and Replacement with Holistic Replication(Wan-Animate:统一角色动画与替换的完整复现框架)

[05:59] 🧮 THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning(THOR:融合工具的分层强化学习优化数学推理)

[06:40] 🔍 Improving Context Fidelity via Native Retrieval-Augmented Reasoning(提升上下文保真度的原生检索增强推理方法)

[07:20] 🌍 AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions(AERIS:面向可靠且高技巧地球系统预测的阿尔贡地球系统模型)

[08:13] 🎛 SteeringControl: Holistic Evaluation of Alignment Steering in LLMs(SteeringControl:对大模型对齐操控的全景评估)

[08:48] ⚛ Quantum Variational Activation Functions Empower Kolmogorov-Arnold Networks(量子变分激活函数赋能Kolmogorov-Arnold网络)

[09:37] 🚀 Hybrid Quantum-Classical Model for Image Classification(用于图像分类的混合量子-经典模型)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递