2025.07.09 | 潜在推理提升LLM表达能力;SingLoRA优化低秩适应性能。

HuggingFace 每日AI论文速递

本期的 15 篇论文如下:

[00:25] 🤔 A Survey on Latent Reasoning(潜在推理研究综述)

[00:59] 💡 SingLoRA: Low Rank Adaptation Using a Single Matrix(SingLoRA:使用单矩阵的低秩适应)

[01:47] 🧩 OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion(OmniPart:基于语义解耦和结构内聚的部件感知三维生成)

[02:36] 🤖 CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization(CriticLean:评论引导的数学形式化强化学习)

[03:17] 🤖 StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling(StreamVLN:基于慢速-快速上下文建模的流式视觉-语言导航)

[03:50] 🫂 RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents(RLVER:基于可验证情感奖励的强化学习,用于培养共情智能体)

[04:30] 🩺 MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos(MedGen:通过扩展细粒度标注的医学视频来解锁医学视频生成)

[05:14] 🤖 Is Diversity All You Need for Scalable Robotic Manipulation?(可扩展的机器人操作是否只需要多样性?)

[05:54] 🤖 Coding Triangle: How Does Large Language Model Understand Code?(代码三角形:大型语言模型如何理解代码?)

[06:38] 🇪 Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts(尼罗河Chat:用于阿拉伯语和拉丁语埃及语语言模型)

[07:21] 🖱 GTA1: GUI Test-time Scaling Agent(GTA1:GUI测试时缩放代理)

[08:00] 🧮 Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers(基于大语言模型的重排序器效率-效果再排序的FLOPs研究)

[08:45] 🧬 PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs(PRING:重新思考从蛋白质对到图的蛋白质-蛋白质相互作用预测)

[09:33] 🩻 SAMed-2: Selective Memory Enhanced Medical Segment Anything Model(SAMed-2:选择性记忆增强医学图像分割模型)

[10:01] 🎬 Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation(Tora2:用于多实体视频生成的运动和外观定制扩散Transformer)

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

To listen to explicit episodes, sign in.

Stay up to date with this show

Sign in or sign up to follow shows, save episodes and get the latest updates.

Select a country or region

Africa, Middle East, and India

Asia Pacific

Europe

Latin America and the Caribbean

The United States and Canada