2025.07.09 | Latent reasoning boosts LLM expressiveness; SingLoRA improves low-rank adaptation performance.

The 15 papers in this episode:

[00:25] 🤔 A Survey on Latent Reasoning

[00:59] 💡 SingLoRA: Low Rank Adaptation Using a Single Matrix

[01:47] 🧩 OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

[02:36] 🤖 CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

[03:17] 🤖 StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling

[03:50] 🫂 RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

[04:30] 🩺 MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos

[05:14] 🤖 Is Diversity All You Need for Scalable Robotic Manipulation?

[05:54] 🤖 Coding Triangle: How Does Large Language Model Understand Code?

[06:38] 🇪🇬 Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts

[07:21] 🖱 GTA1: GUI Test-time Scaling Agent

[08:00] 🧮 Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers

[08:45] 🧬 PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs

[09:33] 🩻 SAMed-2: Selective Memory Enhanced Medical Segment Anything Model

[10:01] 🎬 Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation

[Follow Us]

You can also find us on the following platforms for more information beyond the podcast itself.

Xiaohongshu: AI速递
