Daily Paper Cast

Jingwen Liang, Gengyu Wang

0.0（0則評分）
科學
每日更新

We update every weekday to discuss highest-voted papers from Huggingface Daily Paper (https://huggingface.co/papers). Both the podcast scripts and audio are generated by AI. Feedback and suggestions are welcome! Email us: dailypapercast.ai@gmail.com Creator: Jingwen Liang, 3D ML, https://www.linkedin.com/in/jingwen-liang/ Gengyu Wang, LLM ML, http://wanggengyu.com Listen on: Spotify: https://open.spotify.com/show/21nrhmdaA8qoBiH8q03NXL Apple Podcast: https://podcasts.apple.com/us/podcast/daily-paper-cast/id1777620236 Cover Image by Kawen Kuang https://kawen.art

顯示全部 (1,221)

創作者

Jingwen Liang, Gengyu Wang
活躍年代

2024年 - 2025年
集數

1221
年齡分級

兒少適宜
節目網站

Daily Paper Cast

科技

科技

每週更新
科學

科學

每日更新
自然科學

自然科學

每週更新
投資

投資

9月26日更新

Daily Paper Cast

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

GEM: A Gym for Agentic LLMs

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

PIPer: On-Device Environment Setup via Online Reinforcement Learning

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

ACON: Optimizing Context Compression for Long-horizon LLM Agents

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

簡介

資訊

你可能也會喜歡

Daily Paper Cast

集數

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

GEM: A Gym for Agentic LLMs

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

PIPer: On-Device Environment Setup via Online Reinforcement Learning

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

ACON: Optimizing Context Compression for Long-horizon LLM Agents

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

簡介

資訊

你可能也會喜歡