Daily Paper Cast

Jingwen Liang, Gengyu Wang

SCIENCE
DAILY

We update every weekday to discuss the highest-voted papers from Hugging Face Daily Papers (https://huggingface.co/papers). Both the podcast scripts and the audio are generated by AI. Feedback and suggestions are welcome! Email us: dailypapercast.ai@gmail.com

Creators:
Jingwen Liang, 3D ML, https://www.linkedin.com/in/jingwen-liang/
Gengyu Wang, LLM ML, http://wanggengyu.com

Listen on:
Spotify: https://open.spotify.com/show/21nrhmdaA8qoBiH8q03NXL
Apple Podcasts: https://podcasts.apple.com/us/podcast/daily-paper-cast/id1777620236

Cover image by Kawen Kuang, https://kawen.art


Created by: Jingwen Liang, Gengyu Wang
Years active: 2024 - 2025
Episodes: 1.4K
Rating: All audiences
Podcast website: Daily Paper Cast

Daily Paper Cast

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

DeepEyesV2: Toward Agentic Multimodal Model

Visual Spatial Tuning

VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
