Daily Paper Cast

Jingwen Liang, Gengyu Wang

0.0 (0)
CIENCIAS
CADA DÍA

We update every weekday to discuss highest-voted papers from Huggingface Daily Paper (https://huggingface.co/papers). Both the podcast scripts and audio are generated by AI. Feedback and suggestions are welcome! Email us: dailypapercast.ai@gmail.com Creator: Jingwen Liang, 3D ML, https://www.linkedin.com/in/jingwen-liang/ Gengyu Wang, LLM ML, http://wanggengyu.com Listen on: Spotify: https://open.spotify.com/show/21nrhmdaA8qoBiH8q03NXL Apple Podcast: https://podcasts.apple.com/us/podcast/daily-paper-cast/id1777620236 Cover Image by Kawen Kuang https://kawen.art

Ver todo (1.4 k)

Creador

Jingwen Liang, Gengyu Wang
Años de actividad

2024 - 2025
Episodios

1.4 k
Clasificación

Apto
Mostrar sitio web

Daily Paper Cast

Daily Paper Cast

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

SAM 3: Segment Anything with Concepts

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

VisPlay: Self-Evolving Vision-Language Models from Images

Acerca de

Información

Daily Paper Cast

Episodios

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

SAM 3: Segment Anything with Concepts

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

VisPlay: Self-Evolving Vision-Language Models from Images

Acerca de

Información