977 episodes

Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

Arxiv Papers Igor Melnyk

    • Science

    [QA] A Careful Examination of Large Language Model Performance on Grade School Arithmetic

    The study investigates dataset contamination in large language models for mathematical reasoning using the Grade School Math 1000 benchmark, finding evidence of overfitting and potential memorization of benchmark questions.



    https://arxiv.org/abs/2405.00332



    YouTube: https://www.youtube.com/@ArxivPapers
    TikTok: https://www.tiktok.com/@arxiv_papers
    Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
    Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers





    • 7 min
    A Careful Examination of Large Language Model Performance on Grade School Arithmetic

    The study investigates dataset contamination in large language models for mathematical reasoning using the Grade School Math 1000 benchmark, finding evidence of overfitting and potential memorization of benchmark questions.



    https://arxiv.org/abs/2405.00332




    • 16 min
    [QA] Self-Play Preference Optimization for Language Model Alignment

    The paper introduces SPPO, a self-play method for language model alignment, achieving state-of-the-art results without external supervision, outperforming DPO and IPO on various benchmarks.



    https://arxiv.org/abs/2405.00675




    • 9 min
    Self-Play Preference Optimization for Language Model Alignment

    The paper introduces SPPO, a self-play method for language model alignment, achieving state-of-the-art results without external supervision, outperforming DPO and IPO on various benchmarks.



    https://arxiv.org/abs/2405.00675




    • 19 min
    [QA] Is Bigger Edit Batch Size Always Better? - An Empirical Study on Model Editing with Llama-3

    The study evaluates model editing techniques on Llama-3 and finds sequential editing more effective than batch editing. It suggests combining both methods for optimal performance.



    https://arxiv.org/abs/2405.00664




    • 9 min
    Is Bigger Edit Batch Size Always Better? - An Empirical Study on Model Editing with Llama-3

    The study evaluates model editing techniques on Llama-3 and finds sequential editing more effective than batch editing. It suggests combining both methods for optimal performance.



    https://arxiv.org/abs/2405.00664




    • 5 min
