977 episodes

Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support

Arxiv Papers Igor Melnyk

- Science

- 2 MAY 2024
[QA] A Careful Examination of Large Language Model Performance on Grade School Arithmetic

[QA] A Careful Examination of Large Language Model Performance on Grade School Arithmetic

Study investigates dataset contamination in large language models for mathematical reasoning using Grade School Math 1000 benchmark, finding evidence of overfitting and potential memorization of benchmark questions.

https://arxiv.org/abs//2405.00332

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

---

Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
- 7 min
- 2 MAY 2024
A Careful Examination of Large Language Model Performance on Grade School Arithmetic

A Careful Examination of Large Language Model Performance on Grade School Arithmetic

Study investigates dataset contamination in large language models for mathematical reasoning using Grade School Math 1000 benchmark, finding evidence of overfitting and potential memorization of benchmark questions.

https://arxiv.org/abs//2405.00332

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

---

Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
- 16 min
- 1 MAY 2024
[QA] Self-Play Preference Optimization for Language Model Alignment

[QA] Self-Play Preference Optimization for Language Model Alignment

The paper introduces SPPO, a self-play method for language model alignment, achieving state-of-the-art results without external supervision, outperforming DPO and IPO on various benchmarks.

https://arxiv.org/abs//2405.00675

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

---

Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
- 9 min
- 1 MAY 2024
Self-Play Preference Optimization for Language Model Alignment

Self-Play Preference Optimization for Language Model Alignment

The paper introduces SPPO, a self-play method for language model alignment, achieving state-of-the-art results without external supervision, outperforming DPO and IPO on various benchmarks.

https://arxiv.org/abs//2405.00675

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

---

Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
- 19 min
- 1 MAY 2024
[QA] Is Bigger Edit Batch Size Always Better? - An Empirical Study on Model Editing with Llama-3

[QA] Is Bigger Edit Batch Size Always Better? - An Empirical Study on Model Editing with Llama-3

Study evaluates model editing techniques on Llama-3, finding sequential editing more effective than batch editing. Suggests combining both methods for optimal performance.

https://arxiv.org/abs//2405.00664

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

---

Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
- 9 min
- 1 MAY 2024
Is Bigger Edit Batch Size Always Better? - An Empirical Study on Model Editing with Llama-3

Is Bigger Edit Batch Size Always Better? - An Empirical Study on Model Editing with Llama-3

Study evaluates model editing techniques on Llama-3, finding sequential editing more effective than batch editing. Suggests combining both methods for optimal performance.

https://arxiv.org/abs//2405.00664

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

---

Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
- 5 min