4 days ago
Episode 105
10 minutes

Evaluating Retrieval Capabilities of Language Models [Microsoft]

In this episode, we explore how to evaluate the retrieval-augmented generation (RAG) capabilities of small language models. On the business side, we discuss why RAG, long context windows, and small language models are critical for building scalable and reliable AI systems. On the technical side, we walk through the Needle-in-a-Haystack methodology and discuss key findings about retrieval performance across different models.
For more details, see Microsoft's tech blog post: https://medium.com/data-science-at-microsoft/evaluating-rag-capabilities-of-small-language-models-e7531b3a5061


Show: Snacks Weekly on Data Science
Frequency: Updated weekly
Published: September 29, 2025 at 11:00 UTC
Length: 10 minutes
Episode: 105
Rating: Suitable for children
