39 min.

Episode 236: Eva Maxfield Brown & Boris Veytsman on OSS Dependencies in the Sciences
Sustain

    • Technology

Guests

Eva Maxfield Brown | Boris Veytsman


Panelist

Richard Littauer


Show Notes

In this episode of Sustain, host Richard Littauer talks with guests Eva Maxfield Brown and Boris Veytsman about their co-authored paper, "Biomedical Open Source Software: Crucial Packages and Hidden Heroes." The paper focuses on identifying crucial but often overlooked software dependencies in biomedical research. The discussion covers how the study used data from two million papers to map these dependencies, revealing both well-supported and undermaintained software components vital to scientific research. There’s a conversation on the methodological challenges and the concept of "Nebraska packages," which are essential yet potentially undermaintained parts of the software stack used in both industry and science. The conversation also covers broader implications for software sustainability, security, and future research directions, including improving how software contributions are tracked and recognized within scientific careers. Press download now to hear more!


[00:01:47] Richard dives into the paper co-authored by Eva and Boris. Boris explains the origins of the paper, starting from a workshop at CZI aimed at accelerating science through sustainable software, leading to the analysis of software used in biomedical research. He highlights the focus on identifying crucial yet often unmentioned software dependencies in research software, which he labels as “unsung heroes.”


[00:05:22] Boris provides findings from their study, noting that while many foundational packages were cited, there are significant packages that, despite their critical role, remain uncited.


[00:06:43] Eva discusses the concept of “Nebraska packages,” which are essential yet potentially undermaintained components of the software stack used in both industry and science. She also elaborates on the methodological challenges of determining which packages to include in their analysis, particularly for dependencies that vary between different users and contexts.


[00:09:42] Richard reflects on the broader implications of their discussion for the open source community, particularly in terms of software sustainability and security. Eva emphasizes the importance of security across all fields and discusses the potential impact of software bugs on scientific research and the need for robust software infrastructure.


[00:12:04] Boris comments on the necessity of well-tested tools in the scientific community, given that many scientists may lack a strong background in software development and training.


[00:13:47] Richard quotes from the paper discussing the absence of cycles in the network of software packages used in science, indicating a more robust design compared to general software. He questions this in light of earlier comments about scientists not being great at coding.


[00:14:08] Eva explains that the paper’s findings about acyclic dependencies (DAGs) might seem surprising given the common perception that scientific software is poorly developed. She notes that while scientists may not be trained in proper software packaging, the Python environment helps prevent cyclic dependencies.
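As a rough illustration of what "acyclic" means for a dependency network, here is a minimal sketch that checks a dependency graph for cycles using Kahn's algorithm. It is not the method from the paper; the function name `has_cycle` and the package names are hypothetical and chosen only for this example.

```python
from collections import deque

def has_cycle(dependencies):
    """Return True if the dependency graph contains a cycle.

    dependencies: dict mapping a package name to the list of
    packages it depends on (hypothetical input format).
    """
    # Collect every package that appears as a key or as a dependency.
    nodes = set(dependencies)
    for deps in dependencies.values():
        nodes.update(deps)

    # Count incoming edges (how many packages depend on each node).
    indegree = {n: 0 for n in nodes}
    for deps in dependencies.values():
        for d in deps:
            indegree[d] += 1

    # Kahn's algorithm: repeatedly remove nodes nothing depends on.
    queue = deque(n for n in nodes if indegree[n] == 0)
    visited = 0
    while queue:
        n = queue.popleft()
        visited += 1
        for d in dependencies.get(n, []):
            indegree[d] -= 1
            if indegree[d] == 0:
                queue.append(d)

    # If some nodes were never removed, they sit on a cycle.
    return visited != len(nodes)

# Hypothetical toy graphs, not data from the study.
acyclic = {"analysis_tool": ["array_lib"], "array_lib": []}
cyclic = {"pkg_a": ["pkg_b"], "pkg_b": ["pkg_a"]}
```

With these toy inputs, `has_cycle(acyclic)` is False and `has_cycle(cyclic)` is True.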


[00:17:31] Richard brings up “Katz centrality,” which is discussed in the paper. Boris clarifies that the term refers to a network centrality measure introduced by Leo Katz and explains how it helps determine the importance of nodes within a network.
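To make the idea of Katz centrality concrete, here is a minimal sketch in plain Python. It assumes the common formulation in which each node's score is a base value `beta` plus `alpha` times the scores of the nodes pointing at it; the parameter values, the function name, and the toy graph are assumptions for illustration, not taken from the paper.

```python
def katz_centrality(edges, alpha=0.1, beta=1.0, iterations=100):
    """Approximate Katz centrality by fixed-point iteration.

    edges: list of (dependent, dependency) pairs, so importance
    flows from a package to the packages it depends on.
    alpha, beta: attenuation factor and base score (assumed values).
    """
    nodes = sorted({n for edge in edges for n in edge})
    score = {n: beta for n in nodes}
    for _ in range(iterations):
        # Every node starts each round with the base score...
        new_score = {n: beta for n in nodes}
        # ...and gains a damped share of each dependent's score.
        for dependent, dependency in edges:
            new_score[dependency] += alpha * score[dependent]
        score = new_score
    return score

# Hypothetical toy graph: two tools depend on lib_x, which depends on core.
edges = [("tool_a", "lib_x"), ("tool_b", "lib_x"), ("lib_x", "core")]
scores = katz_centrality(edges)
```

In this toy graph, `lib_x` and `core` end up with higher scores than the two tools, even though nothing in the graph mentions them directly as end-user software. On an acyclic graph the iteration settles after a finite number of rounds; on graphs with cycles, `alpha` has to be small enough for the scores to converge.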


[00:20:13] Richard questions the practical applications of the research findings, probing for advice on supporting crucial but underrecognized dependencies within software ecosystems. Eva addresses future research directions, including improving ecosystem matching algorithms for better accuracy in linking software mentions to the correct ecosystems.


[00:22:50] Eva suggests expanding the research to cover more domains beyond biomedicine, considering differe

