The weekly podcast about Python and its use in science, research, and data. Tune in for engaging, educational, and technical discussions about machine learning, artificial intelligence, data science, and scientific research that is powered by Python.
Experimenting With Reinforcement Learning Using MushroomRL
Reinforcement learning is a branch of machine learning and AI that has a lot of promise for applications that need to evolve with changes to their inputs. To support the research happening in the field, including applications for robotics, Carlo D'Eramo and Davide Tateo created MushroomRL. In this episode they share how they have designed the project to be easy to work with, so that students can use it in their study, as well as extensible so that it can be used by businesses and industry professionals. They also discuss the strengths of reinforcement learning, how to design problems that can leverage its capabilities, and how to get started with MushroomRL for your own work.
Doing Dask Powered Data Science In The Saturn Cloud
A perennial problem of doing data science is that it works great on your laptop, until it doesn't. Another problem is being able to recreate your environment to collaborate on a problem with colleagues. Saturn Cloud aims to help with both of those problems by providing an easy to use platform for creating reproducible environments that you can use to build data science workflows and scale them easily with a managed Dask service. In this episode Julia Signall, head of open source at Saturn Cloud, explains how she is working with the product team and PyData community to reduce the points of friction that data scientists encounter as they are getting their work done.
Monitor The Health Of Your Machine Learning Products In Production With Evidently
You've got a machine learning model trained and running in production, but that's only half of the battle. Are you certain that it is still serving the predictions that you tested? Are the inputs within the range of tolerance that you designed? Monitoring machine learning products is an essential step of the story so that you know when it needs to be retrained against new data, or parameters need to be adjusted. In this episode Emeli Dral shares the work that she and her team at Evidently are doing to build an open source system for tracking and alerting on the health of your ML products in production. She discusses the ways that model drift can occur, the types of metrics that you need to track, and what to do when the health of your system is suffering. This is an important and complex aspect of the machine learning lifecycle, so give it a listen and then try out Evidently for your own projects.
Making Automated Machine Learning More Accessible With EvalML
Building a machine learning model is a process that requires a lot of iteration and trial and error. For certain classes of problem a large portion of the searching and tuning can be automated. This allows data scientists to focus their time on more complex or valuable projects, as well as opening the door for non-specialists to experiment with machine learning. Frustrated with some of the awkward or difficult to use tools for AutoML, Angela Lin and Jeremy Shih helped to create the EvalML framework. In this episode they share the use cases for automated machine learning, how they have designed the EvalML project to be approachable, and how you can use it for building and training your own models.
Growing And Supporting The Data Science Community At Anaconda
Data scientists are tasked with answering challenging questions using data that is often messy and incomplete. Anaconda is on a mission to make the lives of data professionals more manageable through creation and maintenance of high quality libraries and frameworks, the distribution of an easy to use Python distribution and package ecosystem, and high quality training material. In this episode Kevin Goldsmith, CTO of Anaconda, discusses the technical and social challenges faced by data scientists, the ways that the Python ecosystem has evolved to help address those difficulties, and how Anaconda is engaging with the community to provide high quality tools and education for this constantly changing practice.
Network Analysis At The Speed Of C With The Power Of Python Using NetworKit
Analysing networks is a growing area of research in academia and industry. In order to be able to answer questions about large or complex relationships it is necessary to have fast and efficient algorithms that can process the data quickly. In this episode Eugenio Angriman discusses his contributions to the NetworKit library to provide an accessible interface for these algorithms. He shares how he is using NetworKit for his own research, the challenges of working with large and complex networks, and the kinds of questions that can be answered with data that fits on your laptop.
Goes Deep in a Good Way
I enjoy some episodes of other Python podcasts, but I enjoy every episode of this podcast. Tobias’ expertise and experience allow for a level of depth that makes his podcast stand out.
Love it, great well informed questions.
At first it was hard to get into the podcasts, now it’s one I look forward too. He asks really good questions. It’s easy to tell Tobus does his homework. The only suggestion I’d give is to pause for an answer between each question. Every question seems to be two distinct questions for the guests. Maybe it is so the guests can speak to what they are most comfortable, but each of the questions are great and would be good on their own. Overall, I would highly recommend to anyone with a technical bent and especially a pythonic one.
Starts hard but hang in there
I agree that the delivery is a little flat but this guy asks really thoughtful questions. I’ve been listening for over a year and like I said, keep listening. Watch your guests levels though. If the levels are too off, I move on.