7 episodes

A podcast about data science, machine learning, artificial intelligence, statistics and everything related to data.

Home Page: https://www.yourdatateacher.com

Your Data Teacher Podcast, by Your Data Teacher

    • Technology

    Episode 7 - A Python library to remove collinearity

    Collinearity is a serious problem in machine learning. It increases the dimensionality of a dataset without adding information. That's why I've created a Python library that removes collinearity from a dataset. I talk about this library in this episode.

    Article: https://www.yourdatateacher.com/2021/06/28/a-python-library-to-remove-collinearity/ 

    Pypi package: https://pypi.org/project/collinearity/ 

    GitHub repo: https://github.com/gianlucamalato/collinearity
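
To make the idea concrete, here is a minimal sketch of one way to filter collinear features: keep a column only if its absolute Pearson correlation with every column already kept stays below a threshold. This is an illustrative greedy filter, not necessarily the algorithm the library implements; the function names, the toy data and the 0.9 threshold are all assumptions.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists (assumes neither is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)

def drop_collinear(columns, threshold=0.9):
    """Greedily keep columns whose |correlation| with every kept column is below threshold."""
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept

# Hypothetical toy dataset: "b" is almost exactly 2 * "a", so it adds no information.
data = {
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [2.1, 3.9, 6.2, 8.0, 9.9],
    "c": [5.0, 1.0, 4.0, 2.0, 3.0],
}
selected = drop_collinear(data, threshold=0.9)
print(selected)
```

The result depends on column order, because the first column of a correlated pair is the one that survives.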

    • 8 min
    Episode 6 - Checking the distribution of your data using Q-Q plot

    In this episode, I talk about the Q-Q plot and how to use it to check whether a dataset follows a particular distribution. Instead of relying on a more complex hypothesis test such as the Kolmogorov-Smirnov test, this simple plot lets us check whether a dataset follows a particular distribution, or whether two datasets were drawn from the same distribution.

    Link to the article: https://www.yourdatateacher.com/2021/06/16/how-to-use-q-q-plot-for-checking-the-distribution-of-our-data/
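
As a rough illustration, the points of a normal Q-Q plot can be computed with the Python standard library alone: sort the sample and pair each value with the corresponding quantile of the standard normal distribution. The function name, the plotting positions `(i - 0.5) / n` and the synthetic sample below are assumptions made for this sketch.

```python
import random
from statistics import NormalDist

def qq_points(sample):
    """Pair each sorted observation with the matching standard normal quantile."""
    n = len(sample)
    ordered = sorted(sample)
    std_normal = NormalDist()  # mean 0, standard deviation 1
    # Plotting positions (i - 0.5) / n keep the probabilities strictly inside (0, 1).
    theoretical = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return list(zip(theoretical, ordered))

rng = random.Random(42)  # fixed seed, synthetic data for illustration only
sample = [rng.gauss(10.0, 2.0) for _ in range(50)]
points = qq_points(sample)

# Each pair is (theoretical quantile, observed quantile).
for theoretical_q, observed_q in points[:3]:
    print(f"{theoretical_q:+.3f}  {observed_q:.3f}")
```

Plotting these pairs as a scatter plot gives the Q-Q plot. If the sample is roughly normal, the points fall close to a straight line whose slope and intercept reflect the sample's standard deviation and mean.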

    • 7 min
    Episode 5 - Tuning the threshold in binary classification tasks

    In this episode, I'll talk about tuning the threshold in binary classification tasks. The usual value for the threshold is 0.5, but optimizing it helps make the model fit our needs. I talk about optimizing it according to the ROC curve and by maximizing the balanced accuracy.

    Link to the article: https://www.yourdatateacher.com/2021/06/14/are-you-still-using-0-5-as-a-threshold/
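
A minimal sketch of the balanced-accuracy approach: try every predicted score as a candidate threshold and keep the one with the highest balanced accuracy, that is, the mean of the true positive rate and the true negative rate. The labels and scores below are made up for illustration, and the function names are assumptions.

```python
def balanced_accuracy(y_true, y_score, threshold):
    """Mean of true positive rate and true negative rate at the given threshold."""
    tp = sum(1 for y, s in zip(y_true, y_score) if y == 1 and s >= threshold)
    tn = sum(1 for y, s in zip(y_true, y_score) if y == 0 and s < threshold)
    positives = sum(y_true)
    negatives = len(y_true) - positives
    return 0.5 * (tp / positives + tn / negatives)

def best_threshold(y_true, y_score):
    """Return the candidate threshold (taken from the scores) with the highest balanced accuracy."""
    candidates = sorted(set(y_score))
    return max(candidates, key=lambda t: balanced_accuracy(y_true, y_score, t))

# Hypothetical labels and predicted probabilities from some binary classifier.
y_true = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
y_score = [0.05, 0.10, 0.15, 0.20, 0.30, 0.45, 0.25, 0.35, 0.60, 0.80]

tuned = best_threshold(y_true, y_score)
print(tuned, balanced_accuracy(y_true, y_score, tuned))
print(0.5, balanced_accuracy(y_true, y_score, 0.5))
```

On this toy data the tuned threshold is 0.25, which catches every positive at the cost of two false positives, while the default 0.5 misses half of the positives. In practice the threshold should be tuned on data that is not used for the final evaluation.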

    • 7 min
    Episode 4 - Ensemble models. Bagging and boosting

    In this episode, I'm going to talk about ensemble models, particularly bagging and boosting. Bagging is very useful for reducing variance, while boosting is used for reducing bias. The most common bagging algorithm is Random Forest; the most common boosting algorithm is Gradient Boosting, whose most common implementations are XGBoost, LightGBM and CatBoost.

    Home Page: https://www.yourdatateacher.com
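
The bagging half of the idea can be sketched in a few lines of plain Python: train the same high-variance base model on several bootstrap samples and average the predictions. The 1-nearest-neighbour regressor used as the base model, the function names and the synthetic data are all assumptions chosen to keep the sketch short; Random Forest applies the same principle to decision trees.

```python
import random

def fit_1nn(xs, ys):
    """Return a 1-nearest-neighbour regressor, a deliberately high-variance base model."""
    def predict(x):
        nearest = min(range(len(xs)), key=lambda i: abs(xs[i] - x))
        return ys[nearest]
    return predict

def bagging(xs, ys, n_models=25, seed=0):
    """Train one base model per bootstrap sample and average their predictions."""
    rng = random.Random(seed)
    n = len(xs)
    models = []
    for _ in range(n_models):
        # Bootstrap sample: n indices drawn with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(fit_1nn([xs[i] for i in idx], [ys[i] for i in idx]))

    def predict(x):
        return sum(model(x) for model in models) / len(models)
    return predict

# Synthetic noisy line y = 2x + noise, for illustration only.
data_rng = random.Random(1)
xs = [i / 10 for i in range(30)]
ys = [2.0 * x + data_rng.gauss(0.0, 0.5) for x in xs]

single = fit_1nn(xs, ys)
bagged = bagging(xs, ys)
print(single(1.23), bagged(1.23))
```

The single model returns the noisy target of one training point, while the bagged prediction averages over neighbouring points picked up by the different bootstrap samples, which smooths out part of the noise.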

    • 11 min
    Episode 3 - Precision, recall, accuracy. How to choose?

    In this episode, I talk about accuracy, precision and recall. We're going to focus on what they are and when to use them in machine learning projects.

    Link to the article: https://www.yourdatateacher.com/2021/06/07/precision-recall-accuracy-how-to-choose/
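
A small worked example shows why the choice matters. On an imbalanced dataset, accuracy can look excellent while recall is poor. The labels and predictions below are made up for illustration, and the function name is an assumption.

```python
def classification_metrics(y_true, y_pred):
    """Return (accuracy, precision, recall) for binary labels encoded as 0 and 1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    accuracy = (tp + tn) / len(y_true)
    precision = tp / (tp + fp)  # of the predicted positives, how many are real
    recall = tp / (tp + fn)     # of the real positives, how many were found
    return accuracy, precision, recall

# Hypothetical imbalanced data: 5 positives out of 100 samples.
y_true = [1] * 5 + [0] * 95
# The model finds 2 of the 5 positives and raises 1 false alarm.
y_pred = [1, 1, 0, 0, 0] + [1] + [0] * 94

accuracy, precision, recall = classification_metrics(y_true, y_pred)
print(f"accuracy={accuracy:.2f} precision={precision:.2f} recall={recall:.2f}")
```

Here the accuracy is 0.96 even though the model misses three of the five positives (recall 0.40), so accuracy alone would be misleading if finding the positives is what matters.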

    • 11 min
    Episode 2 - How to explain neural networks using SHAP

    Today we're going to talk about how we can explain neural networks. Neural networks are like black boxes that hide the way they model and represent data. That's why explaining them is very difficult. A very powerful approach is called SHAP. Using this method, we can calculate the impact of each feature on a given model's predictions, independently of the type of model we're using. It's very useful for black boxes like neural networks.

    Home page: https://www.yourdatateacher.com

    Link to the article: https://www.yourdatateacher.com/2021/05/17/how-to-explain-neural-networks-using-shap/
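
SHAP is built on Shapley values from game theory. The sketch below computes exact Shapley values by brute force for a tiny hand-written model, to show what "the impact of a feature" means; it does not use the `shap` library, which relies on faster approximations. The toy model, the all-zeros baseline used to represent an "absent" feature, and the function names are assumptions.

```python
from itertools import permutations

def shapley_values(model, x, baseline):
    """Exact Shapley values: average marginal contribution of each feature over all orderings."""
    n = len(x)
    contributions = [0.0] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        current = list(baseline)  # start with every feature "absent"
        previous = model(current)
        for i in order:
            current[i] = x[i]     # switch feature i to its real value
            value = model(current)
            contributions[i] += value - previous
            previous = value
    return [c / len(orderings) for c in contributions]

def toy_model(features):
    """Hypothetical black box: one additive term and one interaction term."""
    return 2.0 * features[0] + features[1] * features[2]

x = [1.0, 2.0, 3.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(toy_model, x, baseline)
print(phi)
```

The values sum to the difference between the prediction for `x` and the prediction for the baseline, and the interaction term is split equally between the two features involved. Enumerating all orderings grows factorially with the number of features, so this approach only works for very small examples.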

    • 6 min
