The Data Skeptic Podcast features interviews and discussion of topics related to data science, statistics, machine learning, artificial intelligence and the like, all from the perspective of applying critical thinking and the scientific method to evaluate the veracity of claims and efficacy of approaches.
Your Mouse Reveals Your Gender and Age
When we navigate a webpage, our mouse movements are fairly easy to track and collect. Today, Luis Leiva, a Professor of Computer Science, discusses how mouse tracking data can be used to predict age, gender and user attention. He also discusses the privacy concerns around mouse tracking data and possible ways such tracking can be curtailed.
Measuring Web Search Behavior
On the show, Aleksandra Urman and Mykola Makhortykh join us to discuss their work on the comparative analysis of web search behavior using web tracking data. They share interesting results from their analysis, covering user preferences for search engines, demographic patterns, and differences between how men and women surf the net.
StrategyQA and Big Bench
Did Aristotle Use a Laptop? That's a question from the StrategyQA benchmark which highlights the stretch goals for current artificial intelligence systems. Answering a question like that requires several cognitive steps and reasoning. Constructing a dataset of similarly challenging questions is a major undertaking. On today's episode, Mor Geva returns to share details about the creation of StrategyQA and the larger Big Bench dataset it has been included in.
Ad Blockers Effect on News Consumption
At first glance, ad blockers would seem to reduce the revenue of news publishers, but this may not be completely true. On the show today, Shunyao Yan, an Assistant Professor in Marketing at Leavey School of Business, Santa Clara University, discusses the effect of ad blockers on news consumption and how ad blockers can potentially be helpful for news publishers.
Your Consent is Worth 75 Euros a Year
People who do not want their data tracked and shared online can pay a small fee at a cookie paywall. But are the websites keeping to their side of the bargain? Victor Morel, a Postdoc candidate at the Chalmers University of Technology, joins us to discuss his work on auditing the activities of cookie paywalls. He discusses the findings from his analysis and proposes some ways to make cookie paywalls more transparent.
Automated Email Generation for Targeted Attacks
The advancement of generative language models has been a force for good, but also for evil. On the show, Avisha Das, a post-doctoral scholar at the University of Texas Health Center, joins us to discuss how attackers use machine learning to create phishing emails that do not arouse suspicion. She also discusses how she used an RNN for automated email generation, with the goal of defeating statistical detectors.
I have been following for a few months now. If you’re looking for a wide perspective on big data and AI, this is the place. Kyle has a wide knowledge base, he’s not focused on a simple thing, rather, he explores multiple models, to help with research. His guests are amazing. I can’t wait to be invited to appear on the show to discuss standard operations for an average guy doing signs and lighting.
Always learning about data w/ Kyle
Similar to one of my other favorite analysts - Avinash Kaushik - Kyle constantly pushes & challenges assumptions to arrive at opportunities or actions clearly supported by data. I always learn something tuning into the show.
Best Data Science Podcast!
This is my favorite data science podcast! Kyle interviews such interesting guests and covers the coolest papers. This podcast has covered multiple methods that I have applied to problems as a practicing data scientist. I’m always excited when a new episode comes out!