The Data Science Podcast with Fexingo: Analytics, Machine Learning, and Data-Driven Conversations

Fexingo

Lucas and Luna sit at a data-science workstation, two thin laptops open to scatter plots and clustering visualizations, and ask: what can we actually learn from the numbers? Each episode of The Data Science Podcast with Fexingo is a grounded, specific conversation about a single analytics problem or machine-learning method — from regularization in regression to the bias-variance trade-off in random forests. Lucas leads with a journalistic eye for how models are built and tested in the real world, citing actual case studies like how Netflix used matrix factorization for recommendations or how healthcare researchers apply survival analysis to clinical trials. Luna keeps the discussion honest, asking about data quality, feature engineering pitfalls, and whether a model’s accuracy actually translates to business value. They never resort to buzzwords: instead, they walk through the workflow from data collection to deployment, discussing trade-offs like interpretability versus performance. The show serves data scientists, analysts, and engineers who want to stay sharp on methods without the hype. Listeners walk away with a clearer understanding of why one algorithm beats another on a given dataset, and what that means for their own projects. Can a neural network ever be truly explainable? And if not, should we trust it anyway? #DataScience #MachineLearning #Analytics #DataEngineering #Statistics #Python #RStats #DeepLearning #AI #BigData #DataVisualization #PredictiveModeling #CausalInference #DataQuality #FeatureEngineering #Business #FexingoBusiness #BusinessPodcast #Technology Keep every episode free: buymeacoffee.com/fexingo

  1. 2 days ago

    How Data Scientists Use Neural Radiance Fields for 3D Reconstruction

    Lucas and Luna dive into Neural Radiance Fields (NeRFs), a technique that has reshaped 3D reconstruction from 2D images. They walk through how NeRFs work at a high level—converting sparse photographs into continuous volumetric scene representations—and why this matters for industries like autonomous driving, cultural heritage preservation, and virtual production. The episode anchors on a concrete example: how the Google Research team originally trained a NeRF on 100 images of a single scene to synthesize novel views with photorealistic quality, and how recent advances like Instant NGP have cut training time from hours to seconds. Lucas explains the key algorithmic steps: ray marching through a neural network that outputs color and density per point, then volumetric rendering to produce a pixel value. Luna questions where the bottleneck remains (data capture, not compute) and probes the real-world trade-off between quality and speed. The conversation stays grounded in tools and techniques data scientists actually use—no math beyond a brief mention of positional encoding—and closes by asking what happens when NeRFs meet generative AI for full scene editing. #NeuralRadianceFields #NeRF #3DReconstruction #ComputerVision #DeepLearning #InstantNGP #VolumetricRendering #RayMarching #GoogleResearch #PositionalEncoding #AutonomousDriving #VirtualProduction #CulturalHeritage #DataScience #Technology #FexingoBusiness #BusinessPodcast #MachineLearning Keep every episode free: buymeacoffee.com/fexingo

    11 min
  2. 3 days ago

    How Data Scientists Use Diffusion Models for Image Generation

    In this episode of The Data Science Podcast, Lucas and Luna explore how data scientists are using diffusion models — the technology behind tools like DALL-E and Stable Diffusion — for image generation. They break down the core idea of gradually denoising random pixels into coherent images, discuss training and inference costs, and contrast diffusion models with GANs and autoregressive models. Using a concrete example from a mid-size e-commerce company that used a fine-tuned diffusion model to generate product images in underrepresented categories, they walk through the practical pipeline: dataset preparation, conditioning on text prompts, and handling hallucination artifacts. Lucas explains why diffusion models have become the dominant paradigm in generative image AI since 2022, and Luna questions whether the compute cost will limit adoption for smaller teams. They also touch on ethical considerations around deepfakes and copyright. The episode is grounded in real numbers: training a latent diffusion model from scratch can cost upwards of $600,000 in compute, but fine-tuning an existing open-source model can be done for under $5,000. #DiffusionModels #ImageGeneration #GenerativeAI #DeepLearning #StableDiffusion #DALLE #ComputerVision #MachineLearning #Technology #DataScience #AIEthics #ComputeCost #FineTuning #TextToImage #DenoisingDiffusion #LatentDiffusion #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo

    9 min

About

Lucas and Luna sit at a data-science workstation, two thin laptops open to scatter plots and clustering visualizations, and ask: what can we actually learn from the numbers? Each episode of The Data Science Podcast with Fexingo is a grounded, specific conversation about a single analytics problem or machine-learning method — from regularization in regression to the bias-variance trade-off in random forests. Lucas leads with a journalistic eye for how models are built and tested in the real world, citing actual case studies like how Netflix used matrix factorization for recommendations or how healthcare researchers apply survival analysis to clinical trials. Luna keeps the discussion honest, asking about data quality, feature engineering pitfalls, and whether a model’s accuracy actually translates to business value. They never resort to buzzwords: instead, they walk through the workflow from data collection to deployment, discussing trade-offs like interpretability versus performance. The show serves data scientists, analysts, and engineers who want to stay sharp on methods without the hype. Listeners walk away with a clearer understanding of why one algorithm beats another on a given dataset, and what that means for their own projects. Can a neural network ever be truly explainable? And if not, should we trust it anyway? #DataScience #MachineLearning #Analytics #DataEngineering #Statistics #Python #RStats #DeepLearning #AI #BigData #DataVisualization #PredictiveModeling #CausalInference #DataQuality #FeatureEngineering #Business #FexingoBusiness #BusinessPodcast #Technology Keep every episode free: buymeacoffee.com/fexingo