Data Science Conversations

Damien Deighan and Philipp Diesinger

Welcome to the Data Science Conversations Podcast hosted by Damien Deighan and Dr Philipp Diesinger. We bring you interesting conversations with the world’s leading Academics working on cutting edge topics with potential for real world impact. We explore how their latest research in Data Science and AI could scale into broader industry applications, so you can expand your knowledge and grow your career. Every 4 or 5 episodes we will feature an industry trailblazer from a strong academic background who has applied research effectively in the real world. Podcast Website: www.datascienceconversations.com

  1. -4 J

    How AI is Transforming Data Analytics and Visualisation in the Enterprise

    Chris Parmer (Chief Product Officer & Co-Founder, Plotly) and Domenic Ravita (VP of Marketing, Plotly) discuss the evolution of AI-powered data analytics and how natural language interfaces are democratizing advanced analytics. Key Topics Discussed AI's Market Category Convergence Domenic describes how AI is collapsing traditional boundaries between business intelligence tools (Power BI, Tableau), data science platforms, and AI coding tools, creating a quantum leap similar to the drag-and-drop revolution 20 years ago.The 30/70 Engineering Reality Chris reveals that LLMs represent only 30% of AI analytics products, with 70% being sophisticated tooling, error correction loops, and multi-agent systems. Raw LLM output succeeds only one-third of the time without extensive supporting infrastructure.Code-First AI Architecture Plotly's approach generates Python code rather than having AI directly process data, creating more rigorous analytics. The system generates 2,000-5,000 lines of code in under two minutes through parallel processing while maintaining 90%+ accuracy.Natural Language as Universal Equalizer Discussion of how natural language interfaces eliminate the learning curves of different analytics tools (Salesforce, Tableau, Google Analytics), potentially democratizing data visualization across organizations by providing a common interface.Vibe Analysis Concept Introduction of "vibe analysis" - the data equivalent of "vibe coding" - enabling fluid, rapid data exploration that keeps analysts in flow states through natural language interactions with AI-powered tools.Transparency and Trust Building Exploration of building user trust through auto-generated specifications in natural language, transparent logging interfaces, and making underlying code assumptions visible and adjustable to prevent misleading results.Human-AI Collaboration Balance Chris emphasizes that while AI accelerates visualization creation and data exploration, human interpretation remains essential for generating insights. The risk lies in systems that attempt to "skip to the finish" with fully automated decision-making.Infrastructure Misconceptions Domenic predicts people will wrongly assume AI analytics requires extensive data warehouses and semantic layers, when effective analysis can work with standard databases and file formats, making advanced analytics more accessible than many realize.

    1 h 11 min
  2. 25/11/2024

    Maximising the Impact of Your Data & AI Consulting Projects

    In our latest episode of the Data Science Conversations Podcast, we spoke with Christoph Sporleder, Managing Partner at Rewire, about the evolving role of consulting in the data and AI space. This conversation is a must listen for anyone dealing with the challenges of integrating AI into business processes or considering an AI project with an external consulting firm. Christoph draws from decades of experience, offering practical advice and actionable insights for organizations and practitioners alike. Key Topics Discussed 1. Evolution of Data and Cloud Computing The shift from local computing to cloud technologies, enabling broader data integration and advanced analytics, with the rise of IoT and machine data. 2. Data Management Challenges Discussion on the evolution from data warehouses to data lakes and the emerging concept of data mesh for better governance and scalability. 3. Importance of Strategy in AI Why a clear strategy is crucial for AI adoption, including aligning organizational leadership and identifying impactful use cases. 4. Sectoral Adoption of Data and AI Differences in adoption across sectors, with early adopters in finance and insurance versus later adoption in manufacturing and infrastructure. 5. Consulting Models and Engagement Insights into consulting engagement types, including strategy consulting, system integration, and body leasing, and their respective challenges and benefits. 6. Challenges in AI Implementation Common pitfalls in AI projects, such as misalignment with business goals, inadequate infrastructure planning, and siloed lighthouse initiatives. 7. Leadership’s Role in AI Success The critical need for senior leadership commitment to drive AI adoption, ensure process integration, and manage organizational change. 8. Effective Collaboration with Consultants Best practices for successful partnerships with consultants, including aligning on objectives, managing personnel transitions, and setting clear engagement expectations. 9. Future Trends in Data and AI Emerging trends like componentized AI architectures, Gen AI integration, and the growing focus on embedding AI within business processes. 10. Tips for Managing Long-Term Projects Strategies for handling staff rotations and maintaining project continuity in consulting engagements, emphasizing planning and communication.

    47 min
  3. 29/08/2024

    The Evolution of GenAI: From GANs to Multi-Agent Systems

    Early Interest in Generative AI Martin's initial exposure to Generative AI in 2016 through a conference talk in Milano, Italy, and his early work with Generative Adversarial Networks (GANs). Development of GANs and Early Language Models since 2016 The evolution of Generative AI from visual content generation to text generation with models like Google's Bard and the increasing popularity of GANs in 2018. Launch of GenerativeAI.net and Online Course Martin's creation of GenerativeAI.net and an online course, which gained traction after being promoted on platforms like Reddit and Hacker News. Defining Generative AI Martin’s explanation of Generative AI as a technology focused on generating content, contrasting it with Discriminative AI, which focuses on classification and selection. Evolution of GenAI Technologies The shift from LSTM models to Transformer models, highlighting key developments like the "Attention Is All You Need" paper and the impact of Transformer architecture on language models. Impact of Computing Power on GenAI The role of increasing computing power and larger datasets in improving the capabilities of Generative AI Generative AI in Business Applications Martin’s insights into the real-world applications of GenAI, including customer service automation, marketing, and software development. Retrieval Augmented Generation (RAG) Architecture The use of RAG architecture in enterprise AI applications, where documents are chunked and queried to provide accurate and relevant responses using large language models. Technological Drivers of GenAI The advancements in chip design, including Nvidia’s focus on GPU improvements and the emergence of new processing unit architectures like the LPU. Small vs. Large Language Models A comparison between small and large language models, discussing their relative efficiency, cost, and performance, especially in specific use cases. Challenges in Implementing GenAI Systems Common challenges faced in deploying GenAI systems, including the costs associated with training and fine-tuning large language models and the importance of clean data. Measuring GenAI Performance Martin’s explanation of the complexities in measuring the performance of GenAI systems, including the use of the Hallucination Leaderboard for evaluating language models. Emerging Trends in GenAI Discussion of future trends such as the rise of multi-agent frameworks, the potential for AI-driven humanoid robots, and the path towards Artificial General Intelligence (AGI).

    43 min
  4. 24/07/2024

    Future AI Trends: Strategy, Hardware and AI Security at Intel

    In this episode, we sit down with Steve Orrin, Federal Chief Technology Officer at Intel Corporation. Steve shares his extensive experience and insights on the transformative power of AI and its parallels with past technological revolutions. He discusses Intel’s pioneering role in enabling these shifts through innovations in microprocessors, wireless connectivity, and more. Steve highlights the pervasive role of AI in various industries and everyday technology, emphasizing the importance of a heterogeneous computing architecture to support diverse AI environments. He talks about the challenges of operationalizing AI, ensuring real-world reliability, and the critical need for robust AI security. Confidential computing emerges as a key solution for protecting AI workloads across different platforms. The episode also explores Intel’s strategic tools like oneAPI and OpenVINO, which streamline AI development and deployment. This episode is a must-listen for anyone interested in the evolving landscape of AI and its real-world applications. Intel's Legacy and Technological Revolutions Historical parallels between past tech revolutions (PC era, internet era) and current AI era.Intel's contributions to major technological shifts, including the development of wireless technology, USB, and cloud computing. AI's Current and Future Landscape AI's pervasive role in everyday technology and various industries.Importance of computing hardware in facilitating AI advancements.AI's integration across different environments: cloud, network, edge, and personal devices. Intel's Approach to AI Focus on heterogeneous computing architectures for diverse AI needs.Development of software tools like oneAPI and OpenVINO to enable cross-platform AI development. Challenges and Solutions in AI Deployment Scaling AI from lab experiments to real-world applications.Ensuring AI security and trustworthiness through transparency and lifecycle management.Addressing biases in AI datasets and continuous monitoring for maintaining AI integrity. AI Security Concerns Protection of AI models and data through hardware security measures like confidential computing.Importance of data privacy and regulatory compliance in AI deployments.Emerging threats such as AI model poisoning, prompt injection attacks, and adversarial attacks. Innovations in AI Hardware and Software Confidential computing as a critical technology for securing AI.Research into using AI for chip layout optimization and process improvements in various industries.Future trends in AI applications, including generative AI for fault detection and process optimization. Collaboration and Standards in AI Security Intel's involvement in developing industry standards and collaborating with competitors and other stakeholders.The role of industry forums and standards bodies like NIST in advancing AI security. Advice for Aspiring AI Security Professionals Importance of hands-on experience with AI technologies.Networking and collaboration with peers and industry experts.Staying informed through industry news, conferences, and educational resources. Exciting Developments in AI Fusion of multiple AI applications for complex problem-solving.Advancements in AI hardware, such as AI PCs and edge devices.Potential transformative impacts of AI on everyday life and business operations.

    1 h 3 min
  5. 06/06/2024

    Enhancing GenAI with Knowledge Graphs: A Deep Dive with Kirk Marple

    In this episode we talk to Kirk Marple about the power of Knowledge Graphs when combined with GenAI models.  Kirk explained the growing relevance of knowledge graphs in the AI era, the practical applications, their integration with LLMs, and the future potential of Graph RAG. Kirk Marple a veteran of Microsoft and General Motors, Kirk has spent the last 30 years in software development and data leadership roles. He also successfully exited the first startup he founded, RadiantGrid, acquired by Wohler Technologies. Now, as the technical founder and CEO of Graphlit, Kirk and his team are streamlining the development of vertical AI apps with their end-to-end, cloud based offering that ingests unstructured data and leverages retrieval augmented generation to improve accuracy, domain specificity, adaptability, and context understanding – all while expediting development. Episode Summary - Introduction to Knowledge Graphs:Knowledge graphs extract relationships between entities like people, places, and things, facilitating efficient information retrieval.They represent intricate interactions and interrelationships, enabling users to "walk the graph" and uncover deeper insights.Importance in the AI Era:Knowledge graphs enhance data retrieval and filtering, crucial for feeding accurate data into large language models (LLMs) and multimodal models.They provide an additional axis for information retrieval, complementing vector search.Industry Use Cases:Commonly used in customer data platforms and CRM models to map relationships within and between companies.Knowledge graphs can convert complex datasets into structured, easily queryable formats.Challenges and Limitations:Familiarity with graph databases and the ETL process for graph data integration is still developing.Graph structures are less common and more complex than traditional relational models.Integrating Knowledge Graphs with LLMs:Knowledge graphs enrich data integration and semantic understanding, adding context to text retrieved by LLMs.They can help reduce hallucinations in LLMs by grounding responses with more accurate and comprehensive context.Graph RAG (Retrieval Augmented Generation):Combines knowledge graphs with RAG to provide additional context for LLM-generated responses.Allows retrieval of data not directly cited in the text, enhancing the breadth of information available for queries.Scalability and Efficiency:Effective graph database architectures can handle large-scale graph data efficiently.Graph RAG requires a robust ingestion pipeline and careful management of data freshness and retrieval processes.Future Developments:Growing interest and implementation of knowledge graphs and Graph RAG in various industries.Potential for new tools and standardization efforts to make these technologies more accessible and effective.Graphlit: Simplifying Knowledge Graphs:The platform focuses on simplifying the creation and use of knowledge graphs for developers.Provides APIs for easy integration, supporting domain-specific vertical AI applications.Offers a unified pipeline for data ingestion, extraction, and knowledge graph construction.Open Source and Community Contributions:Recommendations for...

    45 min

À propos

Welcome to the Data Science Conversations Podcast hosted by Damien Deighan and Dr Philipp Diesinger. We bring you interesting conversations with the world’s leading Academics working on cutting edge topics with potential for real world impact. We explore how their latest research in Data Science and AI could scale into broader industry applications, so you can expand your knowledge and grow your career. Every 4 or 5 episodes we will feature an industry trailblazer from a strong academic background who has applied research effectively in the real world. Podcast Website: www.datascienceconversations.com