Automatic

Eric Lamanna

Podcast for Automatic.co and LLM.co, the AI automation specialists.

  1. 2h ago

    Inside the Firewall: How Local LLMs Are Outsmarting Fraudsters

    Fraud has evolved from clumsy phishing emails into sophisticated, syndicate-driven operations: synthetic identities that build real credit histories over months, deepfaked executive voices authorizing wire transfers, and bot networks sharing exploits like open-source code. The enterprises winning this fight have stopped relying on brittle rule engines and started running large language models entirely within their own walls. This episode unpacks the strategy, the architecture, and the governance challenges involved — drawing on this deep-dive on enterprise local LLM fraud detection. Here's what the episode covers: Why rule engines are losing: Thousands of hand-crafted conditions create a system where one uncovered gap lets attackers through — while generating enough false positives to bury analyst teams and frustrate legitimate customers at the same time.The case for "local": Keeping a model entirely inside a private data center or trusted cloud means no data leaves the firewall, every parameter is auditable, and compliance-heavy industries can actually move a pilot into production.Fine-tuning as a competitive moat: Training on years of proprietary transaction logs — branch IDs, loyalty codes, campaign tags — transforms a general-purpose model into a domain expert that recognizes the precise texture of legitimate commerce and flags subtle deviations at inference speed.The infrastructure reality: Low-latency checkout flows demand quantized weights, token pruning, and distilled networks; global deployments require regional shards and smart routing to balance speed, data sovereignty, and cost simultaneously.Human-AI collaboration, done right: Models that explain alerts in plain narrative language — not just a risk score — build analyst trust, create actionable feedback loops, and enable overnight retraining that keeps pace with shifting fraud patterns (concept drift).Governance that holds up to auditors: Every model checkpoint carries a commit hash, every inference is written to an immutable ledger, fairness testing runs across demographics, and post-incident reviews treat every miss as structured training data rather than something to quietly patch.The episode closes with an honest look at common failure modes — overfitting to historical attack patterns, data science teams optimizing in isolation from fraud operations, and the temptation to treat the model as an infallible oracle — and a phased rollout roadmap that prioritizes shadow scoring and kill-switch safety before any organization-wide expansion. For more on why domain context is the make-or-break factor in enterprise AI, check out the earlier episode Why Generative AI Fails Without Domain Context — And How to Fix It. LLM

    9 min
  2. 20h ago

    Agentic AI in Finance: The Shift From Tools to Autonomous Execution

    The conversation around AI in finance has shifted — from productivity gains and copilot tools to something more structurally disruptive: autonomous agents that plan, execute, and close the loop on complex workflows without continuous human direction. This episode of Automatic unpacks the research behind that shift, drawing on this in-depth analysis of agentic AI in finance and business services to explain why the timing, the technology, and the economic pressure have finally converged. The episode covers the key forces reshaping financial work — from market sizing to deployment strategy — including: The scale of the opportunity: AI spend in financial services sits at roughly $35 billion today and is projected to reach $97 billion by 2027 across banking, insurance, capital markets, and payments — with the broader market exceeding $190 billion by 2030.Why now: Three things aligned simultaneously — large language models crossed into multi-step reasoning, enterprise systems became genuinely interconnected via APIs and cloud infrastructure, and finance teams faced mounting pressure to do more with less.The three-phase evolution: From SaaS workflows (humans as operators), to AI-assisted copilots (humans as directors), to agentic systems (humans as overseers) — and why that final transition carries the most disruptive potential.Where agents are landing first: High-frequency, rules-heavy workflows like financial close and reconciliation, underwriting support, KYC and onboarding, claims processing, regulatory reporting, and FP&A — operational core functions, not experimental edge cases.The BPO and outsourcing reckoning: Business process outsourcing firms that traditionally scaled by adding headcount are now competing against AI-native workflows that promise lower cost per transaction and higher consistency — reshaping how contracts are written and services are priced.The real friction points: Model performance isn't the bottleneck — integration depth and trust infrastructure are. Audit logs, explainability, and human approval gates aren't optional features in regulated environments; they're what makes deployment possible at all.The strategic takeaway is practical: start narrow, automate one high-frequency workflow end to end, prioritize integrations before model optimization, and build for oversight rather than full replacement. The World Economic Forum estimates 32–39% of financial services work has high full-automation potential, with another 34–37% highly suited for augmentation — meaning the majority of the sector's work is already within AI's reach on current planning horizons. More from the show: if you want to understand why these systems often stumble in real-world deployments, the episode Why Generative AI Fails Without Domain Context — And How to Fix It is essential context. Automatic

    9 min
  3. 3d ago

    Why Generative AI Fails Without Domain Context — And How to Fix It

    Generative AI can sound authoritative on almost any topic — until it quietly invents a regulatory policy, misapplies a technical term, or misses a safety-critical distinction that any seasoned domain expert would catch on instinct. This episode of Automatic examines why that failure mode is so persistent, why it's so easy to overlook until something breaks, and what teams deploying AI in high-stakes environments can do about it. The conversation draws on this deep-dive article on grounding generative AI in domain knowledge, which maps the problem with unusual precision. The episode covers the core mechanics behind domain-context failures and walks through a four-part framework for closing the gap between what a general-purpose model knows and what a specialized environment actually demands: Surface learning vs. real expertise: Large language models master statistical correlations, not causal reasoning — a distinction that becomes dangerous when terminology is precise and consequences are real.The vocabulary problem: Without domain grounding, models treat specialized terms as interchangeable, choosing meanings by probability rather than by what the field actually requires.Why context windows aren't enough: Stuffing reference documents into a prompt helps, but the model assigns roughly equal authority to a peer-reviewed standard and a casual forum post — blending them in ways domain experts immediately spot as wrong.Curation over accumulation: A lean, carefully selected corpus of authoritative sources outperforms a massive general dataset in output quality, retrieval speed, and user trust.Capturing unspoken assumptions: The most dangerous knowledge gaps live in things every specialist knows but nobody ever wrote down — and structured knowledge-capture exercises are how those implicit rules get encoded into the system.The context repair flywheel: Keeping domain experts in a continuous feedback loop — not just at launch — turns the model into a fast-learning collaborator and drives hallucination rates down over time in measurable, operational terms.The broader argument is that generative AI isn't failing in specialized domains because the technology is broken — it's failing because general-purpose tools are being dropped into expert environments without the infrastructure to bridge the gap. That infrastructure isn't exotic or prohibitively expensive; it requires curation, deliberate knowledge capture, adaptive guardrails, and genuine expert engagement. More from the show: if this episode resonates, Agentic AI in Law: How Smart Automation Is Reshaping Legal Work explores how similar challenges play out in one of the most demanding domain-specific environments around. LLM

    8 min
  4. 4d ago

    Agentic AI in Law: How Smart Automation Is Reshaping Legal Work

    Legal work has never been short on complexity, volume, or consequence — but the emergence of agentic AI is forcing the profession to rethink how that work gets done. Unlike the passive, prompt-and-respond tools that left many lawyers unimpressed, agentic AI takes initiative: it identifies tasks, makes decisions, and executes — without hand-holding at every step. This episode of Automatic explores what agentic AI means for modern law firms, where it's delivering the most measurable impact, and what separates the firms embracing it from those still running on legacy workflows. Here's what the episode covers: Agentic vs. assistive AI: Why the distinction matters in a high-stakes legal environment — and why most tools lawyers have tried so far don't qualify as truly agentic.Document review and contract analysis: How AI systems ingest thousands of pages, surface compliance gaps, flag clause inconsistencies, and deliver results faster and more accurately than a team of associates — without fatigue.Operational automation: The quiet time drain of billing, time-tracking, scheduling, and routine correspondence — and how agentic tools are handling all of it without manual input.Predictive litigation analytics: Moving beyond gut instinct, AI can now analyze millions of case outcomes, judge behavior patterns, and opposing counsel performance to deliver data-backed probability assessments for litigation strategy.NLP and legal research: How Natural Language Processing understands legal intent — not just keywords — cutting multi-day research projects down to minutes with more comprehensive, less biased results.Consistency and risk reduction: Enforcing standardization across every document, filing, and client communication — protecting firms from the subtle errors that individual variation and human fatigue introduce.The episode is candid about what agentic AI won't do: replace lawyers. What it will do is absorb the work that was never really a good use of a lawyer's time in the first place, freeing legal professionals to focus on judgment, client relationships, and the nuanced advocacy that no system can replicate. More from the show: if you're interested in how AI handles complex documents more broadly, check out the episode From Documents to Decisions: How BYOD-AI Unlocks Your PDF Intelligence. Automatic

    8 min
  5. 5d ago

    From Documents to Decisions: How BYOD-AI Unlocks Your PDF Intelligence

    Most organizations have already done the hard work of creating their knowledge base — handbooks, contracts, compliance manuals, clinical records, safety protocols. The problem isn't that the information doesn't exist; it's that it's buried in PDFs nobody has time to search. This episode of Automatic explores how the Bring Your Own Data AI approach to PDF intelligence is closing that gap between what an organization knows and what its people can actually access in the moment they need it. The episode walks through the full picture of BYOD-AI — what it is, how it works under the hood, why PDFs have historically been so difficult for AI systems to handle, and what it means for security and governance when employees are already reaching for consumer AI tools to fill the void. Key points covered include: BYOD-AI defined: "Bring Your Own Data AI" means grounding a private or hybrid AI system in your organization's own documents — not relying on a generic model trained on the public internet.The technical pipeline: How document ingestion, OCR preprocessing, semantic chunking, and vector embeddings combine to enable concept-based search rather than simple keyword matching.Retrieval-Augmented Generation (RAG): The architecture that keeps AI answers grounded in actual source material, dramatically reducing the risk of the system fabricating responses.Shadow AI and security governance: Why banning AI isn't the answer, and how enterprise BYOD-AI — with role-based access controls, encryption, and audit trails — gives employees a safe on-ramp instead of leaving them to improvise with unmanaged tools.Industry use cases: From legal and compliance teams querying stored contracts, to healthcare professionals surfacing clinical guidelines, to operations teams accessing facility-specific safety protocols — the applications span virtually every sector.The cultural upside: When people can find answers quickly and confidently, they take fewer risky shortcuts — meaning a well-implemented system changes not just document access, but organizational behavior around information.The episode anchors many of these ideas in a concrete scenario — a multi-location restaurant group managing a high-pressure game day — to illustrate how the difference between "the answer is somewhere in a binder" and "the answer is here in two seconds" can be the difference between smooth operations and a genuine crisis. The throughline is straightforward: the data most organizations need already exists. BYOD-AI is the infrastructure that makes it usable. For more from the show, check out the episode Agentic AI in Healthcare: From Assistant to Operator. LLM

    8 min
  6. Jun 6

    Agentic AI in Healthcare: From Assistant to Operator

    Healthcare's administrative burden isn't just a frustration — it consumes somewhere between a quarter and a third of every dollar spent in the U.S. system. This episode of Automatic examines how agentic AI is stepping into that gap, drawing on Automatic's deep-dive on agentic AI for healthcare and life sciences to map where automation is gaining real traction, what's driving the shift right now, and what it means for the organizations trying to get ahead of it. The episode traces a clear evolution — from digitized forms, to AI copilots that assist humans, to autonomous agents that own workflows end to end — and makes the case that we're now entering that third stage. Here's what the discussion covers: The scale of the opportunity: The global healthcare AI market is projected to reach $187 billion by 2030, with McKinsey estimating $200–360 billion in annual value unlockable through automation — context that reframes this as a structural economic shift, not a tech trend.Why now: Three forces converged simultaneously — large language models crossing a clinical reasoning threshold, a decade of healthcare digitization (including FHIR interoperability standards) finally paying off, and a worsening labor shortage projected to hit 124,000 physicians by 2034.Clinical workflow automation: Tools that move beyond note drafting to managing the entire downstream process — coding suggestions, task routing, and approval-ready outputs — representing 62% of the generative AI in healthcare market by clinical application share.Administrative and operational ROI: Prior authorization, revenue cycle management, and denial handling are where buyers are putting money today — administrative process optimization holds the largest single function segment at nearly 33% — because the pain is measurable and the payback window is 12–24 months.Life sciences as a proving ground: Pharmaceutical and biotech workflows — protocol drafting, patient recruitment, regulatory documentation — are documentation-heavy and highly structured, making them among the fastest-growing areas for agentic deployment.What separates winners from also-rans: Integration depth beats model sophistication; trust, auditability, and compliance aren't obstacles to adoption — they're the price of entry in a regulated industry.The episode closes with a practical frame for healthcare leaders: transformation is already happening workflow by workflow, and the organizations pulling ahead aren't waiting for a perfect system — they're proving ROI on one broken process at a time. More from the show: if this episode's themes around AI taking on specialist knowledge work resonate, check out AI for HR: Private Talent Screening, Policy Parsing & Workforce Planning for a look at how agentic systems are reshaping another high-stakes, documentation-heavy domain. Automatic

    9 min
  7. Jun 5

    AI for HR: Private Talent Screening, Policy Parsing & Workforce Planning

    Human resources sits at the intersection of speed, fairness, and confidentiality — a combination that traditional software has never handled gracefully. This episode of Automatic draws on this deep-dive article on AI for HR talent screening, policy parsing, and workforce planning to explore how private, on-premise language model deployments are giving HR teams the leverage to work smarter across three mission-critical functions — without trading employee trust for efficiency. The episode walks through each domain in detail, examining both the immediate productivity gains and the longer-term strategic implications. Key topics covered include: Semantic resume screening: How language models move beyond keyword matching to evaluate contextual competencies, surface adaptable candidates, and apply consistent evaluation logic at any hour — complete with auditable decision trails that satisfy EEOC scrutiny.Bias detection and fairness governance: Why feeding historical hiring data into AI without stripping protected-class indicators can automate discrimination, and how responsible teams monitor outputs with continuous fairness dashboards and versioned retraining cycles.Candidate experience as brand signal: The way faster status updates, constructive rejection feedback, and richer interviewer prep summaries turn the screening funnel into a competitive differentiator rather than a liability.Policy management as a living system: How queryable policy knowledge bases let employees get cited, plain-English answers to HR questions instantly — while the model proactively flags regulatory conflicts before they become fines or litigation.Predictive workforce planning: Using aggregated behavioral signals — engagement scores, tenure patterns, promotion cadence — to surface flight risks early and enable supportive conversations rather than reactive exit interviews.Scenario planning for finance and leadership: How real-time headcount simulations replace week-long spreadsheet exercises, letting CFOs and boards model hiring freezes or expansion decisions during the meeting itself.Running through all three areas is a single architectural requirement: keeping sensitive personnel data — compensation records, medical leave details, performance reviews — behind the organization's own firewall. The episode argues that private deployment isn't just a legal safeguard; it's what makes employees trust the systems designed to support them, which in turn makes those systems more effective. More from the show: if AI memory and context management are on your radar, don't miss The Context Window Trap: Why Bigger AI Memory Isn't Always Better. LLM

    9 min
  8. Jun 4

    The Context Window Trap: Why Bigger AI Memory Isn't Always Better

    The race to build ever-larger AI context windows has produced some genuinely impressive numbers — but impressive specs don't always translate to better products. This episode of Automatic digs into a counterintuitive truth that's quietly tripping up engineering teams across the industry: stuffing more information into a model's context can actively hurt performance, and understanding why is critical for anyone shipping AI-powered features right now. The discussion draws on this in-depth look at AI context and retrieval strategy to unpack what's really going on beneath the surface of the context window arms race. Here's what the episode covers: The "lost in the middle" problem: Research from Stanford shows that language models reliably degrade in accuracy when the information they need is buried in the middle of a long context — recency and primacy bias are real, even at a million tokens.Why the whiteboard metaphor is wrong: A spotlight on a stage is a more accurate model for how attention works — more content on stage doesn't mean the model focuses better; it often means it focuses worse.The hidden costs of giant contexts: Beyond accuracy, large context windows carry real financial and latency penalties — making brute-force context stuffing slow, expensive, and fragile at production scale.Why retrieval-augmented generation (RAG) isn't optional: Mature AI teams are treating RAG pipelines as foundational infrastructure, not a future nice-to-have — feeding models a small, tightly scoped, high-relevance context instead of a flood of raw data.The new bottleneck is retrieval quality: Chunking strategy, embedding model freshness, metadata filtering, and hybrid search (dense vectors combined with sparse keyword search like BM25) all determine whether your system surfaces the right information — or confidently hands the model the wrong answer.Observability as a product advantage: Teams that build proper retrieval layers gain the ability to log, inspect, and tune what the model sees — turning a black box into a system they can actually improve over time.The central argument is clear and practical: the teams getting the most reliable results from AI right now aren't the ones pushing context limits to their maximum — they're the ones being disciplined about the minimum context a model actually needs to do its job well. Chasing spec sheets is a distraction; chasing outcomes is the work. Automatic

    7 min

About

Podcast for Automatic.co and LLM.co, the AI automation specialists.