Episode 14: Steam Engines, 100 Out of 100 & Can Me Mam Use This? 🍺 Week 14. Veterans now, apparently. Kieron is off to pootle on the Thames on the beige boat of joy after this. Neil is enjoying the last few days of summer in the Lake District before the cold winds of June blow through. The pantomime horse of a podcast rolls on. The steam engine analogy — and why it changes everything 🏭 Kieron opens with a cracking insight from behavioural scientist Rory Sutherland of Ogilvy. When electricity replaced steam power, early adopters simply swapped their giant steam engine for a giant electric motor — and changed nothing else. The efficiency gains were marginal. The real prize only arrived when someone realised electric motors could be miniaturised and built into each individual machine, allowing the entire factory layout to be redesigned from scratch. The assembly line followed. The analogy for AI is exact: most organisations are still swapping the steam engine. They're bolting AI onto existing workflows and getting 10 minutes here and there. The transformational gains come when you rethink the factory entirely. Not many organisations are there yet — but the ones that get there first will be very hard to catch. AI curious, AI opportunistic, AI first 🎯 Neil's three categories of organisation, drawn from Gartner. Most are curious — trying one thing, dipping a toe in, not yet convinced. Some are opportunistic — using AI where it obviously helps. Very few are AI first — genuinely rethinking how they operate with AI at the centre. The gap between AI curious and AI first is widening every week. And the AI deniers? Still out there. One of them is close to Neil. The doghouse awaits. Moving to GPT 5.4 — the nuances nobody warns you about 🔧 Leading AI has been migrating customers to GPT 5.4 mini — cheaper on tokens, better at reasoning, faster. But the move from 4.1 hasn't been frictionless. The same prompts behave differently. Tiny changes in how a model interprets an instruction can cause it to pull slightly different data, produce slightly different structures, introduce subtle errors that are genuinely hard to spot. Kieron has spent most of the week in the weeds of it. The quality of the writing and reasoning is better — but getting there takes real expertise, meticulous testing, and a lot of prompt rewriting. This is exactly why "our IT team can build this" is a dangerous sentence. Prompt engineering is so 2025 — Nate B. Jones 💬 Neil references Nate B. Jones' latest Substack: in 2025, you prompted AI like a smart intern — detailed, structured, prescriptive. In 2026, with the newer reasoning models, treat it like a strategic advisor. Give it your hypothesis, your context, your data — and ask it to challenge you, push back, and find things you haven't thought of. Kieron accidentally proved this the day before: his pricing strategy prompt included "I don't know what I don't know, come back with other ideas" — and the output was brilliant. Neil's verdict: you were already doing 2026 prompting without knowing it. 100 out of 100 🏆 Neil's BidWriter story of the week. A customer used KnowledgeFlow BidWriter to produce a first draft, then used the built-in evaluation scoring to identify weak sections. They refined the weak areas, ran it back through evaluation, consistently hit 5 out of 5 for every question — and then the actual bid evaluation team came back with 100 out of 100. A perfect score. On a competitive tender. Neil has never heard of it before. Neither have we. Healthcare and the move from single tool to platform 🏥 The public sector customer conversations are shifting. It's no longer "can I have Policy Buddy?" It's "how do I give a whole cohort of people — across primary care, secondary care, physiotherapy, any healthcare provider — consistent knowledge and consistent messaging?" Linked to the work Leading AI did with North East Lincolnshire and the Care Plus Group on multilingual cancer information. The private sector version: how does an organisation become the trusted hub of knowledge for their customers — explainable, reliable, verifiable? Context layer. Data in order. Answers that can be traced back to source. That's the differentiator. The future of education assessment 🎓 Kieron thinks big. AI detectors don't work. Handing in written work is already mistrusted. But nobody's talking about the real prize: AI can assess on everything, over a whole career — every essay, every contribution, every presentation, every rugby match. A data file on each individual that tells employers far more than a grade in an exam taken on one morning when they weren't feeling great. No more failing people who happen not to suit academic-style assessment. There are vested interests in the way. But it needn't be like this anymore. "Can me mam use this?" — Chris Quickfall from Cognassist 🏆 Kieron gives a well-deserved shoutout to Chris, CEO of Cognassist, who was speaking at the FE Tech conference this week about the upcoming SEND white paper and the growing neurodiversity challenge in colleges. His product design test: his mum is a teacher, not great with technology. If she can use it, they ship it. Four words. Best product design philosophy of the year. Kieron is pootling on the Thames. If you see the slow beige boat of joy, give him a wave. Two mates. A bar. Thirty years of business between them. And all they want to talk about is AI. Pull up a stool — we'll get the beers in. 🍺