The AI Concepts Podcast
The AI Concepts Podcast is my attempt to turn the complex world of artificial intelligence into bite-sized, easy-to-digest episodes. Imagine a space where you can pick any AI topic and immediately grasp it, like flipping through an Audio Lexicon - but even better! Using vivid analogies and storytelling, I guide you through intricate ideas, helping you create mental images that stick. Whether you’re a tech enthusiast, business leader, technologist or just curious, my episodes bridge the gap between cutting-edge AI and everyday understanding. Dive in and let your imagination bring these concepts to life!
The AI Concepts Podcast is my attempt to turn the complex world of artificial intelligence into bite-sized, easy-to-digest episodes. Imagine a space where you can pick any AI topic and immediately grasp it, like flipping through an Audio Lexicon - but even better! Using vivid analogies and storytelling, I guide you through intricate ideas, helping you create mental images that stick. Whether you’re a tech enthusiast, business leader, technologist or just curious, my episodes bridge the gap between cutting-edge AI and everyday understanding. Dive in and let your imagination bring these concepts to life!
Episodes
Wednesday Apr 29, 2026
Module 6: RAG | Chunking - Where You Cut Decides What Gets Found
Wednesday Apr 29, 2026
Wednesday Apr 29, 2026
This episode is about chunking, the quiet step in a RAG pipeline that decides whether your system retrieves the right answer or a confidently wrong one. It covers why the chunk is the real unit of retrieval, the tradeoff between context and precision, the main strategies teams use to split documents, and why testing your chunks against real questions matters more than picking the perfect size.
Monday Apr 27, 2026
Module 6: RAG | Data Ingestion - Before Your Documents Can Be Found
Monday Apr 27, 2026
Monday Apr 27, 2026
This episode is about the step that every RAG system depends on. Before meaning can be stored or retrieved, your raw documents have to become clean text. What goes wrong here breaks the entire pipeline in ways that are surprisingly hard to catch.
Monday Apr 27, 2026
Module 6: RAG | Vector Databases - Where That Meaning Gets Stored
Monday Apr 27, 2026
Monday Apr 27, 2026
This episode is about the infrastructure underneath every RAG system. It covers the purpose-built engine that stores all that meaning and searches millions of vectors in milliseconds, in a way no traditional database can. This is what makes retrieval fast enough to actually work in production.
Monday Apr 27, 2026
Module 6: RAG | Embeddings - Teaching Machines to Understand Meaning
Monday Apr 27, 2026
Monday Apr 27, 2026
This episode is about the layer of RAG that makes semantic search possible. It covers how machines turn language into math that clusters similar ideas together, so a question and its answer can find each other even when they share no words in common. Without this, RAG is just keyword search with extra steps.
Saturday Apr 25, 2026
Module 6: The RAG Pipeline - End to End
Saturday Apr 25, 2026
Saturday Apr 25, 2026
This episode maps out the full RAG pipeline end to end using one concrete scenario, a defense contractor building an AI assistant for fighter jet maintenance crews. It walks through both phases of the architecture, offline and online, following a real question all the way from a raw document to a grounded answer. It also covers why the architecture is modular and closes with the four failure modes that quietly break RAG systems in production.
Friday Apr 24, 2026
Module 6: What is RAG and Why it Exists
Friday Apr 24, 2026
Friday Apr 24, 2026
This episode kicks off Module 6 with RAG (Retrieval Augmented Generation), the #1 architecture every serious enterprise actually uses. Discover why regular LLMs hallucinate on your private data and high-stakes queries, and how RAG fixes it by forcing the model to retrieve real documents first.
Friday Apr 17, 2026
Module 5: Reasoning Models
Friday Apr 17, 2026
Friday Apr 17, 2026
This episode covers reasoning models, the shift from manually guiding a model's thinking to letting the model reason through complex problems on its own before responding. It explains the concept of test-time compute, why reasoning models take longer but perform dramatically better on hard tasks, and how they change the way you should prompt. It walks through when to reach for a reasoning model versus a standard one, and closes by framing the full prompt engineering toolkit in context, from few-shot examples through reasoning models.
Friday Apr 17, 2026
Module 5: Structured Output and the Language of Software
Friday Apr 17, 2026
Friday Apr 17, 2026
This episode covers structured output, how you get a model to respond in predictable, machine-readable formats like JSON instead of natural language paragraphs. It walks through three approaches, from simply asking in the prompt, to JSON mode, to schema-based constraints, and explains why each level adds more reliability. It uses real-world examples to show how structured output turns AI from a conversation partner into a software component that can feed databases, trigger workflows, and drive automation. It closes with practical tips for writing schemas and validating output in production.
Thursday Apr 16, 2026
Module 5: System Prompts and the Invisible Rules
Thursday Apr 16, 2026
Thursday Apr 16, 2026
This episode covers system prompts, the invisible instruction layer that shapes every model interaction before the user says a word. It explains the three-role message format, why the model is trained to treat system instructions as higher authority than user messages, and how persona prompting works by shifting which region of the training distribution the model samples from. It walks through the anatomy of a good system prompt and closes with what happens when system and user instructions conflict, including a preview of the prompt injection problem.
Thursday Apr 16, 2026
Module 5: Chain of Thought Prompting
Thursday Apr 16, 2026
Thursday Apr 16, 2026
This episode covers chain of thought prompting, how asking a model to show its reasoning makes it measurably better at complex tasks, and why that works at a mechanical level. It walks through manual and zero-shot chain of thought, then three advanced extensions: self-consistency, Tree of Thought, and step-back prompting. It closes with when chain of thought actually helps versus when it just adds overhead.




