<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>AI Daddy</title>
    <link>https://www.aidaddy.tech</link>
    <description>AI system design and interview prep for engineers moving into AI from frontend, backend, or data. RAG, agents, MCP, evaluation, and the production patterns you will meet on the job.</description>
    <atom:link href="https://www.aidaddy.tech/rss.xml" rel="self" type="application/rss+xml" />
    <language>en-us</language>
    <lastBuildDate>Fri, 29 May 2026 14:20:47 GMT</lastBuildDate>
    <item>
      <title>AI System Design Interview Question Bank</title>
      <link>https://www.aidaddy.tech/learn/00-interview-prep/01-question-bank</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/00-interview-prep/01-question-bank</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Interview Prep</category>
      <description>A topic-organized bank of 110+ AI system design interview questions with model answers, follow-ups, and signals strong candidates show. Updated through May 2026.</description>
    </item>
    <item>
      <title>LLM Internals</title>
      <link>https://www.aidaddy.tech/learn/01-foundations/01-llm-internals</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/01-foundations/01-llm-internals</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Foundations</category>
      <description>The architectural core of modern LLMs: transformers, MoE, attention math, RoPE, GQA, KV cache, and the inference-optimal scaling shift driving 2026 model design.</description>
    </item>
    <item>
      <title>Tokenization Deep Dive</title>
      <link>https://www.aidaddy.tech/learn/01-foundations/02-tokenization-deep-dive</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/01-foundations/02-tokenization-deep-dive</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Foundations</category>
      <description>Tokenization is the process of converting text into discrete units (tokens) that models can process. It directly impacts model capabilities, costs, and performance.</description>
    </item>
    <item>
      <title>Attention Mechanisms</title>
      <link>https://www.aidaddy.tech/learn/01-foundations/03-attention-mechanisms</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/01-foundations/03-attention-mechanisms</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Foundations</category>
      <description>Attention is the core innovation that enables transformers. This chapter covers the mathematical foundations, variants, and optimizations that are essential for system design and interviews.</description>
    </item>
    <item>
      <title>Transformer Architecture</title>
      <link>https://www.aidaddy.tech/learn/01-foundations/04-transformer-architecture</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/01-foundations/04-transformer-architecture</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Foundations</category>
      <description>This chapter provides a comprehensive view of the complete transformer architecture, bringing together the components from previous chapters into a unified understanding.</description>
    </item>
    <item>
      <title>Embeddings and Vector Spaces</title>
      <link>https://www.aidaddy.tech/learn/01-foundations/05-embeddings-and-vector-spaces</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/01-foundations/05-embeddings-and-vector-spaces</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Foundations</category>
      <description>Embeddings are dense vector representations of text that capture semantic meaning. They are foundational to RAG systems, semantic search, and many AI applications.</description>
    </item>
    <item>
      <title>Inference Pipeline</title>
      <link>https://www.aidaddy.tech/learn/01-foundations/06-inference-pipeline</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/01-foundations/06-inference-pipeline</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Foundations</category>
      <description>This chapter covers how LLMs generate text at inference time, the computational phases involved, and the key metrics for production serving.</description>
    </item>
    <item>
      <title>Model Taxonomy</title>
      <link>https://www.aidaddy.tech/learn/02-model-landscape/01-model-taxonomy</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/02-model-landscape/01-model-taxonomy</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Model Landscape</category>
      <description>This chapter provides a comprehensive guide to the model landscape as of **May 2026**, covering model families, capabilities, and selection criteria for production systems.</description>
    </item>
    <item>
      <title>Capability Assessment</title>
      <link>https://www.aidaddy.tech/learn/02-model-landscape/02-capability-assessment</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/02-model-landscape/02-capability-assessment</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Model Landscape</category>
      <description>This chapter covers how to evaluate and compare model capabilities for your specific use case. Generic benchmarks rarely tell the full story; this guide helps you conduct meaningful assessments.</description>
    </item>
    <item>
      <title>Pricing and Costs</title>
      <link>https://www.aidaddy.tech/learn/02-model-landscape/03-pricing-and-costs</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/02-model-landscape/03-pricing-and-costs</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Model Landscape</category>
      <description>Understanding the cost structure of LLM systems is essential for production planning. This chapter covers pricing models, cost optimization strategies, and total cost of ownership analysis.</description>
    </item>
    <item>
      <title>Prompt Optimization (DSPy)</title>
      <link>https://www.aidaddy.tech/learn/05-prompting-and-context/07-prompt-optimization-dspy</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/05-prompting-and-context/07-prompt-optimization-dspy</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Prompting &amp; Context</category>
      <description>Prompting has moved from the &quot;Hand-tuning&quot; era to the &quot;Programmatic&quot; era. **DSPy (Declarative Self-improving Language Programs)** is the de-facto standard for building robust LLM pipelines where prompts are optimized automatically by…</description>
    </item>
    <item>
      <title>Chunking Strategies</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/02-chunking-strategies</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/02-chunking-strategies</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Chunking is the process of splitting a document into discrete segments for retrieval. Production pipelines have moved beyond blind fixed-size splits to **structure-aware and semantic segments**, with newer techniques like late chunking…</description>
    </item>
    <item>
      <title>Tool Use and MCP</title>
      <link>https://www.aidaddy.tech/learn/07-agentic-systems/03-tool-use-and-mcp</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/07-agentic-systems/03-tool-use-and-mcp</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Agentic Systems</category>
      <description>Tools are the &quot;hands&quot; of an agent. The industry has standardized on the **Model Context Protocol (MCP)**, which replaces fragmented custom tool definitions with a unified, local-first communication layer. MCP has matured rapidly:…</description>
    </item>
    <item>
      <title>LangGraph Orchestration</title>
      <link>https://www.aidaddy.tech/learn/09-frameworks-and-tools/02-langgraph-orchestration</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/09-frameworks-and-tools/02-langgraph-orchestration</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Frameworks &amp; Tools</category>
      <description>LangGraph is the **de facto standard** for building stateful, multi-agent systems. It reached v1.0 in late 2025 and surpassed CrewAI in GitHub stars in early 2026 thanks to enterprise adoption of its graph-based runtime. Unlike simple…</description>
    </item>
    <item>
      <title>Claude Code: The Autonomous Coding Agent</title>
      <link>https://www.aidaddy.tech/learn/09-frameworks-and-tools/09-claude-code</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/09-frameworks-and-tools/09-claude-code</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Frameworks &amp; Tools</category>
      <description>Claude Code is Anthropic&apos;s **terminal-native autonomous coding agent**. Unlike IDE plugins that suggest completions, Claude Code acts as a full-stack software engineer: it reads your codebase, edits files, runs commands, executes tests,…</description>
    </item>
    <item>
      <title>OpenCoder: AI Coding Agents Landscape</title>
      <link>https://www.aidaddy.tech/learn/09-frameworks-and-tools/10-opencoderguide</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/09-frameworks-and-tools/10-opencoderguide</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Frameworks &amp; Tools</category>
      <description>The AI coding agent landscape has exploded. This guide covers open-weight coding models, agentic IDEs, open-source agents, and how to choose the right tool for your engineering workflow.</description>
    </item>
    <item>
      <title>Pydantic AI and Mastra: Typed Agent Frameworks (2026)</title>
      <link>https://www.aidaddy.tech/learn/09-frameworks-and-tools/11-pydantic-ai-and-mastra</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/09-frameworks-and-tools/11-pydantic-ai-and-mastra</guid>
      <pubDate>Fri, 29 May 2026 13:58:08 GMT</pubDate>
      <category>Frameworks &amp; Tools</category>
      <description>By May 2026 the agent framework debate has stopped being &quot;LangGraph or LlamaIndex.&quot; Two newer entrants now own meaningful production share for teams that prioritize type safety over breadth: **Pydantic AI** in the Python world and…</description>
    </item>
    <item>
      <title>Model Selection Guide</title>
      <link>https://www.aidaddy.tech/learn/02-model-landscape/04-model-selection-guide</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/02-model-landscape/04-model-selection-guide</guid>
      <pubDate>Fri, 29 May 2026 13:34:41 GMT</pubDate>
      <category>Model Landscape</category>
      <description>A practical framework for choosing the right LLM for your use case, considering capability, cost, latency, and operational factors.</description>
    </item>
    <item>
      <title>Frequently Asked Questions: AI Engineering, RAG, and Agents</title>
      <link>https://www.aidaddy.tech/learn/00-interview-prep/07-faq</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/00-interview-prep/07-faq</guid>
      <pubDate>Mon, 25 May 2026 00:15:53 GMT</pubDate>
      <category>Interview Prep</category>
      <description>Short, direct answers to the questions people ask most about modern AI system design. Each answer points to the chapter where the topic is covered in depth.</description>
    </item>
    <item>
      <title>KV Cache and Context Caching</title>
      <link>https://www.aidaddy.tech/learn/04-inference-optimization/02-kv-cache-and-context-caching</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/04-inference-optimization/02-kv-cache-and-context-caching</guid>
      <pubDate>Mon, 25 May 2026 00:15:53 GMT</pubDate>
      <category>Inference Optimization</category>
      <description>The KV Cache is the most significant memory consumer in long-context AI systems. Managing this cache effectively is the difference between a system that scales to 2M tokens and one that crashes at 10k.</description>
    </item>
    <item>
      <title>Cost Optimization Playbook</title>
      <link>https://www.aidaddy.tech/learn/04-inference-optimization/07-cost-optimization-playbook</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/04-inference-optimization/07-cost-optimization-playbook</guid>
      <pubDate>Mon, 25 May 2026 00:15:53 GMT</pubDate>
      <category>Inference Optimization</category>
      <description>AI costs are no longer &quot;magic.&quot; They are measurable, predictable, and highly optimizable. With API pricing down 30-60% over the past year, the cost lever is now mostly about *routing* and *caching*, not just picking a cheaper provider.…</description>
    </item>
    <item>
      <title>OpenClaw Deep Dive: The Open-Source Personal AI Agent</title>
      <link>https://www.aidaddy.tech/learn/17-tool-use-and-computer-agents/03-openclaw-deep-dive</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/17-tool-use-and-computer-agents/03-openclaw-deep-dive</guid>
      <pubDate>Mon, 25 May 2026 00:15:53 GMT</pubDate>
      <category>Tool Use &amp; Computer Agents</category>
      <description>OpenClaw is an **open-source, self-hosted personal AI agent** that executes tasks through LLMs using messaging platforms as its primary interface. You talk to it via WhatsApp, Telegram, Slack, Discord, or Signal, and it talks back --…</description>
    </item>
    <item>
      <title>Pretraining Basics</title>
      <link>https://www.aidaddy.tech/learn/03-training-and-adaptation/01-pretraining-basics</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/03-training-and-adaptation/01-pretraining-basics</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Training &amp; Adaptation</category>
      <description>Pretraining is the most computationally expensive phase of building an LLM, where a model learns general knowledge and language patterns from massive datasets.</description>
    </item>
    <item>
      <title>Fine-Tuning Strategies</title>
      <link>https://www.aidaddy.tech/learn/03-training-and-adaptation/02-fine-tuning-strategies</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/03-training-and-adaptation/02-fine-tuning-strategies</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Training &amp; Adaptation</category>
      <description>Fine-tuning adapts a pretrained model to specific tasks, domains, or styles. Today, fine-tuning is less about &quot;teaching facts&quot; and more about &quot;teaching format and behavior.&quot;</description>
    </item>
    <item>
      <title>LoRA, QLoRA, and PEFT</title>
      <link>https://www.aidaddy.tech/learn/03-training-and-adaptation/03-lora-qlora-peft</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/03-training-and-adaptation/03-lora-qlora-peft</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Training &amp; Adaptation</category>
      <description>Parameter-Efficient Fine-Tuning (PEFT) is the industry standard for adapting LLMs. This chapter covers the mechanics and advanced variants of LoRA and other PEFT methods.</description>
    </item>
    <item>
      <title>RLHF and DPO (Alignment)</title>
      <link>https://www.aidaddy.tech/learn/03-training-and-adaptation/04-rlhf-and-dpo</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/03-training-and-adaptation/04-rlhf-and-dpo</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Training &amp; Adaptation</category>
      <description>Alignment is the process of ensuring an LLM&apos;s behavior matches human values and instructions. The field has moved from traditional RLHF to more efficient and scalable methods like DPO and Online RL.</description>
    </item>
    <item>
      <title>Knowledge Distillation</title>
      <link>https://www.aidaddy.tech/learn/03-training-and-adaptation/05-knowledge-distillation</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/03-training-and-adaptation/05-knowledge-distillation</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Training &amp; Adaptation</category>
      <description>Knowledge distillation is the process of transferring the intelligence from a large, complex model (&quot;Teacher&quot;) to a smaller, more efficient one (&quot;Student&quot;). This is the secret to the high performance of today&apos;s small open-weight models…</description>
    </item>
    <item>
      <title>Synthetic Data Generation</title>
      <link>https://www.aidaddy.tech/learn/03-training-and-adaptation/06-synthetic-data-generation</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/03-training-and-adaptation/06-synthetic-data-generation</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Training &amp; Adaptation</category>
      <description>The industry has hit the &quot;Data Wall&quot;, the exhaustion of high-quality human text on the web. Synthetic data is now the primary engine for model improvement, sitting at the core of every modern frontier-model recipe.</description>
    </item>
    <item>
      <title>Quantization Deep Dive</title>
      <link>https://www.aidaddy.tech/learn/03-training-and-adaptation/07-quantization-deep-dive</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/03-training-and-adaptation/07-quantization-deep-dive</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Training &amp; Adaptation</category>
      <description>Quantization is the process of reducing the precision of model weights (e.g., from 16-bit to 4-bit) to save memory and increase inference speed. This is the primary tool for deploying large models on consumer and single-GPU hardware.</description>
    </item>
    <item>
      <title>Inference Fundamentals</title>
      <link>https://www.aidaddy.tech/learn/04-inference-optimization/01-inference-fundamentals</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/04-inference-optimization/01-inference-fundamentals</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Inference Optimization</category>
      <description>Inference is the process of generating predictions from a trained model. Inference optimization has shifted from &quot;simple speedups&quot; to &quot;architectural efficiency&quot; to handle reasoning-heavy workloads on Hopper (H100) and Blackwell (B200)…</description>
    </item>
    <item>
      <title>Speculative Decoding</title>
      <link>https://www.aidaddy.tech/learn/04-inference-optimization/03-speculative-decoding</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/04-inference-optimization/03-speculative-decoding</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Inference Optimization</category>
      <description>Speculative decoding is a now-standard technique that allows large Models (LLMs) to generate multiple tokens per forward pass, effectively breaking the memory-bandwidth bottleneck for sequential decoding.</description>
    </item>
    <item>
      <title>Batching Strategies</title>
      <link>https://www.aidaddy.tech/learn/04-inference-optimization/04-batching-strategies</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/04-inference-optimization/04-batching-strategies</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Inference Optimization</category>
      <description>Batching is the primary lever for increasing LLM throughput and reducing cost. Serving frameworks have moved beyond simple request-level batching to sub-token, iteration-level orchestration.</description>
    </item>
    <item>
      <title>PagedAttention</title>
      <link>https://www.aidaddy.tech/learn/04-inference-optimization/05-paged-attention</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/04-inference-optimization/05-paged-attention</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Inference Optimization</category>
      <description>PagedAttention is the foundational algorithm behind high-throughput serving engines (vLLM, SGLang, TensorRT-LLM). It solves the &quot;Memory Fragmentation&quot; problem that previously limited LLM scalability.</description>
    </item>
    <item>
      <title>Serving Infrastructure</title>
      <link>https://www.aidaddy.tech/learn/04-inference-optimization/06-serving-infrastructure</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/04-inference-optimization/06-serving-infrastructure</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Inference Optimization</category>
      <description>Deploying LLMs at scale requires a robust infrastructure layer that handles load balancing, model parallelism, and multi-tenant isolation. The focus has shifted from &quot;serving a model&quot; to &quot;orchestrating an inference fleet.&quot;</description>
    </item>
    <item>
      <title>Prompt Engineering Fundamentals</title>
      <link>https://www.aidaddy.tech/learn/05-prompting-and-context/01-prompt-engineering-fundamentals</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/05-prompting-and-context/01-prompt-engineering-fundamentals</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Prompting &amp; Context</category>
      <description>Prompt engineering is the design of inputs to steer LLM behavior. It has evolved from &quot;trial and error&quot; to a disciplined architectural practice, with frameworks like DSPy treating it as a compilation problem rather than a writing exercise.</description>
    </item>
    <item>
      <title>Few-Shot and In-Context Learning (ICL)</title>
      <link>https://www.aidaddy.tech/learn/05-prompting-and-context/02-few-shot-and-icl</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/05-prompting-and-context/02-few-shot-and-icl</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Prompting &amp; Context</category>
      <description>In-Context Learning (ICL) is the ability of an LLM to learn a new task simply by seeing examples in the prompt, without any weight updates. Maximizing ICL efficiency is a key lever for prompt stability.</description>
    </item>
    <item>
      <title>Chain-of-Thought (CoT)</title>
      <link>https://www.aidaddy.tech/learn/05-prompting-and-context/03-chain-of-thought</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/05-prompting-and-context/03-chain-of-thought</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Prompting &amp; Context</category>
      <description>Chain-of-Thought (CoT) is the technique of encouraging an LLM to generate intermediate reasoning steps before providing a final answer. It has evolved from a simple prompt phrase into the core architectural feature of reasoning models…</description>
    </item>
    <item>
      <title>Tree-of-Thought (ToT)</title>
      <link>https://www.aidaddy.tech/learn/05-prompting-and-context/04-tree-of-thought</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/05-prompting-and-context/04-tree-of-thought</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Prompting &amp; Context</category>
      <description>Tree-of-Thought (ToT) is an advanced prompting architecture where a model explores multiple reasoning paths, evaluates them, and &quot;backtracks&quot; if a path leads to a dead end. It is the blueprint behind modern autonomous research agents.</description>
    </item>
    <item>
      <title>Context Engineering</title>
      <link>https://www.aidaddy.tech/learn/05-prompting-and-context/05-context-engineering</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/05-prompting-and-context/05-context-engineering</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Prompting &amp; Context</category>
      <description>Context engineering is the science of filling the LLM&apos;s finite &quot;working memory&quot; with the most valuable tokens. With context windows now reaching 1M+ tokens (Claude Sonnet 4.6, Gemini 3.1 Pro, GPT-5.5) and models gaining Extended…</description>
    </item>
    <item>
      <title>Structured Generation</title>
      <link>https://www.aidaddy.tech/learn/05-prompting-and-context/06-structured-generation</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/05-prompting-and-context/06-structured-generation</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Prompting &amp; Context</category>
      <description>Structured Generation is the process of forcing an LLM to produce output in a machine-readable format (JSON, YAML, CSV) with 100% reliability. The discipline has moved from &quot;prompt-based requests&quot; to &quot;engine-level constraints.&quot;</description>
    </item>
    <item>
      <title>Prompt Injection and Defense</title>
      <link>https://www.aidaddy.tech/learn/05-prompting-and-context/08-prompt-injection-defense</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/05-prompting-and-context/08-prompt-injection-defense</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Prompting &amp; Context</category>
      <description>As LLMs become the &quot;operating system&quot; for applications, Prompt Injection is the new &quot;SQL Injection.&quot; It is the #1 LLM risk in the OWASP LLM Top 10, and modern defense treats it as an architectural concern, not just a prompt-writing one.</description>
    </item>
    <item>
      <title>RAG Fundamentals</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/01-rag-fundamentals</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/01-rag-fundamentals</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>How RAG evolved from naive vector search to agentic and graph-based retrieval. When to choose RAG vs. long context, and the three retrieval gaps that cause production failures.</description>
    </item>
    <item>
      <title>Embedding Models</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/03-embedding-models</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/03-embedding-models</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Embedding models convert text into high-dimensional vectors. The frontier has moved past static single-vector representations to **multi-resolution, late-interaction, and multimodal** embeddings.</description>
    </item>
    <item>
      <title>Vector Databases</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/04-vector-databases</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/04-vector-databases</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Vector databases are purpose-built systems for storing, indexing, and searching high-dimensional embeddings. The market has split into **Managed Serverless** and **Specialized High-Performance** engines. We no longer ask &quot;Does it…</description>
    </item>
    <item>
      <title>Hybrid Search</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/05-hybrid-search</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/05-hybrid-search</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Hybrid search combines dense (semantic) and sparse (keyword) retrieval to get the benefits of both. It is the baseline for production RAG: Elasticsearch&apos;s `rrf` retriever, OpenSearch hybrid search, Weaviate, Qdrant, and Azure AI Search…</description>
    </item>
    <item>
      <title>Reranking Strategies</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/06-reranking-strategies</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/06-reranking-strategies</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Reranking is the second stage of retrieval that re-scores a small set of candidates (Top 50-100) using a high-precision model. It is the bridge between &quot;efficient search&quot; and &quot;perfect grounding&quot;: first-stage retrieval optimizes for…</description>
    </item>
    <item>
      <title>GraphRAG</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/07-graph-rag</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/07-graph-rag</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>GraphRAG is the combination of **Knowledge Graphs (KG)** and **Retrieval-Augmented Generation**. While vector RAG is good at &quot;finding a specific chunk,&quot; GraphRAG is designed for **Global Reasoning** across an entire dataset.</description>
    </item>
    <item>
      <title>Agentic RAG</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/08-agentic-rag</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/08-agentic-rag</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Agentic RAG moves from a &quot;Linear Pipeline&quot; to a **&quot;Reasoning Loop.&quot;** Instead of retrieving once, an agent decides *when* and *what* to retrieve to resolve a query. The dominant production patterns are Self-RAG (model emits reflection…</description>
    </item>
    <item>
      <title>Advanced Retrieval Patterns</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/09-advanced-retrieval-patterns</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/09-advanced-retrieval-patterns</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Beyond the basics, production RAG systems use specialized patterns to handle complex query-document gaps. These patterns are the &quot;secret sauce&quot; of high-precision search and are increasingly bundled into managed RAG offerings.</description>
    </item>
    <item>
      <title>Contextual Retrieval</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/10-contextual-retrieval</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/10-contextual-retrieval</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Contextual Retrieval is an ingestion-time technique that solves the #1 cause of RAG failure: **chunks that lose meaning when separated from their source document**. Pioneered by Anthropic in late 2024, it is now a production standard…</description>
    </item>
    <item>
      <title>Late Interaction &amp; ColBERT</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/11-late-interaction-colbert</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/11-late-interaction-colbert</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Late Interaction is a retrieval paradigm that sits between fast-but-imprecise **bi-encoders** and accurate-but-slow **cross-encoders**. ColBERT (Contextualized Late Interaction over BERT) is the defining model in this space, delivering…</description>
    </item>
    <item>
      <title>Multi-Modal RAG</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/12-multimodal-rag</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/12-multimodal-rag</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Multi-modal RAG extends retrieval-augmented generation beyond plain text to handle images, tables, charts, audio, and mixed-layout documents. Production systems now routinely ingest PDFs with diagrams, slide decks, scanned invoices, and…</description>
    </item>
    <item>
      <title>RAG Evaluation Patterns</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/13-rag-evaluation-patterns</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/13-rag-evaluation-patterns</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Evaluation is the hardest unsolved problem in RAG. You can build a retrieval pipeline in a day; knowing whether it actually works takes weeks. The industry has converged on a layered evaluation strategy: the RAG Triad for correctness,…</description>
    </item>
    <item>
      <title>Production RAG at Scale</title>
      <link>https://www.aidaddy.tech/learn/06-retrieval-systems/14-production-rag-at-scale</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/06-retrieval-systems/14-production-rag-at-scale</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Retrieval Systems</category>
      <description>Production RAG is no longer a weekend project. It is a distributed system with retrieval pipelines, caching layers, routing logic, self-correction loops, multi-tenant isolation, and cost controls, all operating under strict latency…</description>
    </item>
    <item>
      <title>Agent Fundamentals</title>
      <link>https://www.aidaddy.tech/learn/07-agentic-systems/01-agent-fundamentals</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/07-agentic-systems/01-agent-fundamentals</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Agentic Systems</category>
      <description>Agents are LLM-powered systems that move beyond &quot;chat&quot; into &quot;autonomous problem solving.&quot; The definition has shifted from simple ReAct loops to **Closed-Loop Reasoning Systems** that use built-in &quot;System 2&quot; thinking (Claude Opus 4.7…</description>
    </item>
    <item>
      <title>Reasoning Loops: ReAct and Beyond</title>
      <link>https://www.aidaddy.tech/learn/07-agentic-systems/02-reasoning-loops-react-and-beyond</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/07-agentic-systems/02-reasoning-loops-react-and-beyond</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Agentic Systems</category>
      <description>Reasoning Loops define the control flow of an agent. While **ReAct** was the 2023 baseline, current systems use more sophisticated patterns like **Plan-and-Solve**, **Self-Reflexion**, and **Inference-Time Scaling** running on top of…</description>
    </item>
    <item>
      <title>Multi-Agent Orchestration</title>
      <link>https://www.aidaddy.tech/learn/07-agentic-systems/04-multi-agent-orchestration</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/07-agentic-systems/04-multi-agent-orchestration</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Agentic Systems</category>
      <description>Complex systems are rarely one agent. They are teams of specialized agents. Orchestration has matured from &quot;Blind Managers&quot; to **Hierarchical Supervisors**, **Dynamic Swarms**, and **Cross-Vendor Agent Networks** enabled by…</description>
    </item>
    <item>
      <title>Agent Memory and State</title>
      <link>https://www.aidaddy.tech/learn/07-agentic-systems/05-agent-memory-and-state</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/07-agentic-systems/05-agent-memory-and-state</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Agentic Systems</category>
      <description>Memory is what allows an agent to learn and maintain context over time. Agent memory has matured from &quot;Chat History&quot; into a **Multi-Tiered Cognitive Architecture** with four named layers (Working, Episodic, Semantic, Procedural), each…</description>
    </item>
    <item>
      <title>Planning and Decomposition</title>
      <link>https://www.aidaddy.tech/learn/07-agentic-systems/06-planning-and-decomposition</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/07-agentic-systems/06-planning-and-decomposition</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Agentic Systems</category>
      <description>Planning is the &quot;System 2&quot; component that allows agents to solve multi-stage problems without &quot;wandering.&quot; Production agents have moved from simple &quot;Chain-of-Thought&quot; to **Recursive Decomposition** and **Tree Search**, with…</description>
    </item>
    <item>
      <title>Error Handling and Recovery</title>
      <link>https://www.aidaddy.tech/learn/07-agentic-systems/07-error-handling-and-recovery</link>
      <guid isPermaLink="true">https://www.aidaddy.tech/learn/07-agentic-systems/07-error-handling-and-recovery</guid>
      <pubDate>Mon, 25 May 2026 00:02:47 GMT</pubDate>
      <category>Agentic Systems</category>
      <description>Agents fail in non-deterministic ways. Error handling has moved from &quot;Try-Catch blocks&quot; to **Agentic Self-Correction** and **Stateful Rollbacks**, with frameworks like LangGraph and Microsoft Agent Framework providing native…</description>
    </item>
  </channel>
</rss>
