<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Ai-Engineering on Jatin Bansal</title><link>https://blog.jatinbansal.com/categories/ai-engineering/</link><description>Recent content in Ai-Engineering on Jatin Bansal</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Tue, 12 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://blog.jatinbansal.com/categories/ai-engineering/index.xml" rel="self" type="application/rss+xml"/><item><title>Chunking Strategies for Retrieval</title><link>https://blog.jatinbansal.com/ai-engineering/chunking-strategies/</link><pubDate>Tue, 12 May 2026 00:00:00 +0000</pubDate><guid>https://blog.jatinbansal.com/ai-engineering/chunking-strategies/</guid><description>Why chunk size is RAG&amp;#39;s most undertuned variable, how recursive, semantic, and structural chunking differ, and when parent-document retrieval wins.</description></item><item><title>LLM Inference: Tokens, Context, and Sampling</title><link>https://blog.jatinbansal.com/ai-engineering/llm-inference-fundamentals/</link><pubDate>Mon, 11 May 2026 00:00:00 +0000</pubDate><guid>https://blog.jatinbansal.com/ai-engineering/llm-inference-fundamentals/</guid><description>How LLMs process text: BPE tokenization, the context window as working memory, KV caching, and sampling parameters that shape output variance.</description></item><item><title>Text Embeddings: Turning Meaning into Geometry</title><link>https://blog.jatinbansal.com/ai-engineering/text-embeddings/</link><pubDate>Mon, 11 May 2026 00:00:00 +0000</pubDate><guid>https://blog.jatinbansal.com/ai-engineering/text-embeddings/</guid><description>How embedding models encode text as dense vectors, why cosine similarity captures meaning, and how to build semantic search in Python and TypeScript.</description></item><item><title>Vector Databases &amp; ANN Indexes</title><link>https://blog.jatinbansal.com/ai-engineering/vector-databases-ann/</link><pubDate>Mon, 11 May 2026 00:00:00 +0000</pubDate><guid>https://blog.jatinbansal.com/ai-engineering/vector-databases-ann/</guid><description>How HNSW, IVF, and ScaNN trade recall for speed, why exact KNN doesn&amp;#39;t scale, and how to pick between pgvector, Qdrant, and Pinecone in production.</description></item></channel></rss>