Retrieval Augmented Generation (RAG) Course

Build Retrieval Augmented Generation (RAG) systems for real-world applications. Gain fundamental understanding and the practical knowledge to develop production-ready RAG applications, from architecture to deployment and evaluation.

Enroll Now

RAG for real-world applications: Learn how retrieval and generation work together, and how to design each component to build reliable, flexible RAG systems.
Search techniques and vector databases: Use techniques like keyword search, semantic search, hybrid search, chunking, and query parsing to support RAG applications across domains like healthcare and e-commerce.
Prompt design, evaluation, and deployment: Craft prompts that make the most of retrieved context, evaluate RAG system performance, and prepare your pipeline for production.

The benefits of RAG

RAG helps large language models generate more accurate and useful responses by retrieving relevant information from knowledge bases of information they weren’t trained on. These sources of information are often private, recent, or domain-specific, which gives an LLM more context to provide grounded answers. In this course, you’ll learn how to design and implement every part of a RAG system, from retrievers and vector databases to large language models, and evaluation platforms. You’ll understand fundamental principles and apply key techniques at both the component and system levels to effectively connect LLMs to relevant external data sources.

Why Enroll?

Large language models are powerful, but without access to the right information, they often make mistakes. RAG fixes that by grounding model responses in relevant, often private or up-to-date data. As LLMs move into real products and workflows, the ability to build robust, reliable RAG systems is becoming a must-have skill for engineers working in AI.

This course, taught by AI engineer and educator Zain Hasan, gives you the hands-on experience and conceptual understanding to design, build, and evaluate production-ready RAG systems.

You’ll learn to choose the right architecture for your use case, work with vector databases like Weaviate, experiment with prompt and retrieval strategies, and monitor performance using tools like Phoenix from Arize.

Throughout the course, you’ll build progressively more advanced components of a RAG system, using real-world datasets from domains like e-commerce, media, and healthcare. You’ll also explore critical tradeoffs, like when to use hybrid retrieval, how to manage context window limits, and how to balance latency and cost, preparing you to make informed engineering decisions in practice.

RAG is at the core of LLM systems that need to be accurate, grounded, and adaptable, whether for internal tools, customer-facing assistants, or specialized applications. This course helps you move beyond proof-of-concept demos into real-world deployment, equipping you with the skills to build, evaluate, and evolve RAG systems as the ecosystem grows.

Start building RAG systems designed for real-world use.

Enroll Now ✨

You’ll earn a certificate upon completing the course, recognizing your skills in building and evaluating RAG systems with real-world tools and techniques

Instructor

Zain Hasan

Zain Hasan is an AI engineer and educator with nearly a decade of experience building and teaching machine learning systems. He combines deep technical knowledge with a practical approach to system design, helping you understand both the theory and the real-world tradeoffs behind RAG architectures. His teaching style feels less like a lecture and more like learning from a sharp, thoughtful teammate who’s been there before.

5 Modules
Self-paced
Intermediate

Learn through real-world projects

Across five modules, you’ll complete hands-on programming assignments that guide you through building each core part of a RAG system, from simple prototypes to production-ready components.

Through hands-on labs, you’ll:

Build your first RAG system by writing retrieval and prompt augmentation functions and passing structured input into an LLM.

Implement and compare retrieval methods like semantic search, BM25, and Reciprocal Rank Fusion to see how each impacts LLM responses.

Scale your RAG system using Weaviate and a real news dataset—chunking, indexing, and retrieving documents with a vector database.

Develop a domain-specific chatbot for a fictional clothing store that answers FAQs and provides product suggestions based on a custom dataset.

Improve chatbot reliability by handling real-world challenges like dynamic pricing and logging user interactions for monitoring and debugging.

Develop a domain-specific chatbot using open-source LLMs hosted by Together AI for a fictional clothing store that answers FAQs and provides product suggestions based on a custom dataset.

Enroll Now!

Course Syllabus

Module 1: RAG Overview

Introduction to RAG, applications of RAG, RAG architecture overview, intro to LLMs, brief intro to Python, LLM calls, intro to information retrieval

Module 2: Information Retrieval and Search Foundations

Retriever architecture overview, metadata filtering, keyword search (TF-IDF and BM25), semantic search, vector embeddings in RAGv, hybrid search, evaluating retrieval, retrieval metrics

Module 3: Information Retrieval with Vector Databases

ANN algorithms, vector databases, intro to the Weaviate API, chunking, query parsing, cross-encoders and ColBERT, reranking

Module 4: LLMs and Text Generation

Transformer architecture, LLM sampling strategies, exploring LLM capabilities, choosing your LLM, prompt engineering, handling hallucinations, evaluating your LLM’s performance, agentic RAG, RAG vs. fine-tuning

Module 5: RAG Systems in Production

What makes production challenging, implementing RAG evaluation strategies, logging, monitoring, and observability, tracing a RAG system, customized evaluation, quantization, cost vs response quality, latency vs response quality, security, multimodal RAG

Recommended Background

Intermediate Python skills required; basic knowledge of generative AI and high school–level math is helpful.

Enroll Now

Learner reviews from other DeepLearning.AI courses

What I loved about the “AI for Everyone” course was the comprehensive coverage of essential AI topics, guided by the expertise of Andrew Ng. The course provided a clear roadmap for initiating and managing AI projects, from project selection to implementation. It also offered insights into building AI teams and introduced the technical tools necessary for AI success

Selami A.
Software QA Manager

Simple enough to make it easy to understand in spite of being a complex topic, inspiring speaker. Time well spent, and a good fit with “lifelong learning” approach.

Chris C.
DeepLearning.AI Learner

What stood out to me about this course was the clarity and simplicity with which complex AI concepts were explained. The real-life examples and case studies helped me grasp the practical implications of AI in different sectors. The interactive nature of the course made learning engaging and enjoyable.

Adeel B.
DeepLearning.AI Learner

I am an educator and looking to incorporate AI into my career and help my colleagues to do the same. The course did a great job explaining AI concepts to people like myself who are just learning about any of this for the first time.

Krystal L.
DeepLearning.AI Learner

I took this course purely out of curiosity. After becoming aware of ChatGPT and Midjourney and then taking a short course on engineering the prompts to get the desired result, I became more intrigued with the topic of AI. I found this most helpful with regards to getting an idea about what AI actually is as opposed to what Hollywood conditioned me to believe it might be.

John S.
DeepLearning.AI Learner

Loved the content. It brought simplicity to the complex topic of AI, separated signal from noise, presented a great flow and covered the most relevant topics.

Andrew’s knowledge and passion about the subject of AI was amazing. It was inspiring to listen to him, even via recorded videos. Its really great to be in this era of technology, as it makes it possible to get access to the wealth of knowledge so easily.

Muhammad S.
DeepLearning.AI Learner

Frequently Asked Questions

Want to learn more about Generative AI?

Keep learning with updates on curated news, courses, and events, as well as Andrew’s thoughts from DeepLearning.AI!

First name*

Last name*

Email*

Where do you live?*

I want to get the latest news, courses, and workshops and events announcements*