CourseIntermediate5 Course Modules

Retrieval Augmented Generation (RAG)

Instructors: Zain Hasan

DeepLearning.AI logo

Retrieval Augmented Generation (RAG)

Intermediate

5 Courses

49 Video Lessons

9 Reading Lessons

9 Practices

5 Graded Assignments

Instructor: Zain Hasan

DeepLearning.AI

DeepLearning.AI

What you'll learn

  • RAG for real-world applications: Learn how retrieval and generation work together, and how to design each component to build reliable, flexible RAG systems.

  • Search techniques and vector databases: Use techniques like keyword search, semantic search, hybrid search, chunking, and query parsing to support RAG applications across domains like healthcare and e-commerce.

  • Prompt design, evaluation, and deployment: Craft prompts that make the most of retrieved context, evaluate RAG system performance, and prepare your pipeline for production.

The benefits of RAG

RAG helps large language models generate more accurate and useful responses by retrieving relevant information from knowledge bases of information they weren’t trained on. These sources of information are often private, recent, or domain-specific, which gives an LLM more context to provide grounded answers. In this course, you’ll learn how to design and implement every part of a RAG system, from retrievers and vector databases to large language models, and evaluation platforms. You’ll understand fundamental principles and apply key techniques at both the component and system levels to effectively connect LLMs to relevant external data sources.

Why Enroll?

Large language models are powerful, but without access to the right information, they often make mistakes. RAG fixes that by grounding model responses in relevant, often private or up-to-date data. As LLMs move into real products and workflows, the ability to build robust, reliable RAG systems is becoming a must-have skill for engineers working in AI.

This course, taught by AI engineer and educator Zain Hasan, gives you the hands-on experience and conceptual understanding to design, build, and evaluate production-ready RAG systems.

You’ll learn to choose the right architecture for your use case, work with vector databases like Weaviate, experiment with prompt and retrieval strategies, and monitor performance using tools like Phoenix from Arize.

Throughout the course, you’ll build progressively more advanced components of a RAG system, using real-world datasets from domains like e-commerce, media, and healthcare. You’ll also explore critical tradeoffs, like when to use hybrid retrieval, how to manage context window limits, and how to balance latency and cost, preparing you to make informed engineering decisions in practice.

RAG is at the core of LLM systems that need to be accurate, grounded, and adaptable, whether for internal tools, customer-facing assistants, or specialized applications. This course helps you move beyond proof-of-concept demos into real-world deployment, equipping you with the skills to build, evaluate, and evolve RAG systems as the ecosystem grows.

Start building RAG systems designed for real-world use.

You’ll earn a certificate upon completing the course, recognizing your skills in building and evaluating RAG systems with real-world tools and techniques

Instructor

Zain Hasan

Zain Hasan

Senior AI/ML Developer Relations Engineer at Together.ai, AI/ML Researcher and Lecturer at the University of Toronto

Learn through real-world projects

  • Build your first RAG system by writing retrieval and prompt augmentation functions and passing structured input into an LLM.

  • Implement and compare retrieval methods like semantic search, BM25, and Reciprocal Rank Fusion to see how each impacts LLM responses.

  • Scale your RAG system using Weaviate and a real news dataset—chunking, indexing, and retrieving documents with a vector database.

  • Develop a domain-specific chatbot for a fictional clothing store that answers FAQs and provides product suggestions based on a custom dataset.

  • Improve chatbot reliability by handling real-world challenges like dynamic pricing and logging user interactions for monitoring and debugging.

  • Develop a domain-specific chatbot using open-source LLMs hosted by Together AI for a fictional clothing store that answers FAQs and provides product suggestions based on a custom dataset.

Recommended Background

Intermediate Python skills required; basic knowledge of generative AI and high school–level math is helpful.

Learner Reviews

Frequently Asked Questions

Course Outline

49 Video Lessons • 9 Reading Lessons • 9 Practices • 5 Graded Assignments

A conversation with Andrew Ng

Video • 8 mins

Module 1 introduction

Video • 1 min

Introduction to RAG

Video • 5 mins

Applications of RAG

Video • 4 mins

RAG architecture overview

Video • 5 mins

Introduction to LLMs

Video • 9 mins

- A brief Python refresher

Code Example

LLM Calls and Crafting Simple Augmented Prompts

Code Example

Introduction to information retrieval

Video • 5 mins

[IMPORTANT] Have questions, issues or ideas? Join our Forum!

Reading • 2 mins

Module 1 Quiz

Graded・Quiz • 20 mins

Introduction to RAG systems

Graded・Code Assignment • 3 hours

Module 1 conclusion

Video • 1 min

Lecture Notes M1

Reading • 1 min

Retrieval Augmented Generation (RAG)

Intermediate

5 Courses

49 Video Lessons

9 Reading Lessons

9 Practices

5 Graded Assignments

Instructor: Zain Hasan

DeepLearning.AI

DeepLearning.AI

Want to learn more about Generative AI?

Keep learning with updates on curated AI news, courses, and events, as well as Andrew's thoughts from DeepLearning.AI!

Enroll for Free

Also available on Coursera