Learn to evaluate programs that use LLMs and generative image models with platform-independent tools
Evaluating and Debugging Generative AI
Instructor: Carey Phelps
Earn an accomplishment with PRO

- Intermediate
- 50 mins
- 7 Video Lessons
- 5 Code Examples
- 1 Graded Assignment PRO
Weights and Biases
What you'll learn
Instrument a training notebook and add tracking, versioning, and logging
Implement monitoring and tracing of LLMs over time in complex interactions
About this course
Machine learning and AI projects require managing diverse data sources and vast data volumes, developing models and tuning parameters, and running numerous test and evaluation experiments. Overseeing and tracking all of these aspects of a project can quickly become overwhelming.
This course introduces you to Machine Learning Operations (MLOps) tools that manage this workload. You will learn to use the Weights & Biases platform, which makes it easy to track your experiments, run and version your data, and collaborate with your team.
This course will teach you to:
- Instrument a Jupyter notebook
- Manage hyperparameter config
- Log run metrics
- Collect artifacts for dataset and model versioning
- Log experiment results
- Trace prompts and responses to LLMs over time in complex interactions
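The tracking steps listed above can be sketched with the standard library alone. This is a minimal illustration, not the Weights & Biases API: the `Run` class, its `log` and `log_artifact` methods, the toy `train` loop, and all sample values are hypothetical names chosen for this example. It assumes a run holds a hyperparameter config, a step-indexed metric history, and artifacts versioned by content hash.

```python
import hashlib
import json
from dataclasses import dataclass, field


@dataclass
class Run:
    """Hypothetical in-memory stand-in for an experiment-tracking run."""

    project: str
    config: dict  # hyperparameters, fixed when the run starts
    history: list = field(default_factory=list)
    artifacts: dict = field(default_factory=dict)

    def log(self, metrics: dict) -> None:
        # Store each metrics dict together with an auto-incrementing step.
        self.history.append({"step": len(self.history), **metrics})

    def log_artifact(self, name: str, payload: bytes) -> str:
        # Version by content hash: identical bytes reuse the same version.
        digest = hashlib.sha256(payload).hexdigest()[:12]
        versions = self.artifacts.setdefault(name, [])
        if digest not in versions:
            versions.append(digest)
        return f"{name}:v{versions.index(digest)}"


def train(run: Run) -> float:
    # Toy loop (not a real model): the loss shrinks by a factor of
    # (1 - lr) each epoch so that there is something to log.
    loss = 1.0
    for epoch in range(run.config["epochs"]):
        loss *= 1.0 - run.config["lr"]
        run.log({"epoch": epoch, "loss": loss})
    return loss


run = Run(project="demo", config={"lr": 0.1, "epochs": 3})
final_loss = train(run)

tag_a = run.log_artifact("dataset", b"rows-v1")
tag_b = run.log_artifact("dataset", b"rows-v1")  # unchanged data, same version
tag_c = run.log_artifact("dataset", b"rows-v2")  # changed data, new version

print(json.dumps({"config": run.config, "last": run.history[-1]}, indent=2))
print(tag_a, tag_b, tag_c)
```

Keeping the config, the metric history, and the artifact versions on one run object is what makes an experiment reproducible later: a result can be traced back to the exact hyperparameters and data version that produced it.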
When you complete this course, you will have a systematic workflow at your disposal to boost your productivity and accelerate your journey toward breakthrough results.
Who should join?
Anyone familiar with Python and PyTorch (or a similar framework) who is interested in managing, versioning, and debugging their machine learning workflow.
Course Outline
7 Lessons・5 Code Examples
- Introduction: Video・3 mins
- Instrument W&B: Video with Code Example・10 mins
- Training a Diffusion Model with W&B: Video with Code Example・5 mins
- Evaluating Diffusion Models: Video with Code Example・7 mins
- LLM Evaluation and Tracing with W&B: Video with Code Example・14 mins
- Finetuning a language model: Video with Code Example・7 mins
- Conclusion: Video・1 min
- Quiz: Graded・10 mins

Course access is free for a limited time during the DeepLearning.AI learning platform beta!

