Diagram showing how open source tool Giskard works

Testing for Large Language Models: Meet Giskard, an automated quality manager for LLMs.

An open source tool automatically tests language and tabular-data models for social biases and other common issues. Giskard is a software framework that evaluates models using a suite of heuristics and tests based on GPT-4.
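
To make the idea of a heuristic bias test concrete, here is a minimal sketch of one common kind of check: swap gendered words in an input and flag cases where the model's output changes. This is an illustration only, not Giskard's API or its actual test suite. The swap table, the function names `swap_gendered_terms` and `find_sensitive_inputs`, and the deliberately biased `toy_model` are all hypothetical.

```python
import re
from typing import Callable, Dict, List

# Hypothetical swap table (an assumption for illustration). A real tool would
# use a far larger vocabulary; "her" -> "his" is a simplification, since "her"
# can also be an object pronoun.
SWAPS: Dict[str, str] = {
    "he": "she", "she": "he",
    "his": "her", "her": "his",
    "man": "woman", "woman": "man",
}


def swap_gendered_terms(text: str) -> str:
    """Replace each gendered word with its counterpart, keeping capitalization."""
    def replace(match: re.Match) -> str:
        word = match.group(0)
        swapped = SWAPS[word.lower()]
        return swapped.capitalize() if word[0].isupper() else swapped

    pattern = r"\b(" + "|".join(SWAPS) + r")\b"
    return re.sub(pattern, replace, text, flags=re.IGNORECASE)


def find_sensitive_inputs(predict: Callable[[str], str], texts: List[str]) -> List[str]:
    """Return the inputs whose prediction changes after the gender swap."""
    return [t for t in texts if predict(t) != predict(swap_gendered_terms(t))]


def toy_model(text: str) -> str:
    """Stand-in for a real model, deliberately biased so the check finds something."""
    return "approve" if re.search(r"\bhe\b", text.lower()) else "review"


flagged = find_sensitive_inputs(
    toy_model,
    ["He applied for a loan.", "The applicant has a stable income."],
)
print(flagged)  # inputs where swapping gendered terms changed the prediction
```

A check like this needs no labels, only a model and some inputs, which is part of why such heuristics lend themselves to automated scanning. Tests that judge free-form LLM output would need a stronger evaluator than string equality.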

