Trick Me If You Can

1 Post

Screen captures of online platform Dynabench
Trick Me If You Can

Dynamic Benchmarks

Benchmarks provide a scientific basis for evaluating model performance, but they don’t necessarily map well to human cognitive abilities. Facebook aims to close the gap through a dynamic benchmarking method that keeps humans in the loop.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox