University College Dublin

2 Posts

Abeba Birhane
University College Dublin

Abeba Birhane: Clean Up Web Datasets

From language to vision models, deep neural networks are marked by improved performance, higher efficiency, and better generalizations. Yet, these systems are also marked by perpetuation of bias and injustice.
3 min read
Series of example of accurate and inaccurate matching images to text
University College Dublin

Crawl the Web, Absorb the Bias: Language Models Absorb Biases from Web Training Data

The emerging generation of trillion-parameter models needs datasets of billions of examples, but the most readily available source of examples on that scale — the web — is polluted with bias and antisocial expressions. A new study examines the issue.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox