Gradient Descent

4 Posts


Gradient Descent: It’s All Downhill

Imagine hiking in the mountains past dusk and finding that you can’t see much beyond your feet. Your phone’s battery has died, so you can’t use a GPS app to find your way home. You might find the quickest path down via gradient descent.
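To make the hiking analogy concrete, here is a minimal sketch of gradient descent in one dimension. The function name `gradient_descent`, the example loss f(x) = (x - 3)^2, the step size, and the step count are all assumptions chosen for illustration; they do not come from the post.

```python
def gradient_descent(grad, start, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, like a hiker who always
    moves in the locally steepest downhill direction."""
    x = start
    for _ in range(steps):
        # Move a small distance opposite to the slope at the current point.
        x = x - learning_rate * grad(x)
    return x


# Hypothetical example: f(x) = (x - 3)^2 has gradient 2 * (x - 3)
# and its lowest point at x = 3.
minimum = gradient_descent(lambda x: 2 * (x - 3), start=10.0)
print(minimum)
```

With these settings the result lands very close to 3. A step size that is too large would overshoot the valley, and one that is too small would take many more steps to get there.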

Striding Toward the Minimum

When you’re training a deep learning model, it can take days for an optimization algorithm to minimize the loss function. A new approach could save time.

Optimizer Shootout

Everyone has a favorite optimization method, but it’s not always clear which one works best in a given situation. New research aims to establish a set of benchmarks. Researchers evaluated 14 popular optimizers using the Deep Optimization Benchmark Suite, which some of them introduced last year.

When Optimization is Suboptimal

Bias arises in machine learning when we fit an overly simple function to a more complex problem. A theoretical study shows that gradient descent itself may introduce such bias and render algorithms unable to fit data properly.