DeepNorm

[Figure: DeepNet graph, number of layers vs. years]

Pile on the Layers!

Adding layers to a neural network puts the “deep” in deep learning, but it also increases the chance that the network will get stuck during training. A new approach effectively trains transformers with an order of magnitude more layers than previous methods.