Mixtape

1 Post

Graph related to Mixture of Softmaxes (MoS)
Mixtape

Upgrading Softmax

Softmax commonly computes probabilities in a classifier’s output layer. But softmax isn’t always accurate in complex tasks — say, in a natural-language task, when the length of word vectors is much smaller than the number of words in the vocabulary.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox