Beyond Swish: the LiSHT activation function Chris17 November 201929 December 20203 Comments Deep neural networks perform linear operations to combine weight vectors with input vectors. The values that are the outputs of...
Why Swish could perform better than ReLu Chris30 May 20192 February 20207 Comments Neural networks are composed of various layers of neurons. Mathematically, a neuron is nothing but the dot product between the...