Activation Functions & Optimization

References

  1. I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. Cambridge, MA, USA: MIT Press, 2016.
  2. M. A. Nielsen, Neural Networks and Deep Learning. Determination Press, 2015.
  3. D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” International Conference on Learning Representations (ICLR), 2015.