About Blog & Videos Research Art

Fast and Efficient Training of Neural Networks

February 5, 2020

References in the Video

  • CodeEmporium
  • Code for the video
  • Code behind the DCGAN with Apex
  • Mixed Precision Training
  • Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
  • Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes
  • Don't Decay the Learning Rate, Increase the Batch Size
  • On the Variance of the Adaptive Learning Rate and Beyond
  • Cyclical Learning Rates for Training Neural Networks
  • The 1cycle policy
  • Super-Convergence: Very Fast Training of Neural Networks Using Large Learning Rates
  • Bag of Tricks for Image Classification with Convolutional Neural Networks
  • mixup: Beyond Empirical Risk Minimization
  • Deep Double Descent
  • Deep Double Descent: Where Bigger Models and More Data Hurt
  • Reconciling modern machine learning practice and the bias-variance trade-of