Goal: Our assignment will provide students with the foundations of gradient descent with a primary focus on stochastic gradient descent (SGD). In the coding assignment, students will implement and explore various properties of SGD (learning rate, batch size) and its performance on various loss functions. They will work with real data to practice their technical skills and to motivate variations to explore in different situations. Additionally, they will be introduced to variations on the gradient descent algorithm.
- Multivariable Calculus Review
- Define multivariable concepts (for review)
- Gradients
- Saddle Points, Local/Global Maxima/Minima
- Introduction to Gradient Descent
- Establish graphical intuition/visualizations
- Define different parameters and discuss how to tune them
- Learning Rate (Explore different behaviors of convergence)
- Introduce Batch gradient descent, mini-batch)
- Stochastic Gradient Descent
- SGD: Explanation and Motivation, why it is necessary
- Pros and Cons of SGD
- Briefly establish bounds on convergence
- Teach students how to implement SGD in sklearn, using built in functions and compare with student implementation
- Introduction to sklearn (relevant methods used in practice)
- Quick recap of jupyter notebook
- Guide student through manual implementation of SGD
- Show students examples of how SGD performs based on different loss functions:
- Examples of loss functions with saddle points, local minima versus global minima
- Explore different loss functions
- When to use certain loss functions? (Squared-Loss, L2 regularization, logistic loss)
- Ask them to compute gradients of these losses - maybe generate plots of squared-loss v.s. Regularized loss
- Lastly, students explore different variations of gradient descent
- https://ruder.io/optimizing-gradient-descent/
- Show some specific cases where accelerated GD/GD with momentum does much better
- AdamOptimizer
(Specific learning goals for the Coding Assignment can be found at the start of the coding assignment)
- Quiz questions and the Coding Assignment have been implemented in Google Colab. **** Note: Some solutions do not appear in the github preview of the files, please download the .ipynb files and view in jupyter or colab to see all solutions ****
- The quiz and project .ipynb files are located in this repository.
- Download the files, then either use them locally with jupyter notebook or upload them to Google Drive and select open with Google Colab.