This lecture covers the least mean squares method for linear regression. Along the way, we will encounter the idea of learning by minimizing a loss function, as well as gradient descent and stochastic gradient descent.
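As a rough preview of how these ideas fit together, here is a minimal sketch (not taken from the lecture) of fitting a line with stochastic gradient descent on the squared-error loss. The function name `lms_sgd`, the synthetic data, the learning rate, and the number of epochs are all assumptions chosen for illustration.

```python
import random


def lms_sgd(xs, ys, lr=0.05, epochs=100, seed=0):
    """Fit y ~ w0 + w1 * x by stochastic gradient descent.

    Per-example loss: 0.5 * (w0 + w1 * x - y) ** 2.
    Its gradient with respect to (w0, w1) is (err, err * x),
    where err = w0 + w1 * x - y.
    """
    rng = random.Random(seed)
    w0, w1 = 0.0, 0.0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)  # visit the examples in a fresh random order
        for i in indices:
            err = (w0 + w1 * xs[i]) - ys[i]
            # Step against the gradient of this single example's loss.
            w0 -= lr * err
            w1 -= lr * err * xs[i]
    return w0, w1


# Hypothetical synthetic data: y = 1 + 2x plus a little Gaussian noise.
data_rng = random.Random(1)
xs = [data_rng.uniform(-1.0, 1.0) for _ in range(200)]
ys = [1.0 + 2.0 * x + data_rng.gauss(0.0, 0.01) for x in xs]

w0, w1 = lms_sgd(xs, ys)
print(w0, w1)  # should land close to the true values 1 and 2
```

Full-batch gradient descent would instead average the gradient over all examples before each update. The stochastic version updates after every single example, which makes each step cheaper but noisier.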
Lectures

 Older videos: [spring 2023, lecture 1], [spring 2023, lecture 2], [fall 2018], [fall 2017]
Links and resources
 Chapter 3.1 of Christopher Bishop, Pattern Recognition and Machine Learning.