NLP with Neural Networks

CS 6957, Fall 2023

Vanishing gradient revisited: Highway/Residual connections

In this lecture, we will revisit the vanishing gradient problem. The general techniques used to make recurrent networks robust can be applied for general deep neural networks.

Lectures and readings