Deep Learning for NLP

CS 6956, Spring 2019

Vanishing gradient revisited: Highway/Residual connections

In this lecture, we revisit the vanishing gradient problem. The techniques that make recurrent networks robust to vanishing gradients can also be applied to deep neural networks in general.
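As a rough illustration of why the connections named in the lecture title help, here is a minimal sketch using scalar "layers". The functions `plain_step`, `residual_step`, `highway_step`, and `input_gradient`, the tanh nonlinearity, and the weight, depth, and gate values are all assumptions chosen for this example; they are not taken from the lecture materials. The sketch compares the gradient of the last layer's output with respect to the input for a plain stack and a residual stack, and shows the forward pass of one highway layer.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def plain_step(h, w):
    """One plain layer: h' = tanh(w * h). Returns (h', dh'/dh)."""
    out = math.tanh(w * h)
    return out, w * (1.0 - out * out)


def residual_step(h, w):
    """One residual layer: h' = h + tanh(w * h). Returns (h', dh'/dh)."""
    f = math.tanh(w * h)
    # The identity path contributes the constant 1 to the derivative.
    return h + f, 1.0 + w * (1.0 - f * f)


def highway_step(h, w, w_t, b_t):
    """One highway layer (forward pass only).

    h' = t * tanh(w * h) + (1 - t) * h, with gate t = sigmoid(w_t * h + b_t).
    """
    t = sigmoid(w_t * h + b_t)
    return t * math.tanh(w * h) + (1.0 - t) * h


def input_gradient(step, h0, w, depth):
    """Chain rule: multiply per-layer derivatives to get d h_depth / d h_0."""
    h, grad = h0, 1.0
    for _ in range(depth):
        h, local = step(h, w)
        grad *= local
    return grad


# Hypothetical settings: input 0.3, shared weight 0.5, 30 layers.
plain_grad = input_gradient(plain_step, 0.3, 0.5, 30)
residual_grad = input_gradient(residual_step, 0.3, 0.5, 30)
print("plain stack gradient:   ", plain_grad)
print("residual stack gradient:", residual_grad)

# With a strongly negative gate bias, the highway layer mostly copies its input.
print("highway output:", highway_step(0.3, 0.5, 1.0, -20.0))
```

In the plain stack, each layer's derivative is at most 0.5 here, so the product over 30 layers shrinks toward zero. In the residual stack, each factor is at least 1 because of the identity path, so the gradient does not vanish. The highway layer replaces the fixed identity path with a learned gate that decides how much of the input to carry through.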

Lectures and readings