During training of a Deep Model, we backpropagate our Gradients through the Network's layers. During learning, once a gradient value grows extremely large, it causes an overflow (i.e. NaN), which is easily detectable at runtime; in a less extreme situation, the Model starts overshooting past our Minima. This issue is called the Gradient Explosion Problem.

This is when the gradients get exponentially large from being repeatedly multiplied by numbers larger than 1. Consider the example below:

Source: Hinton's Coursera Lecture Videos.

Gradient clipping will 'clip' the gradients, or cap them to a threshold value, to prevent them from getting too large. In the image above, the gradient is clipped from overshooting, and our cost function follows the dotted values rather than its original trajectory.

L2 Norm Clipping

There exist various ways to perform gradient clipping, but a common one is to normalize the gradients of a parameter vector when its L2 norm exceeds a certain threshold:

new_gradients = gradients * threshold / l2_norm(gradients)

We can do this in TensorFlow using the function

tf.clip_by_norm(t, clip_norm, axes=None, name=None)

This normalises t so that its L2-norm is less than or equal to clip_norm. This operation is typically used to clip gradients before applying them with an optimizer; a minimal sketch of that pattern is included at the end of this post.

You can find me on twitter @bhutanisanyam1

Subscribe to my Newsletter for a weekly curated list of Deep Learning and Computer Vision articles. Here and Here are two articles on my Learning Path to Self Driving Cars.
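To make the clipping step concrete, here is a minimal sketch of clipping gradients by their L2 norm before applying them with an optimizer. It assumes a TensorFlow 1.x-style graph and session; the variable w, the toy quadratic loss, and the threshold value are illustrative placeholders, not part of the original post.

```python
import tensorflow as tf

# Toy setup (hypothetical names): a single weight vector and a quadratic loss.
w = tf.Variable([3.0, 4.0], name="w")
loss = tf.reduce_sum(tf.square(w))

optimizer = tf.train.GradientDescentOptimizer(learning_rate=0.1)

# Compute the raw gradients, then clip each one so its L2 norm is at most `threshold`.
threshold = 1.0
grads_and_vars = optimizer.compute_gradients(loss)
clipped_grads_and_vars = [
    (tf.clip_by_norm(g, clip_norm=threshold), v)
    for g, v in grads_and_vars
    if g is not None
]

# Apply the clipped gradients instead of the raw ones.
train_op = optimizer.apply_gradients(clipped_grads_and_vars)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # The raw gradient of sum(w**2) is 2*w = [6, 8], with L2 norm 10.
    # Since 10 > threshold, the applied gradient is rescaled to [0.6, 0.8],
    # i.e. gradients * threshold / l2_norm(gradients).
    sess.run(train_op)
    print(sess.run(w))  # [2.94, 3.92]: w moved by learning_rate * clipped gradient
```

When the raw gradient's L2 norm exceeds the threshold, tf.clip_by_norm rescales it by threshold / l2_norm, which matches the new_gradients formula above; gradients already below the threshold pass through unchanged.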