RL Course by David Silver – Lecture 7: Policy Gradient Methods Channel: Google DeepMind312K views • 28/06/2016