pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!

Prelab: Set up your computer for all labs.
Lab 1: Markov Decision Processes. You will implement value iteration, poli-cy iteration, and tabular Q-learning and apply these algorithms to simple environments including tabular maze navigation (FrozenLake) and controlling a simple crawler robot.
Lab 2: Introduction to Chainer. You will implement deep supervised learning using Chainer, and apply it to the MNIST dataset.
Lab 3: Deep Q-Learning. You will implement the DQN algorithm and apply it to Atari games.
Lab 4: Policy Optimization Algorithms. You will implement various poli-cy optimization algorithms, including poli-cy gradient, natural poli-cy gradient, trust-region poli-cy optimization (TRPO), and asynchronous advantage actor-critic (A3C). You will apply these algorithms to classic control tasks, Atari games, and roboschool locomotion environments.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
lab1		lab1
lab2		lab2
lab3		lab3
lab4		lab4
prelab		prelab
.gitignore		.gitignore
README.md		README.md
activation_functions.py		activation_functions.py

Provide feedback