Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
HW1		HW1
HW2		HW2
HW3		HW3
.gitignore		.gitignore
README.md		README.md
Review.pdf		Review.pdf

Repository files navigation

reinforcement_learning

projets for reinforcement learning class

contains:

HW1: first homework (Reinforcement Learning Basics)

Do-it-yourself implementation of value iteration, policy iteration

HW2: second homework (Multi-Armed Bandit)

Do-it-yourself implementation of Multi-Armed Bandit algorithms : UCB (upper confidence bound), greedy policy...

HW3: second homework (Reinforcement Learning)

Do-it-yourself on :

On-Policy Reinforcement Learning with Parametric Policy
Off-Policy Reinforcement Learning with Value Function Approximation

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Alternative Proxies:

Alternative Proxy