Skip to content

rldotai/rl-algorithms

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

rl-algorithms

Reinforcement learning algorithms.

There are many different variants on the basic ideas of reinforcement learning. I have implemented some of them, with a focus on linear function approximation.

Extending these algorithms (for example, with nonlinear function approximators such as neural nets) is relatively straightforward once you are familiar with the underlying ideas.

To facilitate this, the algorithms listed are written in a straightforward style and thoroughly commented, with references to the relevant papers and some explanation of the reasoning behind the code.

Implemented Algorithms

TODO

  • Q-Learning
  • SARSA
  • Distributional RL algorithms
  • Other second-order TD algorithms (e.g., NTD)
  • Actor-Critic algorithms

Contributing

Send me a pull request if you have code to contribute.

Alternatively, raise an issue and provide me with a link to the paper describing the algorithm, and I will read and implement it when I get a chance.

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy