Skip to content

An implementation of the Normalized Advantage Function Reinforcement Learning Algorithm with Prioritized Experience Replay

License

Notifications You must be signed in to change notification settings

MathPhysSim/PER-NAF

Repository files navigation

PER-NAF

An implementation of the Normalized Advantage Function Reinforcement Learning Algorithm with Prioritized Experience Replay

Summary

Thanks openAI and Kim!

Some Advices from experience in RL

  • Normalize the state and action space as well as the reward is a good practice
  • Visualise as much as possible to get an intuition about the method as possible bugs
  • If it does not make sense it is a bug with very high probability

Coding makes happy 🙃

About

An implementation of the Normalized Advantage Function Reinforcement Learning Algorithm with Prioritized Experience Replay

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 2

  •  
  •  

Languages

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy