Skip to content
View twni2016's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Highlights

  • Pro

Block or report twni2016

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. llm-reasoning-uft llm-reasoning-uft Public

    Code for Teaching Large Language Models to Reason through Learning and Forgetting

    Python 8

  2. self-predictive-rl self-predictive-rl Public

    Bridging State and History Representations: Understanding Self-Predictive RL -- ICLR 2024

    Jupyter Notebook 20 2

  3. Memory-RL Memory-RL Public

    When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)

    Python 62 5

  4. pomdp-baselines pomdp-baselines Public

    Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

    Python 319 45

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy