UbeCc (Haoran Wang) · GitHub

Haoran Wang UbeCc

I am not a beast of burden. I am a LLaMA! (And I am not Nailong.)

32 followers · 109 following

Tsinghua University
Beijing, China
09:23 (UTC +08:00)
@UbecWang

Achievements

Highlights

Pro

Pinned

OpenRLHF/OpenRLHF Public

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python · 4k stars · 393 forks

dlo-drl2024 Public

Project for the Deep Reinforcement Learning course, Spring 2024, Tsinghua University.

Python · 4 stars

Generalization-of-Transformers Public

Code for paper "Generalization of Transformers with In-Context Learning: An Empirical Study"

Python