FANFAN

Fanshaoliu

PhD, Fudan University

Fudan

Shanghai

Languages

Python 100%

Repos

Stars

Forks

Top Language

Python

Top Repositories

PURE

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

RL_tf2

A TensorFlow 2 rewrite of Morvan's (莫烦) RL code

VIMABench

Official Task Suite Implementation of Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

0 stars, Python

pylift

Uplift modeling and evaluation library. Actively maintained pypi version.

0 stars, Python

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

0 stars, Python

Repositories

Fanshaoliu/PURE (Fork)

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

1 star, 0 forks, updated 3 months ago

Fanshaoliu/OpenRLHF

No description provided.

Python, 0 stars, 0 forks, updated 5 months ago

Fanshaoliu/VIMABench (Fork)

Official Task Suite Implementation of Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python, 0 stars, 0 forks, updated 2 years ago

Fanshaoliu/RL_tf2

A TensorFlow 2 rewrite of Morvan's (莫烦) RL code

Python, 1 star, 0 forks, updated 3 years ago

Fanshaoliu/pylift (Fork)

Uplift modeling and evaluation library. Actively maintained pypi version.

Python, 0 stars, 0 forks, updated 2 years ago

Fanshaoliu/decision-transformer (Fork)

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python, 0 stars, 0 forks, updated 2 years ago

Fanshaoliu/hiro_pytorch

No description provided.

Python, 0 stars, 0 forks, updated 4 years ago
