Repos
7
Stars
2
Forks
0
Top Language
Python
Top Repositories
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
A TensorFlow 2 rewrite of Morvan's (莫烦) RL code
Official Task Suite Implementation of Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Uplift modeling and evaluation library. Actively maintained pypi version.
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Repositories
7
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
No description provided.
Official Task Suite Implementation of Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
A TensorFlow 2 rewrite of Morvan's (莫烦) RL code
Uplift modeling and evaluation library. Actively maintained pypi version.
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
No description provided.