GitHunt

ashworks1706/rlhf-from-scratch

A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and its applications in Large Language Models, from scratch.

No README found.

Languages

Jupyter Notebook 97.6%, Python 2.4%

Apache License 2.0
Created September 14, 2025
Updated March 8, 2026