ashworks1706/rlhf-from-scratch

A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.

No README found.

Jupyter Notebook97.6%Python2.4%

Apache License 2.0

Created September 14, 2025

Updated March 8, 2026