GitHunt
AR

ArmaanSethi/Reinforcement-Learning-From-Human-Feedback

A conceptual and hands-on introduction to tuning and evaluating large language models (LLMs) using Reinforcement Learning from Human Feedback.

Reinforcement Learning From Human Feedback

A conceptual and hands-on introduction to tuning and evaluating large language models (LLMs) using Reinforcement Learning from Human Feedback.

  • Get a conceptual understanding of Reinforcement Learning from Human Feedback (RLHF), as well as the datasets needed for this technique
  • Fine-tune the Llama 2 model using RLHF with the open source Google Cloud Pipeline Components Library
  • Evaluate tuned model performance against the base model with evaluation methods

Course: https://learn.deeplearning.ai/reinforcement-learning-from-human-feedback

Shared Certificate: https://learn.deeplearning.ai/accomplishments/16085575-20fa-4550-8887-9a6688088b7e

Languages

Jupyter Notebook93.7%Python6.3%

Contributors

Created June 21, 2024
Updated June 21, 2024