Top Repositories
PeRL: Parameter-Efficient Reinforcement Learning
FeatureAlignment = Alignment + Mechanistic Interpretability
PyTorch implementation of StableMask (ICML'24)
This is an open collector of useful tutorials and other websites
EMNLP'24 Findings
Repositories (38)
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
PeRL: Parameter-Efficient Reinforcement Learning
slime is an LLM post-training framework for RL Scaling.
verl: Volcano Engine Reinforcement Learning for LLMs
No description provided.
FeatureAlignment = Alignment + Mechanistic Interpretability
EMNLP'24 Findings
PyTorch implementation of StableMask (ICML'24)
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
This is an open collector of useful tutorials and other websites
Listing of papers about machine learning for proteins.
TonyCrane's Public Notebook
No description provided.