Hamish Ivison
hamishivi
『Antipodean Abroad, Amateur Human』
Top Repositories
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"
Exploration of automated dataset selection approaches at large scales.
Generating flashcards from lecture notes
Generating Pokémon (and other things) with GANs
training recipes for hf
Repositories (38)
personal blog :)
Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"
Official Repository of Absolute Zero Reasoner
Generating flashcards from lecture notes
Exploration of automated dataset selection approaches at large scales.
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
Tools for merging pretrained large language models.
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
AllenAI's post-training codebase
A Simulation Framework for RLHF and alternatives.
No description provided.
Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"
DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Generating Pokémon (and other things) with GANs
AI Logging for Interpretability and Explainability🔬
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
training recipes for hf
Quick minimal debug script.
Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Quick gantry test
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
Task-based datasets, preprocessing, and evaluation for sequence models.
A PyTorch implementation of a stack neural module network.
No description provided.
A recommendation website for your Steam backlog
A freeCodeCamp stock-watching web app
No description provided.
A collection of freeCodeCamp microservices
A freeCodeCamp project