JonasGeiping/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

No README found.

Languages

Python 86.0%, C++ 11.3%, Cuda 1.0%, C 0.8%, Dockerfile 0.7%, Shell 0.3%, Makefile 0.0%
Apache License 2.0
Created April 25, 2023
Updated April 25, 2023