triston-lee / gpt-neox
This project is forked from eleutherai/gpt-neox.
An implementation of model-parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
License: Apache License 2.0