Repository navigation

#

deepspeed-library

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python
7281
1 个月前