#deepspeed-library

An implementation of model-parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries.

Python
7161
9 hours ago