Repository navigation

#

distributed-training

GokuMohandas/Made-With-ML
Jupyter Notebook
42070
1 年前

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Python
35065
11 天前

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++
23145
17 小时前
Python
8533
6 小时前
IDEA-CCNL/Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python
4143
1 年前

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

Python
3917
9 天前

A high performance and generic framework for distributed DNN training

Python
3695
2 年前
determined-ai/determined

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Go
3177
5 个月前

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

Python
2238
16 天前

DLRover: An Automatic Distributed Deep Learning System

Python
1524
1 天前

Collective communications library with various primitives for multi-machine training.

C++
1342
23 天前

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

C++
1121
7 个月前
Jupyter Notebook
881
4 个月前

Best practices & guides on how to write distributed pytorch training code

Python
467
6 个月前

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

Jupyter Notebook
461
1 年前
Python
446
2 年前