deepspeed
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
An open-source knowledgeable large language model framework.
Best practices & guides on how to write distributed PyTorch training code
Personal project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train your own 8B/14B LLaVA-style MLLM on a 24 GB RTX 3090/4090.
Large Language Models for All, 🦙 Cult and More. Stay in touch!
Guide: finetune GPT2-XL (1.5 billion parameters) and GPT-Neo (2.7B) on a single GPU with Hugging Face Transformers using DeepSpeed
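For orientation, a minimal sketch of the kind of setup such a guide covers: pointing the Hugging Face Trainer at a DeepSpeed ZeRO-2 config with optimizer-state CPU offload. The model name, toy data, and hyperparameters below are illustrative assumptions, not taken from the guide itself.

```python
# Minimal sketch, not the guide's actual script: fine-tune a GPT-2-family model
# on one GPU by handing the Hugging Face Trainer a DeepSpeed ZeRO-2 config with
# optimizer-state CPU offload. Model name, data, and hyperparameters are assumed.
# Launch with:  deepspeed --num_gpus=1 finetune_sketch.py
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_name = "gpt2"  # stand-in for GPT2-XL / GPT-Neo 2.7B
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Toy dataset purely so the sketch is self-contained.
texts = ["DeepSpeed lets billion-parameter models fine-tune on a single GPU."] * 32
dataset = Dataset.from_dict({"text": texts}).map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=64),
    batched=True, remove_columns=["text"])

ds_config = {
    "fp16": {"enabled": True},
    "optimizer": {"type": "AdamW",
                  "params": {"lr": "auto", "betas": "auto",
                             "eps": "auto", "weight_decay": "auto"}},
    "scheduler": {"type": "WarmupLR",
                  "params": {"warmup_min_lr": "auto", "warmup_max_lr": "auto",
                             "warmup_num_steps": "auto"}},
    "zero_optimization": {"stage": 2,
                          "offload_optimizer": {"device": "cpu"}},
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
}

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    learning_rate=5e-5,
    warmup_steps=10,
    fp16=True,
    deepspeed=ds_config,  # the Trainer fills the "auto" fields and delegates sharding/offload to DeepSpeed
)

Trainer(
    model=model,
    args=args,
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
).train()
```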
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
DeepSpeed tutorials & annotated examples & study notes (efficient large-model training)
LLaMA-2 fine-tuning with DeepSpeed and LoRA
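A minimal sketch of the LoRA side of such a setup using the PEFT library; the model name, adapter rank, and target modules are illustrative assumptions, not this repository's actual script. The wrapped model can then be trained with a DeepSpeed-enabled Trainer as sketched above.

```python
# Minimal sketch (assumed values, not this repo's code): attach a LoRA adapter
# to a LLaMA-style causal LM with PEFT, so only the small adapter matrices train.
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # gated checkpoint; any causal LM works for the sketch
    torch_dtype=torch.float16,
)

lora_cfg = LoraConfig(
    r=8,                                  # adapter rank (illustrative)
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # LLaMA attention projections
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, lora_cfg)
model.print_trainable_parameters()  # only the LoRA matrices require gradients
```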
A full pipeline to fine-tune the ChatGLM LLM with LoRA and RLHF on consumer hardware. An implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the ChatGLM architecture; basically ChatGPT, but with ChatGLM.
Train LLMs (BLOOM, LLaMA, Baichuan2-7B, ChatGLM3-6B) with DeepSpeed in pipeline-parallel mode; faster than ZeRO/ZeRO++/FSDP.
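A minimal sketch of DeepSpeed's pipeline mode with assumed toy layers and config, not this repository's code: the layer list is partitioned across pipeline stages and trained with micro-batched `train_batch()` calls.

```python
# Minimal sketch (assumed layer sizes and config, not this repo's code):
# express a model as a DeepSpeed PipelineModule so its layers are split across
# pipeline stages, then train with micro-batched steps.
# Launch with:  deepspeed --num_gpus=2 pipeline_sketch.py
import deepspeed
import torch
import torch.nn as nn
from deepspeed.pipe import PipelineModule

deepspeed.init_distributed()  # PipelineModule needs torch.distributed initialized

# A toy stack of layers; PipelineModule partitions this list across stages.
layers = []
for _ in range(8):
    layers += [nn.Linear(1024, 1024), nn.ReLU()]
layers.append(nn.Linear(1024, 10))

net = PipelineModule(layers=layers, num_stages=2, loss_fn=nn.CrossEntropyLoss())

ds_config = {
    "train_batch_size": 32,               # = micro_batch * grad_accum * data-parallel degree
    "train_micro_batch_size_per_gpu": 4,  # micro-batches keep both stages busy
}

engine, _, _, _ = deepspeed.initialize(
    model=net,
    optimizer=torch.optim.Adam(net.parameters(), lr=1e-3),  # local partition's params only
    config=ds_config,
)

def batches():
    # Toy (input, label) pairs; the first stage consumes inputs, the last stage the labels.
    while True:
        yield torch.randn(4, 1024), torch.randint(0, 10, (4,))

data_iter = batches()
for _ in range(10):
    loss = engine.train_batch(data_iter=data_iter)
```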
llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource management, monitoring, and more.
Scripts for LLM pre-training and fine-tuning (with/without LoRA, DeepSpeed)