Repository navigation

llm-training

Website
Wikipedia

Find secrets with Gitleaks 🔑

安全 Git Go secret gitleaks devsecops Hacktoberfest CI/CD 命令行界面 data-loss-prevention dlp Open Source ai-powered 大语言模型 llm-inference llm-training

23513

1799

9 天前

liguodongiot / llm-action

本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）

大语言模型 llm-inference llm-serving llm-training llmops

HTML

21137

2483

2 个月前

ludwig-ai / ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

深度学习 deep learning 机器学习自然语言处理 natural-language 机器视觉 data-centric 数据科学 PyTorch 神经网络大语言模型 llm-training fine-tuning llama mistral llama2

Python

11595

1219

12 天前

skypilot-org / skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).

Python

8793

798

4 小时前

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

llm-training triton finetuning gemma2 llama llama3 大语言模型 mistral phi3 Hacktoberfest

Python

5711

410

2 天前

h2oai / h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

人工智能聊天机器人 ChatGPT fine-tuning finetuning generative generative-ai gpt llama llama2 大语言模型 llm-training

Python

4651

494

9 天前

databricks / dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

databricks gen-ai generative-ai 大语言模型 llm-inference llm-training mosaic-ai

Python

2568

243

1 年前

MoonshotAI / MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

flash-attention 大语言模型 llm-serving llm-training moe PyTorch transformer

Python

1911

114

6 个月前

intelligent-machine-learning / dlrover

DLRover: An Automatic Distributed Deep Learning System

distributed-training Kubernetes llm-training Hacktoberfest

Python

1559

198

6 天前

utkuozdemir / nvidia_gpu_exporter

Nvidia GPU exporter for prometheus using nvidia-smi binary

prometheus prometheus-exporter Nvidia 监控人工智能加密货币 gaming 大语言模型 llm-training

1273

134

3 天前

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

llm-agents llm-training reinforcement-learning large-language-models deepseek-r1 grpo agent-framework

Python

959

2 天前

volcengine / veScale

A PyTorch Native LLM Training Framework

llm-training PyTorch

Python

874

22 天前

sail-sg / Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

bert-model convnext 深度学习 fairseq optimizer resnet timm vit transformer-xl 人工智能 diffusion dreamfusion gpt2 PyTorch cuda-programming llm-training 大语言模型 moe

Python

798

4 个月前

ghimiresunil / LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing

LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.

bert huggingface large-language-models llm-inference llm-training Open Source transformers

Jupyter Notebook

701

118

9 个月前

feifeibear / long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

attention-is-all-you-need llm-inference llm-training PyTorch

Python

571

16 天前

rohan-paul / LLM-FineTuning-Large-Language-Models

LLM (Large Language Model) FineTuning

gpt-3 gpt3-turbo large-language-models llama2 大语言模型 llm-inference llm-serving llm-training mistral-7b PyTorch

Jupyter Notebook

562

136

6 个月前

anarchy-ai / LLM-VM

irresponsible innovation. Try now at https://chat.dev/

人工智能深度学习 distillation 大语言模型 llm-agent llm-inference llm-local llm-training 机器学习

Python

487

137

1 年前

mallorbc / Finetune_LLMs

Repo for fine-tuning Casual LLMs

Docker falcon gpt gpt-3 gpt-35-turbo gpt-4 llama llama2 大语言模型 llm-training mpt

Python

455

2 年前

yinizhilian / ICLR2025-Papers-with-Code

历年ICLR论文和开源项目合集，包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.

iclr2024 llm-agent llm-training 大语言模型 transformer gpt llama3 机器学习 Python llm-framework 自然语言处理

448

7 个月前

FlagAI-Open / Aquila2

The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.

大语言模型 llm-inference llm-training

Python

446

1 年前