#llm-training

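This project aims to share the technical principles behind large language models along with hands-on experience (LLM engineering and deploying LLM applications).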

HTML
21137
2 months ago

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Python
4651
9 days ago

Code examples and resources for DBRX, a large language model developed by Databricks

Python
2568
1 year ago

MoBA: Mixture of Block Attention for Long-Context LLMs

Python
1911
6 months ago

DLRover: An Automatic Distributed Deep Learning System

Python
1559
6 days ago

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python
959
2 days ago

A PyTorch Native LLM Training Framework

Python
874
22 days ago

LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.

Jupyter Notebook
701
9 months ago

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python
571
16 days ago

A collection of ICLR papers and open-source projects over the years, covering ICLR 2021, ICLR 2022, ICLR 2023, ICLR 2024, and ICLR 2025.

448
7 months ago

The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.

Python
446
1 year ago