Repository navigation

#

sft

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

TypeScript
8084
7 小时前

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen2.5, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).

Python
7034
2 天前

chatglm 6b finetuning and alpaca finetuning

Python
1542
1 个月前

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions. (DeepSeek-V3/R1 满血版 671B 全参数微调的开源解决方案,包含从训练到推理的完整代码和脚本,以及实践中积累一些经验和结论。)

Python
649
1 个月前

聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

Python
626
2 年前

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

Jupyter Notebook
541
7 个月前

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Python
528
6 个月前

Awesome-RAG: Collect typical RAG papers and systems.

359
3 个月前

Ethereum Semi Fungible Standard (ERC-1155)

TypeScript
321
5 个月前

ERC-3525 Reference Implementation

Solidity
112
1 年前

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Python
105
6 个月前

ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory

Python
89
14 天前

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Python
80
5 个月前

SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for papers, thereby assisting researchers in improving the quality of their work.

Python
61
5 个月前

moss chat finetuning

Python
50
1 年前

https://twitter.com/MoveScriptions

Move
46
7 个月前

本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。

Python
44
1 年前

Fine-Tuning Dataset Auto-Generation for Graph Query Languages.

Python
35
1 个月前

Elven Tools CLI - command line tool for launching NFTs collections on the MultiversX blockchain (Plus other tools).

TypeScript
24
9 个月前

LDPC MATLAB simulation using BPSK + AWGN modulation decoded using Sum Product and Min Sum Algorithm

MATLAB
20
10 个月前