finetuning-llms
Create Custom LLMs
Mastering Applied AI, One Concept at a Time
Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"
End-to-end generative AI industry projects on LLMs, with deployment (Awesome LLM Projects)
Train Large Language Models on MLX.
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
On Memorization of Large Language Models in Logical Reasoning
Deploy AI models, agents, databases, RAG, and pipelines locally in minutes
IndexTTS Fine-tuning notebooks
Enhancing LLMs with LoRA
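As a rough illustration of the LoRA technique several of these repositories build on, here is a minimal sketch using Hugging Face's peft library; the base model (gpt2), target modules, and hyperparameters are placeholder assumptions, not taken from any repository above.

```python
# Minimal LoRA fine-tuning setup with peft; all names here are
# illustrative assumptions, not a specific repo's configuration.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder base model

config = LoraConfig(
    r=8,                        # rank of the low-rank update matrices A and B
    lora_alpha=16,              # scaling applied to the update (alpha / r)
    target_modules=["c_attn"],  # GPT-2's fused attention projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, config)
model.print_trainable_parameters()  # only the small adapter matrices train
```

The frozen base weights plus a low-rank update is what makes this parameter-efficient: only the adapter matrices receive gradients, so memory and storage costs drop sharply compared with full fine-tuning.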
MediNotes: SOAP Note Generation through Ambient Listening, Large Language Model Fine-Tuning, and RAG
Fine-tune any Hugging Face LLM or VLM on day-0 using PyTorch-native features for GPU-accelerated distributed training with superior performance and memory efficiency.
A Gradio web UI for Large Language Models. Supports LoRA/QLoRA fine-tuning, RAG (retrieval-augmented generation), and chat
Fine-tune Mistral 7B to generate fashion style suggestions
A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-tuning.
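For context, Q&A datasets like the ones such an app produces are commonly stored as chat-style JSONL. The sketch below shows that convention; the schema is a widely used fine-tuning format, an assumption rather than the app's documented output.

```python
# Hedged sketch: writing Q&A pairs as chat-style JSONL for fine-tuning.
# The "messages" schema is a common convention, not this app's exact format.
import json

pairs = [
    {"q": "What is QLoRA?",
     "a": "LoRA fine-tuning applied to a 4-bit quantized base model."},
]

with open("train.jsonl", "w", encoding="utf-8") as f:
    for p in pairs:
        record = {"messages": [
            {"role": "user", "content": p["q"]},
            {"role": "assistant", "content": p["a"]},
        ]}
        f.write(json.dumps(record) + "\n")  # one JSON object per line
```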
the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
Medical Language Model fine-tuned using pretraining, instruction tuning, and Direct Preference Optimization (DPO). Progresses from general medical knowledge to specific instruction following, with experiments in preference alignment for improved medical text generation and understanding.
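To make the DPO stage concrete, here is a hedged sketch using trl's DPOTrainer; the base model (gpt2) and the toy preference triple are placeholders, not the medical-LM artifacts themselves.

```python
# Hedged DPO sketch with trl; model and data below are illustrative only.
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token

# DPO trains on preference triples: a prompt, a preferred ("chosen")
# response, and a dispreferred ("rejected") one.
train_dataset = Dataset.from_dict({
    "prompt": ["Summarize the patient's chief complaint."],
    "chosen": ["The patient reports three days of worsening chest pain."],
    "rejected": ["Chest stuff."],
})

args = DPOConfig(output_dir="dpo-sketch", beta=0.1)  # beta limits drift from the reference policy
trainer = DPOTrainer(model=model, args=args, train_dataset=train_dataset,
                     processing_class=tokenizer)  # trl >= 0.12; older versions use tokenizer=
trainer.train()
```

DPO optimizes the policy directly on preference pairs, skipping the separate reward-model and PPO stages of classic RLHF, which is why it often appears as the final alignment step after instruction tuning.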
SEIKO is a novel reinforcement learning method for efficiently fine-tuning diffusion models in an online setting. It outperforms all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.