Repository navigation

unsloth

Website
Wikipedia

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

fine-tuning llama 大语言模型 mistral gemma llama3 unsloth deepseek deepseek-r1 gemma3 text-to-speech tts qwen qwen3 agent openai gpt-oss voice-cloning reinforcement-learning

Python

46566

3807

2 天前

unslothai / notebooks

100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more.

unsloth

Jupyter Notebook

3700

514

1 天前

lukehinds / deepfabric

Create large-scale synthetic training data for model distillation and evaluation

人工智能数据科学 dataset huggingface 机器学习 synthetic-data agents distillation evaluation fine-tuning kaggle unsloth

Python

581

3 天前

neural-maze / rick-llm

Make Llama 3.1 8B talk in Rick Sanchez’s style

llama3 大语言模型 ollama unsloth huggingface

Jupyter Notebook

116

9 个月前

GAD-cell / vlm-grpo

An implementation of GRPO for Unsloth's VLMs training

grpo huggingface unsloth vlm reinforcement-learning

Python

2 个月前

sinanuozdemir / oreilly-pytorch-dl

Code for Deep Learning for Modern AI

bert 深度学习 llama3 大语言模型 neural-networks distillation llama mnist quantization clip diffusion dreambooth multimodal unsloth

Jupyter Notebook

7 个月前

Breeze648 / MedCoT-7B

本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调，通过 QLoRA 量化和 Unsloth 加速训练，显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势，实现高效、准确且具有解释性的医学问答系统。

人工智能 chain-of-thought deepseek-r1 distillation 大语言模型 lora medical-application 自然语言处理 qlora qwen unsloth

Python

7 个月前

qqqqqf-q / Qing-Digital-Self

数字分身项目,并且包含了搭建(复现)教程 Qing's digital self, including setup tutorial

人工智能聊天机器人大语言模型 qlora qwen unsloth digital-twin finetune finetune-llm huggingface

Python

6 天前

Cre4T3Tiv3 / unsloth-llama3-alpaca-lora

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-edge parameter-efficient fine-tuning with Unsloth integration.

4bit alpaca colab finetuning gradio huggingface instruction-tuning llama3 大语言模型 lora Open Source peft qlora transformers unsloth

Jupyter Notebook

3 个月前

shaheennabi / Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋

finetuning gguf huggingface meta qlora quantization Open Source production-ready unsloth gpu inference training peft

Jupyter Notebook

8 个月前

Eviltr0N / Make-AI-Clone-of-Yourself

Cloning Yourself using your whatsapp chat history and training a model on it.

人工智能 finetuning llama3 llama3-finetune unsloth WhatsApp whatsapp-clone 大语言模型 ollama

Jupyter Notebook

1 年前

0xZee / DeepSeek-R1-FineTuning

Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation

deepseek-r1 lora qlora reinforcement-learning unsloth

Jupyter Notebook

8 个月前

deep-div / Fine-Tuning-LLMs-and-VisionModels

Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to fine-tuning various large language models using popular frameworks. Includes examples, scripts, and tips for efficient training on custom datasets.

deepseek finetuning-llms gemma generative-ai huggingface large-language-models llama 大语言模型 transformers unsloth

Jupyter Notebook

1 个月前

alisonmitchell / Biomedical-Knowledge-Graph

Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.

arxiv coreference-resolution groq knowledge-graph llamaindex named-entity-recognition relation-extraction unsloth langchain biomedical

Jupyter Notebook

10 个月前

QuangNguyen2910 / AutClothingChatbot

PTIT's Major Project: Website Programming - This repo contains a chatbot for a clothing store. The chatbot acts as an employee with specific knowledge about clothing consultation, website support, and store information.

聊天机器人 langchain 大语言模型 rag unsloth vector-database

Jupyter Notebook

1 年前

muhammad-fiaz / finetune-web-ui

Finetune Web UI is a user-interface for training and deploying pre-trained models.

datasets fine-tuning finetune finetuning-llms generative-ai gpt huggingface large-language-models transformers unsloth gradio

Python

2 个月前

IAmSkyDra / finetune-quantize-llms

Materials for CSE Summer School Hackathon 2024

大语言模型 unsloth

Jupyter Notebook

1 年前

SrikarVeluvali / Astor-AI

AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, ensuring privacy and security in every interaction. Designed for ease of use, AstorAI provides reliable health information on various topics 24/7.

Flask huggingface llama3 大语言模型 MongoDB ollama React transformers unsloth

Jupyter Notebook

1 年前