Repository navigation

#

unsloth

unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python
44400
3 小时前

100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more.

Jupyter Notebook
3443
7 小时前

Make Llama 3.1 8B talk in Rick Sanchez’s style

Jupyter Notebook
116
7 个月前

An implementation of GRPO for Unsloth's VLMs training

Python
67
13 天前

本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实现高效、准确且具有解释性的医学问答系统。

Python
22
5 个月前

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋

Jupyter Notebook
21
7 个月前

Fine-tuned 4-bit LoRA adapter for LLaMA 3 using Alpaca-style and QLoRA-grounded instructions, built with Unsloth for fast local training.

Jupyter Notebook
20
1 个月前

清凤的数字分身,并且包含了搭建教程 Qing's digital self, including setup tutorial

Python
15
16 小时前

Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation

Jupyter Notebook
14
6 个月前

Cloning Yourself using your whatsapp chat history and training a model on it.

Jupyter Notebook
14
1 年前

Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to fine-tuning various large language models using popular frameworks. Includes examples, scripts, and tips for efficient training on custom datasets.

Jupyter Notebook
14
3 个月前

Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.

Jupyter Notebook
14
8 个月前

PTIT's Major Project: Website Programming - This repo contains a chatbot for a clothing store. The chatbot acts as an employee with specific knowledge about clothing consultation, website support, and store information.

Jupyter Notebook
12
1 年前

Finetune Web UI is a user-interface for training and deploying pre-trained models.

Python
10
14 天前

Materials for CSE Summer School Hackathon 2024

Jupyter Notebook
9
9 个月前

AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, ensuring privacy and security in every interaction. Designed for ease of use, AstorAI provides reliable health information on various topics 24/7.

Jupyter Notebook
9
9 个月前

Finetuning of Gemma-2 2B for structured output

Jupyter Notebook
9
1 年前

LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.

Jupyter Notebook
8
6 个月前

ResurrectAI is an AI-driven chat application designed to bring the wisdom and knowledge of great historical personalities to life. Leveraging advanced language models and fine-tuning techniques, ResurrectAI enables users to interact with AI avatars of iconic figures, gaining access to their insights, guidance, and philosophical teaching in realtime

Dart
8
10 个月前