Repository navigation

#

finetuning-llms

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

Jupyter Notebook
481
5 个月前

Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).

Python
102
10 个月前

🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨

Python
94
1 年前

On Memorization of Large Language Models in Logical Reasoning

Python
71
5 个月前

Deploy any AI model, agents, database, RAG, and pipeline locally in minutes

Python
67
4 小时前

IndexTTS Fine-tuning notebooks

Jupyter Notebook
50
2 个月前

MediNotes: SOAP Note Generation through Ambient Listening, Large Language Model Fine-Tuning, and RAG

Python
47
4 天前

Fine-tune any Hugging Face LLM or VLM on day-0 using PyTorch-native features for GPU-accelerated distributed training with superior performance and memory efficiency.

Python
41
10 小时前

A Gradio web UI for Large Language Models. Supports LoRA/QLoRA finetuning,RAG(Retrieval-augmented generation) and Chat

Python
36
2 年前

Fine-tune Mistral 7B to generate fashion style suggestions

Python
34
2 年前

A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-tuning.

Python
33
1 个月前

[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning

Python
31
2 年前

Medical Language Model fine-tuned using pretraining, instruction tuning, and Direct Preference Optimization (DPO). Progresses from general medical knowledge to specific instruction following, with experiments in preference alignment for improved medical text generation and understanding.

Jupyter Notebook
29
1 年前

SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.

Python
23
1 年前