#lora

labmlai/annotated_deep_learning_paper_implementations

🧑‍🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Python
60091
8 months ago
unslothai/unsloth

Finetune Llama 4, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python
37252
1 day ago
ymcui/Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Python
18803
1 year ago

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python
18143
2 days ago

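"A Practical Guide to Open-Source Large Models": tutorials tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment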

Jupyter Notebook
14795
1 day ago

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python
11767
4 months ago

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue large language model)

HTML
8125
6 months ago

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook
7314
1 year ago

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen2.5, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).

Python
7030
2 days ago

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python
6342
6 months ago

SD-Trainer: LoRA & Dreambooth training scripts and GUI for diffusion models, using kohya-ss's trainer.

Python
5218
1 month ago

This repository contains the official firmware for Meshtastic, an open-source, off-grid mesh communication system.

C++
4567
4 hours ago

ESP32/ESP8285-based High-Performance Radio Link for RC applications

C++
4020
14 hours ago

A fine-tuning approach based on ChatGLM-6B + LoRA

Python
3763
1 year ago
1technophile/OpenMQTTGateway

MQTT gateway for ESP8266 or ESP32 with bidirectional 433mhz/315mhz/868mhz, Infrared communications, BLE, Bluetooth, beacons detection, mi flora, mi jia, LYWSD02, LYWSD03MMC, Mi Scale, TPMS, BBQ thermometer compatibility & LoRa.

C++
3741
7 hours ago

Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT

Python
3697
2 years ago

Generate, animate and schedule your AI characters 🤖

TypeScript
3379
8 days ago