Repository navigation
model-merging
- Website
- Wikipedia
Tools for merging pretrained large language models.
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
All-in-one UI for merged LLMs in Hugging Face
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
Exploring Model Kinship for Merging Large Language Models
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
[ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository)
[ICLR 2025] CAMEx: Curvature-Aware Merging of Experts
flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popular merge methods such as model soups, SLERP, ties-MERGING or DARE.
Flexible library for merging large language models (LLMs) via evolutionary optimization.
Official PyTorch implementation of LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation.
A model merging project for generalizing Featured Finite State Machines (FFSMs) to unify behaviors across Software Product Lines (SPLs)
Merge transformers without using like a bajillion GB of RAM