model-merging

Tools for merging pretrained large language models.

Python
5563
10 days ago

FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion

Python
121
1 day ago

Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)

Python
88
2 years ago

AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.

Python
77
6 months ago

ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse

Python
51
2 years ago

Representation Surgery for Multi-Task Model Merging. ICML, 2024.

Python
44
6 months ago

Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]

Python
43
6 months ago

DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling

Python
31
9 months ago

[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models

Jupyter Notebook
24
6 months ago

Exploring Model Kinship for Merging Large Language Models

Python
23
4 days ago

Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic

Python
23
3 months ago

[ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository)

Python
20
9 months ago

[ICLR 2025] CAMEx: Curvature-Aware Merging of Experts

Python
18
2 months ago

flow-merge is a Python library for merging multiple transformer-based language models using popular merge methods such as model soups, SLERP, TIES-Merging, or DARE.

Python
17
2 months ago

Flexible library for merging large language models (LLMs) via evolutionary optimization.

Jupyter Notebook
16
3 hours ago

Official PyTorch implementation of LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation.

Python
13
9 days ago

A model merging project for generalizing Featured Finite State Machines (FFSMs) to unify behaviors across Software Product Lines (SPLs)

Java
10
6 months ago

Merge transformers without using like a bajillion GB of RAM

Python
10
2 years ago