#phi-3-vision

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python
2547
6 days ago

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python
836
9 months ago

Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon

Jupyter Notebook
265
7 months ago

Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets that include synthetic data and filtered publicly available websites, with a focus on very high-quality, reasoning-dense data for both text and vision.

Jupyter Notebook
33
4 months ago

Microsoft Phi-3 Vision, the first multimodal model by Microsoft: demo with Hugging Face

0
1 year ago

Phi-3-Vision-128K-Instruct Demo

Jupyter Notebook
0
10 months ago

Microsoft Phi-3 Vision, the first multimodal model by Microsoft: demo with Hugging Face

Jupyter Notebook
0
1 year ago