Repository navigation

#

molmo

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python
1186
1 小时前

A framework to enable autonomous android and computer use using any LLM (local or remote)

Python
428
2 个月前

A framework to enable autonomous android and computer use using any LLM (local or remote)

Python
232
3 个月前

An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.

Jupyter Notebook
24
2 个月前

ide-cap-chan is a utility for batch image captioning with natural language using various VL models

Python
11
2 天前

Colaboratory上でallenai/Molmoをお試しするサンプル

Jupyter Notebook
2
7 个月前

正如你所见, allenai molmo wrapper

Python
1
6 个月前

The project introduces a two-stage framework for AI-generated image detection. The first stage employs a high-accuracy classification model to differentiate between real and synthetic images. The second stage integrates an interpretability framework that identifies and highlights visual artifacts, providing transparency in decision-making.

Jupyter Notebook
1
9 天前