Repository navigation

#

clip

开放源码的无App推送服务,iOS14+扫码即用。亦支持快应用/iOS和Mac客户端、Android客户端、自制设备

C
4920
7 个月前
Jupyter Notebook
3656
2 个月前

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python
3122
7 天前

Image to prompt with BLIP and CLIP

Python
2904
1 年前

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook
2653
2 个月前

Android UI 快速开发,专治原生控件各种不服

Java
1945
2 年前
QIN2DIM/hcaptcha-challenger
Python
1945
2 个月前

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

Python
1682
9 个月前

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Python
1441
2 个月前

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

1216
1 年前
unum-cloud/uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Python
1179
1 个月前

This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.

Jupyter Notebook
1131
8 个月前

Stable Diffusion in NCNN with c++, supported txt2img and img2img

C++
1049
2 年前