Repository navigation

#

clip-model

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.

Swift
2835
4 个月前

Simple implementation of OpenAI CLIP model in PyTorch.

Jupyter Notebook
680
1 年前

[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models

Python
160
1 年前

[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.

Python
158
1 年前

根据文本描述搜索本地图片的工具,powered by Rust + candle + CLIP

Rust
148
1 年前

The most impactful papers related to contrastive pretraining for multimodal models!

Python
65
1 年前

Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retrieval"

Python
45
1 年前

[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

Python
36
2 年前

Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!

Python
24
3 个月前

A blazing fast CLIP gRPC service in rust.

Rust
15
2 年前

A Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocabulary and adding linear layer on top of Open AI's CLIP model as image and text encoder

Jupyter Notebook
11
2 年前

Text to image search & Image Similarity Search using @Typesense

TypeScript
8
3 个月前

[ NeurIPS 2023 R0-FoMo Workshop ] Official Codebase for "Estimating Uncertainty in Multimodal Foundation Models using Public Internet Data"

Jupyter Notebook
6
1 年前

Traverse the space of concepts with a multi-modal similarity index in FiftyOne

TypeScript
5
1 年前