blip
Famous Vision Language Models and Their Architectures
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Chain together LLMs for reasoning & orchestrate multiple large models to accomplish complex tasks
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
PyTorch code for "Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners"
This repository provides an interactive image colorization tool that leverages Stable Diffusion (SDXL) and BLIP for user-controlled color generation. Using a model retrained with the ControlNet approach, users can upload images and specify colors for different objects, guiding the colorization process through a user-friendly Gradio interface.
A data discovery and manipulation toolset for unstructured data
Image captioning using Python and BLIP (see the captioning sketch after this list)
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
FiveM script that allows civilians to dial 911, sending their location, name, and reason for calling, and adding a blip to the map
Collection of OSS models that are containerized into a serving container
CLIP Interrogator, fully in HuggingFace Transformers 🤗, with LongCLIP & CLIP's own words and/or *your* own words!
SAM + CLIP + Diffusion for editing objects in images using plain text
oCaption: Leveraging OpenAI's GPT-4 Vision for Advanced Image Captioning
Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras (see the VQA sketch after this list).
Automatically find the scenes you want using a text query
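The BLIP-based captioning that several of these projects build on (e.g. the Python + BLIP captioning entry above) comes down to a few lines with Hugging Face Transformers. This is a minimal sketch, assuming the Salesforce/blip-image-captioning-base checkpoint and a sample COCO image URL; it is not taken from any repository listed here.

```python
import requests
from PIL import Image
from transformers import BlipProcessor, BlipForConditionalGeneration

# Assumed checkpoint; other BLIP captioning checkpoints on the Hub work the same way.
ckpt = "Salesforce/blip-image-captioning-base"
processor = BlipProcessor.from_pretrained(ckpt)
model = BlipForConditionalGeneration.from_pretrained(ckpt)

# Example image URL (a COCO validation image); substitute your own.
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw).convert("RGB")

# Preprocess the image, generate caption tokens, and decode them to text.
inputs = processor(images=image, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=30)
print(processor.decode(out[0], skip_special_tokens=True))
```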
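The visual Q&A mentioned in the Securade.ai Sentinel entry follows the same pattern with a BLIP VQA head. A minimal sketch, assuming the Salesforce/blip-vqa-base checkpoint and a hypothetical local frame path; it does not reflect that project's actual code.

```python
from PIL import Image
from transformers import BlipProcessor, BlipForQuestionAnswering

# Assumed checkpoint for BLIP visual question answering.
ckpt = "Salesforce/blip-vqa-base"
processor = BlipProcessor.from_pretrained(ckpt)
model = BlipForQuestionAnswering.from_pretrained(ckpt)

# Hypothetical frame grabbed from a CCTV stream.
image = Image.open("cctv_frame.jpg").convert("RGB")
question = "How many people are in the frame?"

# Encode the image-question pair and generate a short free-form answer.
inputs = processor(images=image, text=question, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=10)
print(processor.decode(out[0], skip_special_tokens=True))
```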