# image-understanding

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python
937
10 months ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python
688
19 days ago

🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT models 🤖💬 It also allows image generation/editing/understanding 🖼️, speech-to-text conversion 🎤, and text-to-speech synthesis 🔈

318
12 days ago

A Unified Framework for Image-to-Graph Generation. Paper accepted @ ECCV22.

129
2 years ago

WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Python
96
1 year ago

This is the implementation of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"

Python
66
2 months ago

This GitHub repository shows how to integrate the OpenAI GPT-3 language model and the ChatGPT API into a Unity project. It can be a useful way to add natural language processing capabilities to your application.

C#
37
2 years ago

Collection of open datasets in computer vision.

35
7 years ago

HumanVLM (LLaVA-based): Foundation for Human-Scene Vision-Language Model (Journal of Information Fusion 2025)

Python
11
7 months ago

📘 CVIU78101: Introduction to Computer Vision for Image Understanding Course

Jupyter Notebook
11
7 days ago

🖼️📄E2E Multi-modal Document Preprocessing for Search Indexing with Azure Document Intelligence

Python
5
20 days ago

A reimplementation of the paper "Human-Aligned Image Models Improve Visual Decoding from the Brain"

Jupyter Notebook
1
10 hours ago

🏷 This repository contains the lab sheets for the Image Understanding & Processing (SE4130) module in Year 4, Semester 1.

Jupyter Notebook
0
3 years ago

Annuncio generates product advertisements from user inputs, utilizing Aria for descriptions, Allegro for promotional videos, and hashtags for social media discoverability.

Python
0
10 months ago

2022-1 Image Understanding Assignments & Projects

MATLAB
0
3 years ago

This is the implementation of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"

Jupyter Notebook
0
3 hours ago