Repository navigation

#

gpt4vision

Convert different model APIs into the OpenAI API format out of the box.

Go
150
1 年前

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Python
108
12 天前

Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024)

Python
35
5 个月前

This is a tool that uses GPT4 Vision to operate your computer

Rust
29
1 年前

This repository offers a Python framework for a retrieval-augmented generation (RAG) pipeline using text and images from MHTML documents, leveraging Azure AI and OpenAI services. It includes ingestion and enrichment flows, a RAG with Vision pipeline, and evaluation tools.

Python
25
4 个月前

Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision).

ShaderLab
14
1 年前

Developed an IoT-based construction site inspector using a Raspberry Pi 4 that autonomously navigates and inspects construction sites. The system features two DC motors for line-following and a servo-mounted ultrasonic sensor for real-time obstacle detection.

Python
1
6 个月前

VisionQuery GPT-4v is a cutting-edge tool that combines screenshot-based queries with OpenAI's GPT-4. It enables users to capture screens, ask questions, and receive insightful answers from GPT-4v, revolutionizing digital interaction and understanding.

Jupyter Notebook
1
1 年前

Web-based user interface for GPT4All and set it up to be hosted on GitHub Pages. This will allow users to interact with the model through a browser. We'll use Flask for the backend and some modern HTML/CSS/JavaScript for the frontend.

Python
1
9 个月前

Camera powered with AI on the web

TypeScript
0
1 年前