gemini-pro-vision

Deploy your private Gemini application for free with one click, with support for Gemini 1.5 and Gemini 2.0 models.

TypeScript
1390
14 hours ago

A proxy for converting the OpenAI API protocol to the Google Gemini Pro protocol.

Go
637
13 days ago
TypeScript
400
16 days ago

A simple playground web UI, built with Next.js, for using the Gemini Pro Vision and Gemini Pro AI models.

TypeScript
84
1 year ago

Unlock the potential of Google's Gemini AI models with this versatile toolkit. It offers seamless chat, text generation, and multimodal interactions, and supports various file types, including PDFs, images, videos, audio, text, and more. Enjoy real-time responses, customizable parameters, and easy integration for diverse AI tasks.

Python
69
2 months ago

Gptmap will guide you through creating a comprehensive Android application using a modern toolkit, highlighting the integration of AI technologies and illustrating the real-world applications of these advanced technologies, providing valuable insights and best practices.

Kotlin
55
1 year ago

A Discord bot leveraging Google Gemini. Features image recognition, conversation engagement, and content understanding.

JavaScript
53
2 months ago

DarkGPT Chat Explorer is an interactive web application that allows users to engage in conversations with various GPT (Generative Pre-trained Transformer) models in real-time. This repository contains the source code for the application.

Python
43
3 months ago

An innovative AI conversation API leveraging Google's Gemini for multimodal understanding. Combines FastAPI, Langchain, and Redis for robust, scalable, and privacy-conscious text- and image-based interactions.

Python
40
1 year ago

An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.

Python
38
4 months ago

A LINE Bot business card bot built on Gemini Pro Vision, using Notion as your database (Golang)

Go
38
1 year ago

This project aims to combine the latest LLMs, Multi-Step Asynchronous Function Calling, Natural Language Processing, and Text-to-Speech.

Python
37
1 year ago

Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"

36
1 year ago

The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.

Pascal
30
1 month ago
Python
30
4 months ago

Gemini2tg is a project that connects Google Gemini's API to a Telegram bot, giving you your own AI bot powered by Gemini.

Python
29
6 months ago

Mini-Bard client for Angular using Gemini Pro via API key from Google AI Studio

TypeScript
29
2 months ago