Repository navigation

#

minicpm-v

MiniCPM-V 4.0: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python
20091
8 天前

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

Python
687
1 小时前

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Python
396
3 个月前

Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.

Python
150
3 个月前
C++
110
2 天前

軽量VLMのMiniCPM-V2.6のColaboratoryサンプル

Jupyter Notebook
4
1 年前

PicQ: Demo for MiniCPM-o 2.6 to answer questions about images using natural language.

Python
4
7 个月前

VidiQA: Demo for MiniCPM-V 2.6 to answer questions about videos using natural language.

Python
2
9 个月前