foundation-models

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python
22257
8 months ago
Python
21085
2 months ago

Janus-Series: Unified Multimodal Understanding and Generation Models

Python
17133
3 months ago

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python
4821
12 days ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python
3787
1 year ago
Python
3484
6 months ago

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Python
3342
2 days ago
EvolvingLMMs-Lab/Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python
3247
1 year ago

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python
3214
3 months ago

SuperCLUE: A Comprehensive Benchmark for Chinese General-Purpose Foundation Models

3150
1 year ago

EVA Series: Visual Representation Fantasies from BAAI

Python
2468
9 months ago

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python
1933
1 year ago

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook
1719
4 months ago