Repository navigation

#

foundation-models

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python
23331
1 年前

Janus-Series: Unified Multimodal Understanding and Generation Models

Python
17511
7 个月前

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python
5400
3 个月前

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Python
4279
14 小时前

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python
3953
1 年前
Python
3548
3 个月前

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python
3285
7 个月前
EvolvingLMMs-Lab/Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python
3266
1 年前

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

3241
4 个月前

EVA Series: Visual Representation Fantasies from BAAI

Python
2553
1 年前

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python
2035
1 年前

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook
1838
11 天前