Repository navigation

foundation-models

Website
Wikipedia

Making large AI models cheaper, faster and more accessible

深度学习 hpc large-scale data-parallelism pipeline-parallelism model-parallelism 人工智能 big-model distributed-computing inference heterogeneous-training foundation-models

Python

41188

4532

5 天前

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

gpt-4 聊天机器人 ChatGPT llama multimodal llava foundation-models instruction-tuning multi-modality visual-language-learning llama-2 llama2 vision-language-model

Python

23657

2634

1 年前

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

自然语言处理 pre-trained-model unilm minilm layoutlm layoutxlm beit document-ai trocr beit-3 foundation-models xlm-e deepnet 大语言模型 multimodal mllm kosmos kosmos-1 textdiffuser bitnet

Python

21754

2662

3 个月前

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

any-to-any foundation-models 大语言模型 multimodal vision-language-pretraining unified-model

Python

17566

2243

8 个月前

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

foundation-models music-generation huggingface llama audio-generation voice-cloning 大语言模型人工智能深度学习 gpt

Python

5553

638

4 个月前

modelscope / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

数据分析数据科学 large-language-models 大语言模型数据可视化 instruction-tuning pre-training multi-modal synthetic-data data data-pipeline data-processing foundation-models

Python

5278

276

2 天前

PriorLabs / TabPFN

⚡ TabPFN: Foundation Model for Tabular Data ⚡

数据科学 foundation-models 机器学习 tabpfn tabular-data

Python

4523

449

2 天前

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

vision-language-model vision-language-pretraining foundation-models

Python

3966

581

1 年前

amazon-science / chronos-forecasting

Chronos: Pretrained Models for Probabilistic Time Series Forecasting

forecasting large-language-models 大语言模型机器学习 time-series foundation-models pretrained-models time-series-forecasting timeseries 人工智能 huggingface huggingface-transformers transformers

Python

3696

426

1 个月前

NExT-GPT / NExT-GPT

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

ChatGPT foundation-models gpt-4 instruction-tuning large-language-models 大语言模型 multi-modal-chatgpt multimodal visual-language-learning mllm

Python

3565

359

5 个月前

OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

captioning-videos ChatGPT gradio langchain video-question-answering video-understanding stablelm chat Video big-model foundation-models large-language-models

Python

3305

267

9 个月前

EvolvingLMMs-Lab / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

gpt-4 visual-language-learning artificial-inteligence 深度学习 foundation-models multi-modality 机器学习 ChatGPT instruction-tuning large-scale-models embodied-ai

Python

3270

209

2 年前

CLUEbenchmark / SuperCLUE

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

ChatGPT chinese evaluation foundation-models gpt-4

3258

112

1 个月前

baaivision / EVA

EVA Series: Visual Representation Fantasies from BAAI

foundation-models representation-learning vision-transformer

Python

2577

187

1 年前

autodistill / autodistill

Images to inference with no labeling (use foundation models to train supervised models).

机器视觉 auto-labeling 深度学习 foundation-models grounding-dino image-annotation image-classification instance-segmentation labeling-tool 机器学习 multimodal object-detection PyTorch segment-anything yolov5 yolov8

Python

2406

197

5 个月前

hyp1231 / awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

Awesome Lists embodied-agent embodied-ai foundation-model foundation-models generative-agents generative-ai generative-model generative-models 大语言模型 large-language-models ChatGPT gpt-4

2130

172

5 个月前

KaiyangZhou / CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

foundation-models multimodal-learning prompt-learning

Python

2079

227

1 年前

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

foundation-models video-understanding vision-transformer action-recognition multimodal temporal-action-localization video-question-answering zero-shot-classification benchmark contrastive-learning self-supervised instruction-tuning video-clip

Python

2063

127

2 个月前

tatsu-lab / alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

深度学习 evaluation foundation-models instruction-following large-language-models leaderboard 自然语言处理 rlhf

Jupyter Notebook

1870

288

2 个月前

NVlabs / MambaVision

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

深度学习 foundation-models image-classification mamba self-attention vision-transformer visual-recognition huggingface-transformers transformers instance-segmentation object-detection semantic-segmentation

Python

1771

100

2 个月前