pretraining
General technology for enabling AI capabilities with LLMs and MLLMs
Official repository of OFA (ICML 2022). Paper: "OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework"
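The unifying idea is that every task, whether captioning, VQA, or visual grounding, is serialized into the same source-to-target text format. A minimal sketch of that formatting step (the task templates and location-bin tokens below are illustrative assumptions, not OFA's exact preprocessing):

```python
# A sketch of casting heterogeneous tasks into one source->target text
# format, the core idea behind unified seq2seq frameworks like OFA.
# Templates and token names here are illustrative, not OFA's actual ones.

def to_seq2seq(task: str, sample: dict) -> tuple[str, str]:
    """Map a task-specific sample to a (source, target) text pair."""
    if task == "caption":
        return "what does the image describe?", sample["caption"]
    if task == "vqa":
        return sample["question"], sample["answer"]
    if task == "grounding":
        # Regions can be serialized as discrete location tokens.
        x0, y0, x1, y1 = sample["box"]
        return (f'which region does the text "{sample["phrase"]}" describe?',
                f"<bin_{x0}> <bin_{y0}> <bin_{x1}> <bin_{y1}>")
    raise ValueError(f"unknown task: {task}")

src, tgt = to_seq2seq("vqa", {"question": "what color is the car?", "answer": "red"})
print(src, "->", tgt)
```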
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch implementation of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
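To make the key idea concrete, here is a toy mask-and-reconstruct objective on a plain convnet. The actual paper uses sparse convolutions and hierarchical decoding; the 32x32 input, 8x8 patches, and zero-filled masking below are simplifying assumptions:

```python
import torch
import torch.nn as nn

# Toy MAE-style masked image modeling on a convolutional network:
# mask random patches, encode the visible content, reconstruct the rest.

def random_patch_mask(b, h, w, patch=8, ratio=0.6):
    gh, gw = h // patch, w // patch
    keep = torch.rand(b, 1, gh, gw) > ratio  # True = visible patch
    return keep.float().repeat_interleave(patch, 2).repeat_interleave(patch, 3)

encoder = nn.Sequential(nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
                        nn.Conv2d(32, 32, 3, padding=1), nn.ReLU())
decoder = nn.Conv2d(32, 3, 3, padding=1)

x = torch.randn(4, 3, 32, 32)
mask = random_patch_mask(4, 32, 32)              # 1 = visible, 0 = masked
recon = decoder(encoder(x * mask))               # encode only visible content
loss = (((recon - x) ** 2) * (1 - mask)).mean()  # reconstruct masked patches
loss.backward()
print(float(loss))
```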
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Official Repository for the Uni-Mol Series Methods
[ICLR 2024🔥] Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
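The core of language-based alignment is contrastive training that pulls each modality's embedding toward the frozen text embedding of its paired caption, making language the shared binding space. A schematic sketch with stand-in linear encoders and random features (not the actual LanguageBind models or data):

```python
import torch
import torch.nn.functional as F

# Schematic language-based alignment: a trainable modality encoder is
# matched against frozen text embeddings with an InfoNCE-style loss.
torch.manual_seed(0)
text_emb = F.normalize(torch.randn(8, 128), dim=-1)   # frozen text features
video_enc = torch.nn.Linear(256, 128)                 # trainable modality encoder
video_feat = F.normalize(video_enc(torch.randn(8, 256)), dim=-1)

logits = video_feat @ text_emb.t() / 0.07             # cosine sim / temperature
labels = torch.arange(8)                              # i-th video pairs with i-th text
loss = F.cross_entropy(logits, labels)
loss.backward()
```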
Official PyTorch implementation of the "ImageNet-21K Pretraining for the Masses" (NeurIPS 2021) paper
Pretraining code for a large-scale depth-recurrent language model
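Depth recurrence means one shared block is unrolled a variable number of times instead of stacking distinct layers, so compute can be scaled at inference time. A schematic sketch (the encoder-style block, sizes, and step counts are illustrative assumptions; a real LM would use causal attention):

```python
import torch
import torch.nn as nn

# Depth recurrence sketch: one shared transformer block reused for a
# variable number of steps, so "depth" becomes recurrent compute.
class DepthRecurrentLM(nn.Module):
    def __init__(self, vocab=1000, d=64, steps=4):
        super().__init__()
        self.embed = nn.Embedding(vocab, d)
        self.block = nn.TransformerEncoderLayer(d, nhead=4, batch_first=True)
        self.head = nn.Linear(d, vocab)
        self.steps = steps

    def forward(self, ids, steps=None):
        h = self.embed(ids)
        for _ in range(steps or self.steps):  # same weights at every step
            h = self.block(h)
        return self.head(h)

model = DepthRecurrentLM()
logits = model(torch.randint(0, 1000, (2, 16)), steps=8)  # extra test-time compute
print(logits.shape)  # torch.Size([2, 16, 1000])
```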
A curated list of 3D vision papers related to the robotics domain in the era of large models (i.e., LLMs/VLMs), inspired by awesome-computer-vision; includes papers, code, and related websites
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
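The idea is to replace breadth-first crawl order with a priority queue keyed by each page's estimated value for pretraining. A schematic sketch in which score() and the fetch callback are hypothetical stand-ins for the paper's influence scorer and crawler:

```python
import heapq

def score(url: str) -> float:
    # Hypothetical placeholder for a pretraining-usefulness estimate.
    return 1.0 if url.endswith(".html") else 0.1

def crawl(seeds, fetch, budget=100):
    # Frontier ordered by score (negated: heapq is a min-heap).
    frontier = [(-score(u), u) for u in seeds]
    heapq.heapify(frontier)
    seen, pages = set(seeds), []
    while frontier and len(pages) < budget:
        _, url = heapq.heappop(frontier)   # best-scoring URL first
        text, links = fetch(url)           # user-supplied fetcher
        pages.append(text)
        for link in links:
            if link not in seen:
                seen.add(link)
                heapq.heappush(frontier, (-score(link), link))
    return pages
```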
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
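The variance-reduction trick is to correct the raw stochastic gradient with a scaled difference between the current gradient and the previous-iterate gradient evaluated on the same minibatch, then feed the corrected estimate into momentum. A toy sketch on least squares (constants are illustrative, and MARS's preconditioning and clipping are omitted):

```python
import torch

def loss_fn(w, batch):
    x, y = batch
    return ((x @ w - y) ** 2).mean()

w = torch.zeros(5, requires_grad=True)
m, beta, gamma, lr = torch.zeros(5), 0.9, 0.025, 0.1
prev_w = w.detach().clone()

for step in range(50):
    batch = (torch.randn(32, 5), torch.randn(32))
    g_now = torch.autograd.grad(loss_fn(w, batch), w)[0]
    # Gradient at the PREVIOUS weights on the SAME minibatch.
    g_prev = torch.autograd.grad(loss_fn(prev_w.requires_grad_(), batch),
                                 prev_w)[0]
    c = g_now + gamma * (beta / (1 - beta)) * (g_now - g_prev)  # corrected gradient
    prev_w = w.detach().clone()
    m = beta * m + (1 - beta) * c
    with torch.no_grad():
        w -= lr * m
```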
PITI: Pretraining is All You Need for Image-to-Image Translation
PaddlePaddle's large model development suite, providing an end-to-end development toolchain for large language models, cross-modal large models, biocomputing large models, and more.
[ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links
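The data-side idea is that the second training segment sometimes comes from a hyperlinked document rather than the same one, exposing the model to cross-document knowledge. A simplified sketch of that pairing step (the toy corpus and 50/50 mix are assumptions; the paper additionally mixes in random-document pairs for its document relation prediction objective):

```python
import random

# Toy corpus: each document carries its text and outgoing hyperlinks.
corpus = {
    "doc_a": {"text": ["sentence a1", "sentence a2"], "links": ["doc_b"]},
    "doc_b": {"text": ["sentence b1", "sentence b2"], "links": []},
}

def make_pair(doc_id: str) -> tuple[str, str]:
    doc = corpus[doc_id]
    if doc["links"] and random.random() < 0.5:
        linked = corpus[random.choice(doc["links"])]
        return doc["text"][0], random.choice(linked["text"])  # linked pair
    return doc["text"][0], doc["text"][-1]                    # contiguous pair

print(make_pair("doc_a"))
```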
Personal project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Train your own 8B/14B LLaVA-like MLLM on an RTX 3090/4090 with 24GB.
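Pipeline parallelism in this spirit places groups of layers on different GPUs and passes activations across the device boundary, which is what lets a model too big for one 24GB card still train. A bare-bones two-stage sketch (layer sizes and the two-GPU split are illustrative; real pipelines also interleave micro-batches to keep both stages busy):

```python
import torch
import torch.nn as nn

# Two-stage model partitioning: half the layers on cuda:0, half on cuda:1,
# with activations handed across the device boundary in forward().
class TwoStageModel(nn.Module):
    def __init__(self, d=1024, layers=8):
        super().__init__()
        half = layers // 2
        self.stage0 = nn.Sequential(*[nn.Linear(d, d) for _ in range(half)]).to("cuda:0")
        self.stage1 = nn.Sequential(*[nn.Linear(d, d) for _ in range(half)]).to("cuda:1")

    def forward(self, x):
        h = self.stage0(x.to("cuda:0"))
        return self.stage1(h.to("cuda:1"))  # activations cross the GPU boundary

if torch.cuda.device_count() >= 2:
    model = TwoStageModel()
    out = model(torch.randn(4, 1024))
    out.sum().backward()                    # autograd spans both devices
```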