#pretrain

keyu-tian/SparK

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch implementation of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Python
1355
2 years ago

Large-scale Pre-training Corpus for Chinese (100G)

980
3 years ago

Firefly Chinese LLaMA-2 large model; supports incremental pre-training of large models such as Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, and Bloom

Python
413
2 years ago

An official implementation of "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Python
360
1 year ago

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

Python
308
1 day ago

BERT-CCPoem is a BERT-based pre-trained model designed specifically for Chinese classical poetry

Python
155
3 years ago

BERT-based models (BERT, MTB, CP) for relation extraction.

Python
103
3 years ago

MatDGL is a neural network package that allows researchers to train custom models for crystal modeling tasks. It aims to accelerate the research and application of material science.

Python
51
1 year ago

Code for the paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)

Python
24
3 years ago

Official code repository for the paper "Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain"

Python
23
7 months ago

[CCIR 2023] Self-supervised learning for Sequential Recommender Systems

Python
23
2 years ago

ALBERT trained on a Mongolian text corpus

Jupyter Notebook
18
5 years ago

Build your own generative AI language model from scratch (including pretraining / SFT with LoRA)

Python
16
6 months ago

macrogpt: full pre-training of a large model (1b3, 32 layers); multi-GPU deepspeed / single-GPU adafactor

Python
15
2 years ago

Run large language models easily.

Python
10
3 days ago

This repository provides a code solution for Data Fusion Contest task 1

Jupyter Notebook
8
4 years ago

Understanding "A Lite BERT". A Transformer approach for learning self-supervised language models.

Python
7
3 years ago