pretrain

keyu-tian/SparK

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch implementation of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Python
1354
2 years ago

Large-scale Pre-training Corpus for Chinese (100G Chinese pre-training corpus)

983
3 years ago

Firefly Chinese LLaMA-2 large model, supporting incremental pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models

Python
413
2 years ago

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

Python
381
5 days ago

An official implementation of "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Python
362
1 year ago

BERT-CCPoem is a BERT-based pre-trained model specifically for Chinese classical poetry

Python
156
4 years ago

BERT-based models (BERT, MTB, CP) for relation extraction.

Python
103
3 years ago

MatDGL is a neural network package that allows researchers to train custom models for crystal modeling tasks. It aims to accelerate the research and application of material science.

Python
51
1 year ago

Code for the paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)

Python
24
3 years ago

[CCIR 2023] Self-supervised learning for Sequential Recommender Systems

Python
24
2 years ago

Official code repository for the paper "Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain"

Python
23
25 days ago

ALBERT trained on Mongolian text corpus

Jupyter Notebook
18
5 years ago

Build your own generative AI language model from scratch (including pretraining and SFT with LoRA)

Python
15
8 months ago

macrogpt: full pre-training of a large model (1b3, 32 layers), multi-GPU DeepSpeed / single-GPU Adafactor

Python
15
2 years ago
Python
10
18 days ago

This repository provides a code solution for Data Fusion Contest task 1

Jupyter Notebook
8
5 years ago

Understanding "A Lite BERT". A Transformer approach for learning self-supervised language models.

Python
7
3 years ago