Repository navigation

#

pretrain

keyu-tian/SparK

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Python
1336
1 年前

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

953
3 年前

Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

Python
410
1 年前

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

Python
351
9 个月前

BERT-CCPoem is an BERT-based pre-trained model particularly for Chinese classical poetry

Python
153
3 年前

Bert-based models(BERT, MTB, CP) for relation extraction.

Python
102
3 年前

MatDGL is a neural network package that allows researchers to train custom models for crystal modeling tasks. It aims to accelerate the research and application of material science.

Python
52
9 个月前

code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)

Python
24
2 年前

Official code repository for the paper "Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain"

Python
23
3 个月前

[CCIR 2023] Self-supervised learning for Sequential Recommender Systems

Python
20
1 年前

ALBERT trained on Mongolian text corpus

Jupyter Notebook
18
4 年前

Make your Generative AI LM model from the scratch (Including pretraining / SFT with LoRA)

Python
16
2 个月前

macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor

Python
13
1 年前

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

Python
11
18 小时前

This repository provides code solution for Data Fusion Contest task 1

Jupyter Notebook
8
4 年前

Running Large Language Model easily.

Python
8
4 天前

Understanding "A Lite BERT". An Transformer approach for learning self-supervised Language Models.

Python
7
2 年前