Repository navigation

xlm-roberta

Website
Wikipedia

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

bert pre-training fine-tuning gpt chinese 自然语言处理 PyTorch elmo classification ner t5 unilm roberta albert gpt-2 model-zoo bart xlm-roberta

Python

3060

523

1 年前

Tencent / TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

albert bart bert chinese classification elmo fine-tuning gpt gpt-2 model-zoo 自然语言处理 ner pre-training PyTorch roberta t5 unilm xlm-roberta

Python

1070

148

9 个月前

explosion / curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

bert falcon llama 大语言模型自然语言处理 PyTorch transformer xlm-roberta transformers albert llms roberta

Python

883

1 年前

nlp-uoregon / trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

自然语言处理 PyTorch language-model xlm-roberta 机器学习深度学习人工智能 universal-dependencies multilingual adapters tokenization part-of-speech-tagging dependency-parsing

Python

749

103

6 个月前

iflytek / cino

CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)

自然语言处理 PyTorch xlm-roberta chinese-nlp transformers

Python

242

2 年前

csebuetnlp / banglabert

This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.

sentiment-classification document-classification named-entity-recognition natural-language-inference bert xlm-roberta

Python

241

2 年前

EveripediaNetwork / fastc

Unattended Lightweight Text Classifiers with LLM Embeddings

bert deberta e5 embeddings 大语言模型 minilm roberta text-classification xlm-roberta

Python

185

7 个月前

tensordot / syntaxdot

Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.

dependency-parsing xlm-roberta pretrained-models bert part-of-speech-tagging

Rust

1 年前

GeekDream-x / SemEval2022-Task8-TonyX

Deep-learning system proposed by HFL for SemEval-2022 Task 8: Multilingual News Similarity

Bukkit multilingual 自然语言处理 computational-linguistics semantic-similarity xlm-roberta 深度学习机器学习

Python

3 年前

hate-alert / Tutorial-Resources

Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021

Twitter 教程自然语言处理 bert-model xlm-roberta huggingface-transformers huggingface

Python

3 年前

Data-Science-kosta / Long-texts-Sentiment-Analysis-RoBERTa

PyTorch implementation of Sentiment Analysis of the long texts written in Serbian language (which is underused language) using pretrained Multilingual RoBERTa based model (XLM-R) on the small dataset.

roberta multilingual sentiment-analysis PyTorch pytorch-implementation xlm-roberta lstm text-classification bert

Jupyter Notebook

2 年前

Data-Science-kosta / Twitter-Sentiment-Analysis-RoBERTa

Sentiment Analysis of tweets written in underused Slavic languages (Serbian, Bosnian and Croatian) using pretrained multilingual RoBERTa based model XLM-R on 2 different datasets.

twitter-api Twitter tweet sentiment-analysis roberta pretrained-models xlm-roberta bert API

Jupyter Notebook

4 年前

Kirill-Kravtsov / drophead-pytorch

An implementation of drophead regularization for pytorch transformers

PyTorch dropout transformers regularization self-attention bert roberta xlm-roberta

Python

4 年前

crux82 / AILC-lectures2021-lab

This is a Pytorch (+ Huggingface transformers) implementation of a "simple" text classifier defined using BERT-based models. In this lab we will see how it is simple to use BERT for a sentence classification task, obtaining state-of-the-art results in few lines of python code.

bert sentence-classification sentiment-analysis english italian multilingual roberta xlm-roberta albert

Jupyter Notebook

4 年前

ashwanitanwar / nmt-transfer-learning-xlm-r

Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning

neural-machine-translation transfer-learning xlm-roberta language-model self-attention transformer-architecture

Python

2 年前

MLArtist / intent-detection-using-XLM-Roberta

This repository is a comprehensive project that leverages the XLM-Roberta model for intent detection. This repository is a valuable resource for developers looking to build and fine-tune intent detection models based on state-of-the-art techniques.

intent intent-classification intent-recognition xlm-roberta 自然语言处理 natural-language-understanding 聊天机器人 conversational-ai Flask torch transformer

Jupyter Notebook

1 年前

cambridgeltl / BLICEr

Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

bilingual-lexicon-extraction fasttext-embeddings PyTorch reranking self-learning word-embeddings xlm-roberta information-retrieval machine-translation

Python

2 年前

SapienzaNLP / guardians-mt-eval

Official repository of the ACL 2024 paper "Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In!".

acl machine-translation natural-language-generation 自然语言处理 huggingface xlm-roberta

Python

5 个月前

haozhg / lmd

Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models

深度学习 language-models 自然语言处理 pretrained-models Python PyTorch bert transformers roberta xlm-roberta

Python

2 年前

rasyosef / amharic-news-category-classification

notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification dataset and the transformers library

bert fine-tuning huggingface text-classification transformers xlm-roberta

Jupyter Notebook

1 年前