Repository navigation
vietnamese-nlp
- Website
- Wikipedia
Underthesea - Vietnamese NLP Toolkit
PhoGPT: Generative Pre-training for Vietnamese (2023)
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
A Vietnamese natural language processing toolkit (NAACL 2018)
Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most common Vietnamese NLP tasks.
Vietnamese NLP Toolkit for Node
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
VietASR - Vietnamese Automatic Speech Recognition
A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)
Vietnamese question answering system with BERT
A Large-scale Vietnamese News Text Classification Corpus
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)
A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking
Vietnamese Automatic Speech Recognition
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
Electra pre-trained model using Vietnamese corpus
Vietnamese sensitive words (including teencode) was created by ML algorithm