languagemodel

CodeGen is a family of open-source models for program synthesis, trained on TPU-v4 and competitive with OpenAI Codex.

Python
5067
3 months ago
JavaScript
895
2 years ago

This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).

Python
525
3 months ago

[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation

Python
471
1 year ago

Calculate perplexity on a text with pre-trained language models. Supports MLMs (e.g. DeBERTa), causal (autoregressive) LMs (e.g. GPT-3), and encoder-decoder LMs (e.g. Flan-T5).

Python
155
7 months ago
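
As a rough illustration of the quantity such a tool reports (this is not the repository's own API): perplexity is the exponential of the average negative log-likelihood per token. The function name `perplexity` and the input values below are hypothetical, and a real tool would take the per-token log-probabilities from a pre-trained model.

```python
import math

def perplexity(token_log_probs):
    """Perplexity from per-token natural-log probabilities (hypothetical helper)."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    # Average negative log-likelihood per token, then exponentiate.
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_nll)

# Made-up example: a model that gives every one of four tokens probability 0.25.
uniform = [math.log(0.25)] * 4
print(perplexity(uniform))  # close to 4, i.e. as uncertain as a 4-way uniform choice
```

Lower values mean the model found the text more predictable.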

Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"

Python
137
2 months ago

High-performance small model evaluation: Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task

Python
58
5 years ago

TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis

Python
36
7 months ago

Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks. The initial version of the library focuses on the Polish language.

Python
36
1 year ago

The PreTENS shared task, hosted at SemEval 2022, focuses on semantic competence, specifically on evaluating how well language models recognize appropriate taxonomic relations between two nominal arguments (i.e. cases where one is a supercategory of the other or, in extensional terms, one denotes a superset of the other).

Jupyter Notebook
12
3 years ago

Code for "Semi-supervised Formality Style Transfer using Language Model Discriminator and Mutual Information Maximization"

Python
8
5 years ago

A 78.5% word sense disambiguator based on Transformers and RoBERTa (PyTorch)

Python
5
5 years ago

Informal to formal dataset mask MLM

Python
5
22 days ago

translatorlab: a machine translation tool that uses artificial intelligence models to provide accurate and fast translations between different languages

Python
2
2 days ago

A project that harnesses the Stanford NLP library to gauge sentiment from provided text via an intuitive graphical interface.

Java
2
2 years ago

An API that uses the Google PaLM API to respond to user requests.

HTML
1
4 months ago

This project scrapes and cleans Shakespeare's public domain texts, trains a character-level LSTM model in PyTorch, and generates fresh, Shakespeare-like text. Perfect for literature and NLP enthusiasts, it provides metrics (loss, perplexity, accuracy) and a platform for tuning hyperparameters and exploring the art of AI-driven language modeling.

Jupyter Notebook
1
4 months ago

The PowerShell Random Text Generator is a script that generates random text based on a given model.

PowerShell
1
9 months ago

Given a user-supplied starting phrase such as "Ilbierah kont", the script attempts to build a sentence from that phrase. The generator works in an n-gram fashion, using unigram, bigram and trigram models to generate the sentences. The perplexity of each n-gram model was also calculated.

Jupyter Notebook
0
4 years ago
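
To make the n-gram idea in the entry above concrete, here is a minimal bigram sketch. It is not the project's code: the toy English corpus, the function names `build_bigram_model` and `generate`, and the fixed random seed are all assumptions made for illustration.

```python
import random
from collections import defaultdict

def build_bigram_model(tokens):
    """Map each word to the list of words observed right after it."""
    model = defaultdict(list)
    for prev, nxt in zip(tokens, tokens[1:]):
        model[prev].append(nxt)
    return model

def generate(model, start, max_words=10, seed=0):
    """Extend the starting phrase by sampling a follower of the last word."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    words = start.split()
    while len(words) < max_words:
        followers = model.get(words[-1])
        if not followers:  # last word never had a follower in the corpus
            break
        words.append(rng.choice(followers))
    return " ".join(words)

# Toy corpus (hypothetical); a real run would use a much larger text.
corpus = "the cat sat on the mat and the cat slept".split()
model = build_bigram_model(corpus)
print(generate(model, "the cat"))
```

Because followers are stored with repetition, frequent continuations are sampled more often. A trigram variant would key the table on the last two words instead of one.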