Repository navigation

#

lemmatizer

An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation

Python
731
24 天前

Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.

Python
634
4 年前

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.

Java
478
2 年前

A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.

Python
294
4 年前
Python
170
2 个月前

Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy

Ruby
111
4 年前

Tokenizers and lemmatizers for Go

Go
110
1 年前

Elasticsearch lemmatizer for 15 languages

Java
107
8 个月前

🧪 Cutting-edge experimental spaCy components and features

Python
101
1 年前

Lemmatization for Turkish Language

Python
97
7 年前

Morfologik Polish Lemmatizer plugin for Elasticsearch

Java
92
16 小时前

A lemmatizer implemented in Go

Go
88
3 个月前

An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English. LLMs, FSTs and More!

Python
82
3 个月前

🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪

Python
77
4 年前

Грамматический Словарь Русского Языка (+ английский, японский, etc)

C++
75
5 年前