Repository navigation

#

lemmatizer

An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation

Python
716
23 天前

Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.

Python
630
4 年前

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.

Java
475
2 年前

A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.

Python
295
3 年前
Python
154
5 个月前

Tokenizers and lemmatizers for Go

Go
110
1 年前

Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy

Ruby
108
4 年前

Elasticsearch lemmatizer for 15 languages

Java
105
4 个月前

🧪 Cutting-edge experimental spaCy components and features

Python
98
1 年前

Lemmatization for Turkish Language

Python
96
7 年前

Morfologik Polish Lemmatizer plugin for Elasticsearch

Java
88
1 天前

A lemmatizer implemented in Go

Go
86
11 天前

An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English. LLMs, FSTs and More!

Python
78
5 个月前

🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪

Python
76
4 年前

Грамматический Словарь Русского Языка (+ английский, японский, etc)

C++
75
5 年前