Repository navigation

#

lemmatizer

An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation

Python
736
2 天前

Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.

Python
632
4 年前

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.

Java
479
2 年前

A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.

Python
294
4 年前
Python
175
4 个月前

Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy

Ruby
111
4 年前

Tokenizers and lemmatizers for Go

Go
111
1 个月前

Elasticsearch lemmatizer for 15 languages

Java
108
10 个月前

🧪 Cutting-edge experimental spaCy components and features

Python
101
1 年前

Lemmatization for Turkish Language

Python
98
7 年前

Morfologik Polish Lemmatizer plugin for Elasticsearch

Java
93
9 天前

A lemmatizer implemented in Go

Go
88
5 个月前

An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English. LLMs, FSTs and More!

Python
83
4 个月前

🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪

Python
77
4 年前

Грамматический Словарь Русского Языка (+ английский, японский, etc)

C++
75
5 年前