Repository navigation

#

gensim

dipanjanS/text-analytics-with-python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

Jupyter Notebook
1668
4 年前
kavgan/nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Jupyter Notebook
1169
4 年前

Data repository for pretrained NLP models and NLP corpora.

Python
1016
7 年前

中文詞向量訓練教學

Python
518
2 年前

Fast word vectors with little memory usage in Python

Python
417
4 年前

AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.

Jupyter Notebook
399
4 年前

Log Anomaly Detection - Machine learning to detect abnormal events logs

Jupyter Notebook
328
2 年前

ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python

273
4 年前

This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

Python
245
2 年前

Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.

Jupyter Notebook
236
8 个月前

Pre-trained Word2Vec Model for Turkish

Python
212
7 年前

Web-ify your word2vec: framework to serve distributional semantic models online

Python
200
2 个月前

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

Python
198
8 年前