gensim

piskvorky / gensim

Topic Modelling for Humans

gensim topic-modeling information-retrieval machine-learning natural-language-processing data-science Python data-mining word2vec word-embeddings neural-network fasttext

Python

16143

4406

1 month ago

dipanjanS / text-analytics-with-python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

text-classification Python natural-language natural-language-processing clustering sentiment semantic sentiment-analysis nltk stanford-nlp spaCy pattern scikit-learn gensim

Jupyter Notebook

1679

850

5 years ago

explosion / sense2vec

🦆 Contextually-keyed word vectors

spaCy natural-language-processing word2vec Python sense2vec gensim gensim-word2vec machine-learning

Python

1657

241

4 months ago

plasticityai / magnitude

A fast, efficient universal vector embedding utility package.

Python natural-language-processing machine-learning vectors embeddings word2vec fasttext glove gensim fast memory-efficient word-embeddings

Python

1650

120

2 years ago

kavgan / nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

natural-language-processing word2vec text-classification gensim machine-learning text-mining

Jupyter Notebook

1175

790

5 years ago

piskvorky / gensim-data

Data repository for pretrained NLP models and NLP corpora.

dataset gensim pretrained-models

Python

1031

139

7 years ago

oborchers / Fast_Sentence_Embeddings

Compute Sentence Embeddings Fast!

sentence-embeddings sentence-representation sentence-similarity gensim fasttext cython embeddings maxpooling fse

Jupyter Notebook

622

2 years ago

zake7749 / word2vec-tutorial

Tutorial on training Chinese word vectors

gensim word2vec

Python

516

163

3 years ago

ThoughtRiver / lmdb-embeddings

Fast word vectors with little memory usage in Python

word vectors embeddings lmdb gensim memory speed text word2vec fasttext glove

Python

416

4 years ago

bakrianoo / aravec

AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.

natural-language-processing gensim arabic text-mining word2vec

Jupyter Notebook

404

4 years ago

5hirish / adam_qas

ADAM - A Question Answering System. Inspired by IBM Watson

Python spaCy natural-language-processing question-answering adam scikit-learn gensim pandas wikipedia elasticsearch spacy-extension

Python

356

106

6 years ago

AICoE / log-anomaly-detector

Log Anomaly Detection - Machine learning to detect abnormal event logs

artificial-intelligence log anomaly-detection machine-learning word2vec som gensim stream-processing Kubernetes aiops

Jupyter Notebook

334

138

2 years ago

30lm32 / ml-projects

ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python

Keras Tensorflow random-forest gensim word2vec Docker timeseries-analysis imbalanced-data svm natural-language-processing machine-learning geolocation deep-learning text-classification tensorboard mlflow ab-testing

279

109

5 years ago

benedekrozemberczki / GEMSEC

The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).

clustering deepwalk node2vec word2vec Tensorflow Facebook deezer community-detection matrix-factorization embedding neural-network unsupervised-learning gensim machine-learning network-embedding graph-embedding

Python

260

3 years ago

davidberenstein1957 / concise-concepts

This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

ner spaCy gensim natural-language-processing machine-learning Hacktoberfest

Python

244

2 years ago

devmount / GermanWordEmbeddings

Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.

neural-network word2vec word-embeddings model training evaluation deep-learning deep-neural-networks natural-language-processing gensim

Jupyter Notebook

239

1 year ago

akoksal / Turkish-Word2Vec

Pre-trained Word2Vec Model for Turkish

word2vec natural-language-processing gensim turkish

Python

215

7 years ago

benedekrozemberczki / Splitter

A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).

deepwalk PyTorch node2vec gensim machine-learning word2vec factorization deep-learning deep-neural-networks graph-neural-network node-embedding community-detection clustering network-embedding graph-embedding graph-representation-learning

Python

212

2 years ago

alisonmitchell / Stock-Prediction

Technical and sentiment analysis to predict the stock market with machine learning models based on historical time series data and news article sentiment collected using APIs and web scraping.

Python machine-learning keras-tensorflow NumPy scikit-learn pandas seaborn matplotlib plotly SciPy mplfinance beautifulsoup nltk spaCy gensim natural-language-processing bert huggingface

Jupyter Notebook

202

5 months ago

akutuzov / webvectors

Web-ify your word2vec: framework to serve distributional semantic models online

gensim word2vec Web app Flask

Python

201

6 months ago