sentence-embeddings

shibing624/text2vec

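text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.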

Python
4682
9 days ago

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python
3542
6 months ago

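xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, character radical lookup, sentence representation, and text similarity computation.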

Python
1278
2 years ago

Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need.

Python
830
1 year ago

A Python vector database you just need - no more, no less.

Python
604
1 year ago

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

Jupyter Notebook
590
2 years ago

Keyphrase or Keyword Extraction: a Chinese keyword extraction method based on pre-trained models (the Chinese-language version of the code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model").

Python
427
5 years ago

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!

Python
380
2 years ago

A curated list of research papers in Sentence Representation Learning and an STS leaderboard of sentence embeddings.

315
2 years ago