# sentence-embeddings

shibing624/text2vec

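text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.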

Python
4834
2 months ago

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python
3584
10 months ago

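xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, character radical lookup, sentence representation, and text similarity computation.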

Python
1290
3 years ago

Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need.

Python
835
2 years ago

A Python vector database you just need - no more, no less.

Python
630
1 year ago

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

Jupyter Notebook
595
2 years ago

Keyphrase or Keyword Extraction: a Chinese keyword extraction method based on pre-trained models (Chinese-language version of the code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model").

Python
431
5 years ago

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!

Python
380
2 years ago

A curated list of research papers in Sentence Representation Learning and an STS leaderboard of sentence embeddings.

315
2 years ago