
#document-embedding

TypeScript
848
2 years ago

A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)

Python
91
2 months ago

🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted at EMNLP 2021 🌴

Python
25
1 year ago

Container-first, JSON-configurable, NLP REST service based on Flair

Python
10
5 years ago

Learns contextualized word, sentence, and document representations with a hierarchical language model that stacks Transformer-based encoders first at the sentence level and then at the document level, using masked token prediction.

Jupyter Notebook
7
2 years ago

Telegram Data Clustering Contest (Bossy Gnu's submission)

C++
4
4 years ago

Dive into the world of Word2Vec and Doc2Vec models to uncover insights and applications.

Jupyter Notebook
2
1 year ago

An open-source framework to create and test document embeddings using topic models.

Python
1
5 years ago

This Streamlit application demonstrates the integration of ChatGroq (Llama3 model), OpenAIEmbeddings, and FAISS for document embedding and retrieval.

Python
1
9 months ago

Medical Retrieval-Augmented Generation (RAG) Knowledge Base - A Next.js and LangChain-powered app that processes and stores medical documents as vector embeddings in Pinecone for efficient similarity search.

TypeScript
0
6 months ago

Applying NLP to understand people's sentiment about COVID-19 and government actions in Italy, conditional on their political affiliation.

Jupyter Notebook
0
4 years ago

Improving document embeddings with a weighted average of word embeddings through topic modeling

R
0
4 years ago