Repository navigation

#

longformer

Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre-train from scratch. We investigated if multilingual models could inherit these properties by making it an Efficient Transformer (s.a. the Longformer architecture).

Jupyter Notebook
33
4 年前

2020 AI研习社 金融用户评论分类

Python
14
5 年前

Convert pretrained RoBerta models to various long-document transformer models

Python
11
3 年前

Longformer Encoder Decoder model for the legal domain, trained for long document abstractive summarization task.

10
4 年前

This GitHub repository implements a novel approach for detecting Initial Public Offering (IPO) underpricing using pre-trained Transformers. The models, extended to handle large S-1 filings, leverage both textual information and financial indicators, outperforming traditional machine learning methods.

Python
6
5 个月前

Industrial Text Scoring using Multimodal Deep Natural Language Processing 🚀 | Code for IEA AIE 2022 paper

Python
5
2 年前

[제 13회 투빅스 컨퍼런스] YoYAK - Yes or Yes, Attention with gap-sentence for Korean long sequence

Jupyter Notebook
5
3 年前

Kaggle NLP competition - Top 2% solution (36/2060)

Jupyter Notebook
4
3 年前

This project applies the Longformer model to sentiment analysis using the IMDB movie review dataset. The Longformer model, introduced in "Longformer: The Long-Document Transformer," tackles long document processing with sliding-window and global attention mechanisms. The implementation leverages PyTorch, following the paper's architecture

Python
4
2 年前

Fine-tuned Longformer for Summarization of Machine Learning Articles

Jupyter Notebook
4
2 年前

A summarization website that can generate summaries from either YouTube videos or PDF files.

Python
4
7 个月前

Project as part of COMP34812: Natural Language Understanding

Jupyter Notebook
3
2 年前

A hyperpartisan news article classification system using BERT-based techniques. The goal was to leverage state-of-the-art transformer models like BERT, ROBERTa, and Longformer to accurately classify news articles as hyperpartisan or non-hyperpartisan.

Jupyter Notebook
1
1 年前

This project was developed for a Kaggle competition focused on detecting Personally Identifiable Information (PII) in student writing. The primary objective was to build a robust model capable of identifying PII with high recall. The DeBERTa v3 transformer model was chosen for this task after comparing its performance with other transformer models.

Jupyter Notebook
1
10 个月前

Longformer model trained on HotpotQA

Python
0
2 年前

Focus - Understanding contextual retrievability.

Jupyter Notebook
0
5 个月前