audio-classification

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook
1330
2 years ago

Code for YouTube series: Deep Learning for Audio Classification

Jupyter Notebook
566
3 years ago

Urban sound classification using Deep Learning

Jupyter Notebook
520
3 years ago

Analyze unstructured data with Towhee: reverse image search, reverse video search, audio classification, question-answering systems, molecular search, and more.

Jupyter Notebook
509
2 years ago

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python
425
1 year ago

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python
402
1 year ago

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Python
393
3 years ago

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

Python
388
4 years ago

Efficient Training of Audio Transformers with Patchout

Python
345
2 years ago

A multi-channel neural network audio classifier using Keras

Python
268
4 years ago

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Python
182
2 months ago

Python
155
3 months ago

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Jupyter Notebook
150
3 years ago

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

Python
148
2 years ago

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

Python
148
9 months ago

🎶 dead simple audio classification

Python
137
6 years ago

Audio classification with VGGish as feature extractor in TensorFlow

Python
130
4 years ago