Repository navigation
librosa
- Website
- Wikipedia
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
Understanding emotions from audio files using neural networks and multiple datasets.
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
A Machine Learning Approach of Emotional Model
LibrosaCpp is a c++ implemention of librosa to compute short-time fourier transform coefficients,mel spectrogram or mfcc
AudioMuse AI: Leverages Librosa for sonic analysis and AI-powered clustering to create smart, tempo and mood-based playlists within Jellyfin and Navidrome API
Easier audio-based machine learning with TensorFlow.
(monophonic) audio to midi converter using Python and librosa
Human Emotion Understanding using multimodal dataset.
In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone
基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法
NineSong aims to provide Cloud native and AI extended solutions for data sharing in various ToC businesses
Music Synthesis with Python talk, originally given at PyGotham 2017.
Music genre classification model using CRNN