Repository navigation

#

language-identification

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python
1930
12 小时前
pemistahl/lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python
1330
1 个月前
pemistahl/lingua-go
Go
1237
2 个月前

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

Rust
950
5 天前

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.

TypeScript
355
19 天前

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

Python
303
7 年前

⚡️ 80x faster Fasttext language detection out of the box | Split text by language

Python
187
21 天前
Python
154
5 个月前

💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Python
127
5 个月前

Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks

Python
106
7 年前

Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch

Python
105
5 年前

Fast and accurate natural language detection. Detector written in Javascript. Nito-ELD, ELD.

JavaScript
62
6 个月前

This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.

Python
54
5 年前

End to End Dialect Identification using Convolutional Neural Network

Python
52
5 年前

✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux

Jupyter Notebook
51
2 个月前