language-identification

A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS

Barcode barcode-scanner face-detection image-labeling object-detection text-recognition language-identification translation smart-reply mlkit mlkit-android Google ml-kit mlkit-genai mlkit-genai-image-description mlkit-genai-proofreading mlkit-genai-rewriting mlkit-genai-summarization

Java

3965

3040

1 month ago

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-diarization speaker-verification language-identification modelscope

Python

2439

220

2 months ago

pemistahl / lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

natural-language-processing language-detection language-recognition language-identification language-classification python-library

Python

1509

4 months ago

pemistahl / lingua-go

The most accurate natural language detection library for Go, suitable for short text and mixed-language text

natural-language-processing language-detection language-recognition language-classification language-identification language-processing golang-library Go language-modeling text-processing

1284

8 months ago

pemistahl / lingua-rs

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

Rust rust-library rust-crate language-detection language-classification language-recognition natural-language-processing language-identification language-processing

Rust

1000

12 days ago

pemistahl / lingua

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

language-detection language-processing kotlin-library java-library nlp-library natural-language-processing natural-language Android Library language-identification language-classification language-recognition

Kotlin

774

6 months ago

echogarden-project / echogarden

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.

language-identification speech speech-alignment speech-recognition speech-synthesis speech-to-text speech-translation text-to-speech language-detection source-separation command-line-interface Node.js

TypeScript

404

1 month ago

apcode / tensorflow_fasttext

Simple embedding-based text classifier inspired by fastText, implemented in TensorFlow

fasttext Tensorflow language-identification

Python

303

7 years ago

textpipe / textpipe

Textpipe: clean and extract metadata from text

natural-language-processing named-entity-recognition text-processing text-analysis language-identification

Python

302

4 years ago

LlmKira / fast-langdetect

⚡️ 80x faster Fasttext language detection out of the box | Split text by language

fasttext i18n language-identification svc tts

Python

245

18 days ago

vunb / vntk

Vietnamese NLP Toolkit for Node

vietnamese-nlp natural-language-processing vietnamese language-identification named-entity-recognition pos-tagging

JavaScript

216

2 years ago

adbar / simplemma

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

natural-language-processing lemmatizer tokenization wordlist morphological-analysis corpus-tools Parsing language-detection language-identification

Python

175

4 months ago

cisnlp / GlotLID

💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

language-detection language-identification language-classification language-recognition

Python

161

4 months ago

HPI-DeepLearning / crnn-lid

Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks

language-identification deep-learning computer-vision cnns Keras

Python

105

7 years ago

KrishnaDN / x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch

language-recognition language-identification speech

Python

105

5 years ago

SpeechFlow-io / Spoken_language_identification

TensorFlow-based spoken language identification

Python speech Tensorflow cnn language-identification language-recognition

Python

3 years ago

nitotm / efficient-language-detector-js

Fast and accurate natural language detection. Detector written in JavaScript. Nito-ELD, ELD.

JavaScript language language-detection language-identification natural-language natural-language-processing Node.js

JavaScript

1 year ago

DoodleBears / split-lang

✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux

natural-language-processing Python split i18n language-identification tts pip fasttext

Jupyter Notebook

17 days ago

microsoft / LID-tool

A word-level language identification tool that identifies the language of individual words in code-mixed text, e.g. text that mixes words from two languages, such as Hindi written in Roman script together with English.

language-identification natural-language-processing Python

Python

5 years ago

nitotm / efficient-language-detector

Fast and accurate natural language detection. Detector written in PHP. Nito-ELD, ELD.

language-detection natural-language natural-language-processing PHP language language-identification language-classification

PHP

3 months ago