Repository navigation

#

end-to-end

JavaScript
8733
2 年前

Facebook AI Research's Automatic Speech Recognition Toolkit

C++
6421
5 个月前

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Python
1976
1 年前

Shows how the CFT modules can be composed to build a secure cloud foundation

HCL
1306
4 天前

[ICLR'23 Spotlight & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

Python
1260
2 个月前

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Python
943
8 个月前

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Python
917
2 个月前

实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.

Python
886
1 个月前

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

Python
818
4 个月前

User Simulation for Task-Completion Dialogues

OpenEdge ABL
805
2 年前

Tanker client-side encryption SDK for JavaScript

TypeScript
799
3 个月前

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python
786
2 年前

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python
697
1 年前

[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer

Python
689
1 年前

End-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)

Python
652
5 年前