#end-to-end

JavaScript
8710
2 years ago

Facebook AI Research's Automatic Speech Recognition Toolkit

C++
6441
10 months ago

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Python
1980
2 years ago

[ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

Python
1391
7 months ago

Shows how the CFT modules can be composed to build a secure cloud foundation

HCL
1382
1 day ago

Real-time voice-interactive digital human, supporting an end-to-end voice solution (GLM-4-Voice - THG) and a cascaded solution (ASR-LLM-TTS-THG). Appearance and voice are customizable with no training required, voice cloning is supported, and first-packet latency is as low as 3 s.

Python
1105
6 months ago

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Python
1086
7 days ago

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Python
942
1 year ago

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

Python
850
10 months ago

User Simulation for Task-Completion Dialogues

OpenEdge ABL
804
2 years ago

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python
804
2 years ago

Tanker client-side encryption SDK for JavaScript

TypeScript
800
5 months ago

[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer

Python
738
2 years ago

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python
710
2 years ago

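China's first full-stack course on occupancy grid networks, "From BEV to Occupancy Network: Algorithm Principles and Engineering Practice", including on-device deployment. Surrounding Semantic Occupancy Perception Course for Autonomous Driving (docs, ppt and source code). Online course homepage: http://111.229.117.200:8100/ (built independently by the author)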

Python
678
1 year ago