texttospeech
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Generate TikTok Text-to-Speech voices in your browser
Text to speech package for Golang.
I will share about Machine Learning and Deep Learning.
A simple tool to demo text-to-speech using various services' voices. HTML5 and Vanilla JS.
Text to Speech NativeScript plugin for Android & iOS 📢
Whooby is a text-to-speech Android application for communicating within a group or community.
The only Text to Speech Telegram Inline Bot
Code snippets showing how to record I2S audio and store it as a .wav file on an ESP32 with an SD card, how to transcribe pre-recorded audio via the Deepgram SpeechToText (STT) API, and how to generate audio from text via the TextToSpeech (TTS) APIs from OpenAI, SpeechGen, and/or Google TTS. Also covers triggering ESP32 actions by voice.
Text To Speech Demo in ReactJS Application using Azure Avatar AI Service.
This repo implements text to speech with a learnable audio encoder, without alignment to a transcript reference.
Simple application for continuous speech to text without the Google dialog
Text to speech is an emerging area of AI. This repository helps create a dataset of audio and transcripts for personalized text to speech.
Explore AI Capabilities for Your .NET Projects with OpenAI's API: Unlock the power of AI in your applications
🦸🏻‍♂️🎺 Talkify is a comprehensive, cross-platform Swift library for adding advanced speech features to your applications. It efficiently manages voice-to-text and text-to-speech capabilities using the AVFoundation and Speech frameworks.
Text to Speech for Android Application with Google API
GLUE is a lightweight, Python-based collection of scripts that helps you succeed with speech and text use cases based on Microsoft Azure Cognitive Services.
💻🔊 A Chrome extension that converts text on the web to speech. This was my final project for Harvard's CS50x course.
ESP32-based voice device for chatting with multiple custom AI bots. Records questions with an I2S microphone, transcribes them via ElevenLabs or Deepgram STT, and creates responses with a Groq or OpenAI LLM. TTS audio output with custom AI voices via I2S and a speaker. Supports ongoing dialogues, calling bots ‘by name’, and real-time web search via keyword.