A PyTorch-based Speech Toolkit
-
Updated
May 14, 2024 - Python
A PyTorch-based Speech Toolkit
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23 languages that supported by VOSK then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES
Audio Analyser, a cutting-edge application designed to transform audio recordings into actionable insights using Microsoft Azure AI. It offers audio recording, speech-to-text conversion, and in-depth text analysis, providing users with comprehensive and insightful reports.
An innovative, open-source voice assistant powered by OpenAI's GPT-3, designed to provide interactive, conversational experiences through both voice and text inputs
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
🗣️ ⌨️ Speech-to-text on key for Linux
An easy to use tool to automatically detect and record speech in Unity.
React component and hook to initiate a SpeechRecognition session
nlw-expert-react, comverte notas de áudios em testo
Voice assistent for Desktop
Some useful composition api
A very fast speech recognizer built using selenium(webkit speech recognizer) with python
Python programs that takes in microphone audio and then returns audio and text for English and Spanish.
Add a description, image, and links to the speechrecognition topic page so that developers can more easily learn about it.
To associate your repository with the speechrecognition topic, visit your repo's landing page and select "manage topics."