Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
-
Updated
Aug 15, 2025 - Svelte
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Lightweight and powerful real-time audio/speech translation tool based on Windows LiveCaptions.
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Tero Subtitler is an open source, cross-platform, and free subtitle editing software.
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
Simple self-hosted web application, which can be used to convert audio to subtitles by OpenAI's Whisper model
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically.
NPM Library to transcribe Audio & Videos completely in browser with WebGPU and WebCodecs. 100% private and offline with WASM fallbacks
Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.
Persian ASR dataset
"Speech-to-Text Realtime with Extension" is a browser extension that converts speech to text in real-time. It supports multiple languages, making it ideal for note-taking, customer service, and accessibility. Easy to install and use on popular browsers.
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
Vocal Prism — Privacy-first, local AI audio transcription for macOS (Whisper → CoreML, Apple Silicon‑optimized).
State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>
Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.
core shell functions building blocks for advanced AI pipelines
A SwiftUI App For People Who Need To Take Down Important Information Quickly.
An advanced study tool that transforms raw audio recordings and PDF slides into structured, professional LaTeX university notes. Powered by fast local transcription (Whisper) and Google Gemini AI for intelligent summarization and context integration.
Add a description, image, and links to the audio-to-text topic page so that developers can more easily learn about it.
To associate your repository with the audio-to-text topic, visit your repo's landing page and select "manage topics."