A simple HTTP server written in Rust for the OpenAI Whisper speech-to-text model.
- 🎧 Transcribe audio files
- 🔄 OpenAI API compatibility
- 🌈 Models
tiny.entinybase.enbasesmall.ensmallmedium.enmediumlargelarge-v1
- 🌎 Languages
- 🇬🇧 English (en)
- 🇨🇳 Chinese (zh)
- 🇩🇪 German (de)
- 🇪🇸 Spanish (es)
- 🇷🇺 Russian (ru)
- Clone this repository:
git clone https://github.com/tiero/whisperd.git- Navigate to the repository and build:
cd whisperd
cargo build --release- Run the server:
./target/release/whisperd serve --model_path path_to_whisper_modelNow, the server is running at http://localhost:8000 and ready to transcribe!
- Start the transcription server:
whisperd serve --port 5000 --model_path <path_to_model> - Transcribe a given audio file (this downloads the model automatically from HuggingFace):
whisperd transcribe --audio <path_to_audio>For more advanced options, use:
whisperd --helpPull requests and issues are welcome!
This project is licensed under the MIT License - see the LICENSE file for details.