🎤 Transcribe audio files using Qwen ASR. STT
Qwen ASR provides speech-to-text transcription powered by Qwen's open-source automatic speech recognition model. It accepts common audio formats including WAV, MP3, and OGG, converting them into written transcripts with multilingual support — all without requiring API keys or complex configuration.
uv runner for clean dependency management and execution

The skill uploads audio to Qwen's ASR demo service using Python's gradio_client library. It handles file transmission, processes the audio through the Qwen speech recognition model on the remote endpoint, and retrieves the resulting transcript. Input can be provided as a file path argument or piped via stdin for integration into larger processing pipelines.
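The upload-and-retrieve flow described above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual source: SPACE_ID and API_NAME are hypothetical placeholders (the real Space name and endpoint are not given here), and the helper names remote_predict and transcribe are invented for the example. Only Client, handle_file, and Client.predict come from gradio_client itself.

```python
from pathlib import Path

# Hypothetical placeholders: the real Space name and endpoint are not stated
# in this description and must be taken from the skill's source repository.
SPACE_ID = "your-org/qwen-asr-demo"
API_NAME = "/predict"


def remote_predict(audio_path):
    """Upload one audio file to the remote Gradio app and return its raw result."""
    # Imported lazily so the rest of the module loads without gradio_client.
    from gradio_client import Client, handle_file

    client = Client(SPACE_ID)
    return client.predict(handle_file(str(audio_path)), api_name=API_NAME)


def transcribe(audio_path, predict=remote_predict):
    """Validate the input file, run the prediction, and return plain text."""
    path = Path(audio_path)
    if not path.is_file():
        raise FileNotFoundError(f"audio file not found: {path}")
    result = predict(path)
    # A Gradio endpoint with several outputs returns a tuple or list;
    # this sketch assumes the transcript is the first element.
    if isinstance(result, (list, tuple)):
        result = result[0] if result else ""
    return str(result).strip()
```

Passing the prediction function as a parameter keeps the network call swappable, which makes it possible to try the surrounding logic offline before sending any audio to the remote service.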
Install the uv package manager via Homebrew (brew install uv) or pip (pip install uv); uv then handles Python dependency management automatically. Run a transcription with uv run scripts/main.py -f audio.wav, or pipe audio in with cat audio.wav | uv run scripts/main.py > transcript.txt. Audio is transmitted to a third-party demo service, so avoid processing sensitive or confidential audio through this skill.

MIT-0 (Free to use, modify, and redistribute. No a
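The two invocation styles mentioned above (a -f file argument, or audio piped on stdin) could be handled along these lines. This is a sketch under stated assumptions rather than the contents of scripts/main.py: the function name resolve_audio, the --file long option, and the .wav suffix given to spooled stdin data are all illustrative choices.

```python
import argparse
import sys
import tempfile
from pathlib import Path


def resolve_audio(argv=None, stdin=None):
    """Return a path to the audio: the -f argument, or stdin spooled to a temp file."""
    parser = argparse.ArgumentParser(description="Resolve audio input for transcription")
    # "-f" matches the documented usage; the "--file" long form is an assumption.
    parser.add_argument("-f", "--file", help="path to an audio file")
    args = parser.parse_args(argv)

    if args.file:
        return Path(args.file)

    # No -f given: read raw bytes from stdin (binary mode, not text).
    stream = stdin if stdin is not None else sys.stdin.buffer
    data = stream.read()
    if not data:
        parser.error("no audio given: pass -f FILE or pipe audio on stdin")

    # Assumption: piped data is written with a .wav suffix, since stdin
    # carries no file name from which to infer the real format.
    with tempfile.NamedTemporaryFile(suffix=".wav", delete=False) as tmp:
        tmp.write(data)
    return Path(tmp.name)
```

Spooling stdin to a temporary file gives the upload step a real path to work with, so both invocation styles can share the same code afterwards. The caller is responsible for deleting the temporary file once the transcript has been retrieved.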
No automatic installation available. Please visit the source repository for installation instructions.
© 2026 Torly.ai. All rights reserved.