AI Skill Market Insights

Real data. Real impact.

Popularity

Rising

Emerging

Active Users

0+

Developers

Time Saved

2+ hrs

Per week

Source

GitHub

Open source

Be Part of the 0+ Developer Community

Skills give you superpowers. Install in 30 seconds.

Qwen ASR provides speech-to-text transcription powered by Qwen's open-source automatic speech recognition model. It accepts common audio formats including WAV, MP3, and OGG, converting them into written transcripts with multilingual support — all without requiring API keys or complex configuration.

Key Features

Zero-configuration setup — no API keys or authentication tokens required to start transcribing
Multilingual speech recognition powered by a model trained on diverse web audio data, handling multiple languages
Flexible input handling supporting WAV, MP3, OGG, and other common audio formats via file path or stdin piping
Simple CLI interface using the
```
uv
```
runner for clean dependency management and execution
Free and open-source model with no per-request costs or usage limits beyond the demo service's capacity

Use Cases

Transcribing voice memos, meeting recordings, or interview audio as part of an agent research workflow
Converting audio messages in messaging apps to searchable, analyzable text
Building transcription pipelines that process batches of audio files into text for downstream analysis
Adding speech-to-text capability to agent workflows without managing API credentials or billing

How It Works

The skill uploads audio to Qwen's ASR demo service using Python's gradio_client library. It handles file transmission, processes the audio through the Qwen speech recognition model on the remote endpoint, and retrieves the resulting transcript. Input can be provided as a file path argument or piped via stdin for integration into larger processing pipelines.

Getting Started

Install the

uv

package manager via Homebrew (

brew install uv

) or pip, which handles Python dependency management automatically. Run transcriptions with

uv run scripts/main.py -f audio.wav

or pipe audio input with

cat audio.wav | uv run scripts/main.py > transcript.txt

. Note that audio is transmitted to a third-party demo service, so avoid processing sensitive or confidential audio through this skill.

License

MIT-0 (Free to use, modify, and redistribute. No a

🎤 Transcribe audio files using Qwen ASR. STT

AI Skill Market Insights

Be Part of the 0+ Developer Community

Key Features

Use Cases

How It Works

Getting Started

License

Quick Start

Manual Installation

TEAR & SHARE

Tags

using-git-worktrees

using-superpowers

Claude Code History Files Finder

Audio Summary

Audio Handler

Channels

Learn

Compare

Company

Agents