████████╗██████╗ ██╗ ██╗
╚══██╔══╝██╔══██╗╚██╗██╔╝
██║ ██████╔╝ ╚███╔╝
██║ ██╔══██╗ ██╔██╗
██║ ██║ ██║██╔╝ ██╗
╚═╝ ╚═╝ ╚═╝╚═╝ ╚═╝ Local Whisper transcription with agent-ready output. One command to transcribe any URL or file. Ship with a skill that auto-corrects whisper mistakes.
bun add -g @crafter/trxnpx skills add crafter-station/trx -g$ bun add -g @crafter/trx
$ trx init
✓ whisper-cli (1.8.4)
✓ yt-dlp (2026.02.04)
✓ ffmpeg (7.1)
Select Whisper model:
> small (~466 MB) — recommended
✓ Model downloaded
Install agent skill? Yes
✓ trx is ready
$ trx "https://youtube.com/watch?v=dQw4w9WgXcQ"
Downloading media...
Cleaning audio...
Transcribing with Whisper...
✓ Done: video.txt, video.srtThere's a TikTok reel, a YouTube tutorial, an Instagram story with exactly what you need. But you can't search inside video. You can't pipe it to an agent. You can't grep it.
trx turns any video into text with one command. Whisper runs locally -- no API keys, no rate limits, no cost. The small model handles 99 languages on your CPU. And the agent skill fixes what Whisper gets wrong.
$ trx init --model small
✓ whisper-cli installed
✓ yt-dlp installed
✓ ffmpeg installed
✓ ggml-small.bin downloaded
✓ Agent skill installed$ trx "https://tiktok.com/@dev/video/123"
Downloading media...
Cleaning audio...
Transcribing...
✓ dev-123.txt (1,247 words)$ trx doctor --output json
{
"healthy": true,
"dependencies": {
"whisper-cli": { "installed": true },
"ffmpeg": { "installed": true }
}
}$ trx schema transcribe
{
"command": "transcribe",
"flags": {
"--language": { "default": "auto" },
"--dry-run": { "type": "boolean" },
"--fields": { "description": "..." }
}
}