Changelog
Release history for the trx CLI.
0.4.0 2026-04-06
OpenAI API backend + large-v3-turbo
Added
- OpenAI API transcription backend —
trx transcribe -b openaiuses the OpenAI API instead of local Whisper. Supportsgpt-4o-transcribe,gpt-4o-mini-transcribe, andwhisper-1. large-v3-turbomodel — near-large accuracy at ~3x speed for local transcription.--backendflag ontrx transcribeandtrx initto choose betweenlocalandopenai.- Backend selection in
trx init— interactive prompt or--backend openaiflag. trx doctorshows backend status — API key detection, active model, backend type.- OS-specific open command — uses
openon macOS,xdg-openon Linux,starton Windows. - Quoted paths with spaces — file paths containing spaces are now quoted in the terminal output for ctrl+click support.
Fixed
- SRT timestamps for
gpt-4o-transcribemodels now use real audio duration viaffprobeinstead of placeholder values.
0.3.2 2026-04-03
Cross-platform support
Added
- Cross-platform
trx init— Linux support viaapt-get+ source compilation for whisper-cli, Windows support viawinget+ pre-built binaries. - Platform detection shown during init:
trx init (linux).
Fixed
- Config defaults merge —
readConfig()now merges saved config with defaults, preventing crashes whenwhisperFlagsis missing. - FFmpeg path collision — when input is a
.wavin the current directory, output no longer collides with input path.
0.3.0 2026-03-31
Word timestamps + language auto-detect
Added
- Word-level timestamps via
--wordsflag — generates SRT with per-word timing. - Auto-detect language by default —
-lflag now only forces a specific language override.
Fixed
- Vercel deploy issues resolved (bun.lock conflicts, peer dependency warnings).
0.2.0 2026-03-30
Progress bar + agent skill
Added
- Live progress bar during Whisper transcription with percentage indicator.
- Word count summary shown after transcription completes.
openhint — suggests command to open the output file.- Agent skill auto-install —
trx initoffers to install the Claude Code skill. - OG images generated via Sharp for social sharing.
- 21 e2e tests covering the full CLI surface.
- Monorepo structure with separate website package at
trx.crafter.run.
Fixed
- Spinner hang after transcription — properly guards callbacks and exits.
0.1.0 2026-03-30
Initial release
Added
trx transcribe— transcribe any URL or local file via local Whisper.trx init— one-command setup: installs whisper-cli, yt-dlp, ffmpeg, downloads model.trx doctor— health check for all dependencies and configuration.trx schema— runtime introspection for agent self-service.- Agent-first JSON output —
--output jsonfor piping to other tools. --dry-run— validate input and preview execution plan without running.--fields— select specific output fields to save tokens.- Input validation — rejects path traversals, control characters, URL-encoded paths.
- 99 language support via Whisper’s built-in language detection.
- 5 model sizes — tiny, base, small, medium, large.