Changelog

Release history for the trx CLI.

0.4.0 2026-04-06

OpenAI API backend + large-v3-turbo

Added

  • OpenAI API transcription backendtrx transcribe -b openai uses the OpenAI API instead of local Whisper. Supports gpt-4o-transcribe, gpt-4o-mini-transcribe, and whisper-1.
  • large-v3-turbo model — near-large accuracy at ~3x speed for local transcription.
  • --backend flag on trx transcribe and trx init to choose between local and openai.
  • Backend selection in trx init — interactive prompt or --backend openai flag.
  • trx doctor shows backend status — API key detection, active model, backend type.
  • OS-specific open command — uses open on macOS, xdg-open on Linux, start on Windows.
  • Quoted paths with spaces — file paths containing spaces are now quoted in the terminal output for ctrl+click support.

Fixed

  • SRT timestamps for gpt-4o-transcribe models now use real audio duration via ffprobe instead of placeholder values.
0.3.2 2026-04-03

Cross-platform support

Added

  • Cross-platform trx init — Linux support via apt-get + source compilation for whisper-cli, Windows support via winget + pre-built binaries.
  • Platform detection shown during init: trx init (linux).

Fixed

  • Config defaults merge — readConfig() now merges saved config with defaults, preventing crashes when whisperFlags is missing.
  • FFmpeg path collision — when input is a .wav in the current directory, output no longer collides with input path.
0.3.0 2026-03-31

Word timestamps + language auto-detect

Added

  • Word-level timestamps via --words flag — generates SRT with per-word timing.
  • Auto-detect language by default-l flag now only forces a specific language override.

Fixed

  • Vercel deploy issues resolved (bun.lock conflicts, peer dependency warnings).
0.2.0 2026-03-30

Progress bar + agent skill

Added

  • Live progress bar during Whisper transcription with percentage indicator.
  • Word count summary shown after transcription completes.
  • open hint — suggests command to open the output file.
  • Agent skill auto-installtrx init offers to install the Claude Code skill.
  • OG images generated via Sharp for social sharing.
  • 21 e2e tests covering the full CLI surface.
  • Monorepo structure with separate website package at trx.crafter.run.

Fixed

  • Spinner hang after transcription — properly guards callbacks and exits.
0.1.0 2026-03-30

Initial release

Added

  • trx transcribe — transcribe any URL or local file via local Whisper.
  • trx init — one-command setup: installs whisper-cli, yt-dlp, ffmpeg, downloads model.
  • trx doctor — health check for all dependencies and configuration.
  • trx schema — runtime introspection for agent self-service.
  • Agent-first JSON output--output json for piping to other tools.
  • --dry-run — validate input and preview execution plan without running.
  • --fields — select specific output fields to save tokens.
  • Input validation — rejects path traversals, control characters, URL-encoded paths.
  • 99 language support via Whisper’s built-in language detection.
  • 5 model sizes — tiny, base, small, medium, large.