Transcription & Subtitles

50+ languages

Automatic speech recognition across 50+ languages with word-level timestamps and confidence scores.

Speaker diarization

Identify and attribute speech to individual speakers. Filter transcripts and search results by person.

Multi-format export

Download transcripts as SRT, VTT, DOCX, TXT, or JSON. Every format includes timestamps and speaker labels.

How transcription works

Upload or stream

Ingest media via API, dashboard, or live RTMP stream

AI processing

Speech-to-text runs automatically — webhook fires on completion

Retrieve transcript

Query the transcript via REST API — download as JSON, SRT, VTT, or DOCX

Embed or integrate

Interactive subtitles in the player widget, or process text downstream

Capabilities

Automatic transcription in 50+ languages

Speaker diarization and attribution

Word-level timestamps and confidence scores

Subtitle export: SRT, VTT, DOCX, TXT, JSON

Live transcription for real-time captions

Custom vocabulary and proper noun injection

WCAG 2.1 AA and BITV 2.0 accessibility compliance

Interactive subtitles with click-to-jump in the player widget

Webhook notification on transcript completion

Transcript search via full-text and semantic API

API

Transcription via API

Retrieve transcripts, subtitles, and speaker data programmatically.

Transcript endpoint

Retrieve the full transcript with word-level timestamps, speaker labels, and confidence scores.

Subtitle export

Request SRT, VTT, DOCX, or TXT via format parameter. All formats include speaker labels and timestamps.

Speaker API

Query speakers per asset. Filter search results and RAG queries by individual speaker identity.

Webhooks

Receive real-time notification when transcription completes. Trigger downstream processing automatically.

Transcript search

Full-text and semantic search across all transcripts. Find spoken words by keyword or natural language.

Accessibility

WCAG 2.1 AA and BITV 2.0 compliant subtitles. Meet EU Web Accessibility Directive requirements for public-sector video content.

FoundationAPI & Ingest FoundationEmbedding & Widgets ScenarioAI Media Analysis

Ready to get started?

Schedule Demo View Transcript API docs