How It Works

From Raw Transcript to Actionable Insight

A five-step pipeline that takes any conversation recording and produces structured, quantified analysis. Every step is observable, every run is cancellable.

The Pipeline

Each step runs in sequence. SSE streaming shows progress in real time.

1

Upload

Upload VTT, SRT, or plain text transcripts up to 50MB. Drag-and-drop or API upload. File extension and content signature are both validated.

2

Parse

Automatic format detection identifies the file type by signature, extension, and content analysis. Speakers are extracted, timestamps normalized, and the transcript is stored in a canonical schema.

3

Analyze

Select from 7 analysis types. Each runs as an IronCrew flow with one or more AI agents. Results stream back via SSE with per-phase progress updates.

4

Review

Examine structured results: summaries, topic clusters, extracted quotes, sentiment timelines, speaker profiles, behavioral DNA classification, and interviewer scorecards. Every finding links back to source evidence.

5

Export

Export analysis results as JSON for programmatic consumption, CSV for spreadsheets, or formatted Markdown for reports and presentations.

Format Support

Built-in format detection handles the messy reality of transcript files.

VTT

WebVTT format with full timestamp and speaker label parsing. Handles multi-line cues, notes, and style blocks.

SRT

SubRip subtitle format. Timestamps are converted to the canonical schema. Speaker labels extracted from cue text.

Plain Text

Unstructured text with speaker prefixes (e.g., "Speaker 1:"). Heuristic detection identifies speaker turns without timestamps.

transcribeit CLI

Need to transcribe audio or video first? The transcribeit CLI handles local-first transcription with speaker diarization. Rust-based, runs entirely on your machine.

Cognitive-OCR

Need document context? Extract structured content from PDF, DOCX, PPTX, and images. Use as additional context for generating new or analyzing existing transcripts.

Real-Time Analysis

Watch analysis unfold in real time. Cancel any run mid-flight.

SSE Streaming

Every analysis streams progress via Server-Sent Events. The UI shows each agent phase as it completes, with inline results and timing information.

No polling, no page refreshes. The connection stays open from start to finish, delivering structured events for each phase transition.

GET /api/transcripts/:id/analyze/stream?type=speakers
event: phase
data: {"phase":"speaker_detection","status":"complete"}
event: phase
data: {"phase":"behavioral_dna","status":"running"}
event: result
data: {"speakers":3,"dna":"Evidence Harmonizer"}
--- abort available ---
POST /api/ironcrew/abort?flow=...&run_id=...

Export & Reports

Get results in the format your downstream systems need.

JSON Export

Full structured output with all analysis fields, confidence scores, and source references. Ready for programmatic consumption.

CSV Export

Flattened tabular output for spreadsheets and BI tools. One row per finding with all metadata columns.

Markdown Reports

Formatted reports with headings, tables, and inline quotes. Ready for presentations, wikis, or documentation.

Try the Pipeline

Upload a transcript and watch the full analysis pipeline run. We will walk you through every step.