Turn videos into structured data
Submit a URL. Get structured transcripts, summaries, and brand and object detection — in a single API call.
Example response. Try the playground free.
{
"transcription": {
"text": "Welcome to our product launch. Today we're introducing a new approach to video intelligence — structured AI analysis you can plug into any pipeline.",
"segments": [
{
"start": 0,
"end": 4.2,
"text": "Welcome to our product launch.",
"speaker": 0
},
{
"start": 4.2,
"end": 12.8,
"text": "Today we're introducing a new approach to video intelligence — structured AI analysis you can plug into any pipeline.",
"speaker": 0
}
],
"language": "en",
"duration_seconds": 245,
"word_count": 487,
"speakers_detected": 1
},
"summary": {
"text": "A product launch covering a structured video analysis API with brand detection, custom queries, and a native MCP integration.",
"key_topics": [
"video analysis API",
"brand detection",
"custom queries",
"MCP integration"
]
},
"brands": [
{
"name": "Figma",
"confidence": 0.94,
"total_appearances": 3,
"first_seen": 12,
"last_seen": 128,
"appearances": [
{
"timestamp": 12,
"source": "visual",
"context": "Logo visible in dashboard demo"
},
{
"timestamp": 45,
"source": "audio",
"speech_context": "mentioned by speaker"
},
{
"timestamp": 128,
"source": "visual",
"context": "Featured in tools comparison"
}
]
}
],
"queries": [
{
"query": "What products are mentioned?",
"answer": "Figma is referenced — used for design workflows, visible in the dashboard demo, and called out by name during the tools comparison segment.",
"evidence": [
{
"timestamp": 45,
"text": "...we use Figma for design workflows..."
},
{
"timestamp": 128,
"text": "...switching over to the tools comparison..."
}
]
}
]
}Custom queries
Ask anything about the video.
Send natural-language questions with any job. Every answer comes with timestamped evidence.
Yes. Pricing is mentioned twice — once in the intro at $49/mo for Pro, and once near the end framing it against competitors.
Core capabilities
Pick what you need. Skip what you don't.
Toggle transcription, summary, and brand and object detection per request. Structured JSON, every time.
Speech-to-text with timestamped segments, speaker diarization, and word-level timestamps.
Section summaries with key topics and chapter breakdowns.
Brands in speech and on screen, with confidence scores and temporal tracking.
Objects, products, and visual elements with categories and bounding boxes.
Proof frames
Every detection comes with receipts.
Each result links to the exact frame, timestamp, and audio mention. Verify anything in one click.
MCP
Plug into Claude.
Connect the MCP server and your agent can analyze videos, pull results, and fetch proof frames natively.
Analyzed. Found 3 brands:
- • Nike94% · 0:23, 1:14
- • MacBook Pro88% · 1:47
- • Starbucks91% · 3:12
Used 2 credits · 248 remaining
Pricing
Pay for what you analyze.
Credits consumed per minute of video. Start free, scale when you're ready.
$0
Get started with video analysis
$49/mo
For teams and production workloads
Custom
Dedicated support and custom limits