Skip to content
Story AI ✦ Features Pricing Blog Try Demo — free →

For Storytellers, Journalists, and Directors: Your Footage Already Has a Structure

Tomislav Brdjanović
Published date:

Every story has a moment where it lands. A pause, a change of mind, a single sentence that carries everything the film is actually about. Your job is to find that moment and build toward it. The footage has it. You know it’s in there.

The problem is everything between you and that moment: hours of material, timecodes written in documents nobody else can read, files you’ve watched twice and will watch again. This is work that consumes your time without requiring your judgment. Logging. Reviewing. Remembering. Searching for what you already know is somewhere in there.

AI has two possible relationships with that work. It can try to make the decisions for you — assembling a cut, proposing a structure, telling you what matters. Or it can do only the analytical labour, and leave every narrative decision exactly where it belongs: with you.

AVScript is built on the second idea. It transcribes, analyses, and tags your footage so that the material is searchable, navigable, and intelligible. What that material means — what story it carries, which moment is the one — that stays yours. AI clears the path. You decide where it leads.

It doesn’t matter whether you’re a documentary director, a journalist on deadline, a researcher logging interviews, or a producer building a brief for an editor. The problem is the same. The solution is the same.

🎬 Documentary director
📰 Journalist
🔬 Researcher
📋 Logger
🎙 Producer
🎭 Story editor

Step One: Let Story AI Do the Logging

Before you can find anything, the footage needs to be analysed. You drop your files into AVScript, configure the pipeline, and run it. This is the only step that takes time — and you don’t have to watch anything.

🎙
Transcribe
Full AI transcription, word by word, with frame-accurate timecodes and automatic speaker separation.
→ searchable transcript
💜
Emotions AI
Scores the emotional register of every segment — joy, tension, concentration, sadness — second by second.
→ Emotion-Joy, Emotion-Tension…
👁
Visual AI
Tags visual content: scene type, topic, activity, environment. Understands what’s on screen, not just what’s being said.
→ Scene-Interior, Topic-Interview…

You configure which stages to run. Transcribe alone is enough to make every spoken word searchable. Emotions AI and Visual AI add another layer on top — but even without them, you’ve just replaced hours of manual logging with a process that runs while you sleep.

For loggers: the logging is already done. Every word, every speaker, every timecode — indexed and searchable the moment the pipeline completes. Your job shifts from logging to curating.


Search: Stop Scrubbing

Once the pipeline runs, your footage becomes a searchable archive. Not “searchable” in a vague, approximate sense — fully searchable, by word, by speaker, by emotion, by topic.

Filter:Text matchEmotion-TensionSpeaker ATopic-Decision
interview_A_day2.mp4
00:14:17 – 00:14:29
”We knew it was the right call, but… I’m not sure I slept that week.”
Emotion-TensionSpeaker-A
interview_A_day2.mp4
00:48:03 – 00:48:14
”There were nights I just couldn’t sleep. I kept thinking about what we were building.”
Emotion-ConcentrationSpeaker-A
interview_C_day3.mp4
00:22:51 – 00:23:02
”She called me at three in the morning. I hadn’t slept in two days.”
Emotion-TensionSpeaker-C

Type a word and every file that contains it surfaces immediately — with the exact timecode, the surrounding sentence, and the AI tags that describe the emotional register of that moment. Click to play. No scrubbing, no file-hunting, no “I think it was somewhere in the second interview.”

You can layer filters: find everything tagged Emotion-Tension from Speaker A, or all segments tagged Topic-Decision across every file in the project. The entire archive responds to what you’re looking for, not just the file you happen to have open.

For journalists: your sources are now searchable. Every interview you recorded — this project, any project — has a full transcript. Search by topic, by name, by concept. Find the quote you need in seconds rather than re-watching the clip.


What You’re Looking At: The Full Feature Set

Once your footage is in AVScript, here’s what you have access to:

📝
Transcript viewer
The full transcript of every file, word by word, with timecodes. Click any word to jump to that point in the video. Edit the transcript if the AI got something wrong — corrections are saved to the file and reflected everywhere.
💜
Emotion timeline
A visual graph of the emotional register across the whole clip — where tension peaks, where the subject relaxes, where concentration or sadness comes through. Scan an hour of footage emotionally in under a minute.
✍️
Text video editor
Edit the video by editing the transcript text. Select the lines you want to keep, delete the rest — the video assembles itself to match. The fastest way to pull a rough cut from an interview without touching a timeline.
→ produces a sequence you can export as FCPXML
🎛
Multicam view
If the same scene was captured on multiple cameras, watch them side by side — synced by timecode. Switch between angles, choose the best take for each moment. Works for interviews, performances, events, and anything shot with more than one camera.
→ your angle choices become a sequence
📤
Player sharing
Share a link to any clip, any moment, any timecode — with anyone. They open it in a browser. No account, no download, no software. The moment you want them to see plays directly.
Sequence builder
Pull dialogue lines from any file into a sequence, in any order. Rearrange them, remove what doesn’t work, preview the result. The current focus is the dialogue track — verbal structure, speaker by speaker. B-roll comes later.
→ exports as PDF script + FCPXML
🤖
Story AI chat
Ask Story AI anything about your material. What were the strongest emotional moments? Which speaker talked most about the decision? Summarise what was covered in a file. Story AI responds with timecodes, quotes, and context — and can generate a sequence suggestion directly from the conversation.
→ AI-proposed sequences you can use, adapt, or export

Text Video Editor: Cut by Editing Text

Of all the ways to interact with your material in AVScript, this one has the lowest barrier. You don’t need to know what a timeline is. You don’t need to think about clips or in/out points. You read, and you delete.

Open any transcribed file and you see the full transcript as editable text — every sentence a paragraph, every paragraph a selectable block. The video is synced to it: as your cursor moves through the text, the playhead follows. You’re reading and watching simultaneously.

When you find a line you don’t want in the cut, you delete it. Not the timecode — the actual words. The sentence disappears from the transcript. The corresponding clip is removed from the sequence. What remains is automatically assembled into a new cut.

✍️ Text Video Editor — interview_A_day2.mp4
4 of 7 segments kept · 0:52 total
Transcript — edit text to cut

Spk A “So, I mean, we had a lot of meetings about it. A lot of back and forth.”

Spk A “We knew it was the right call, but… I’m not sure I slept that week.”

Spk A “My co-founder kept asking me if I was okay. I told him I was fine.”

Spk A “There were nights I just couldn’t sleep. I kept thinking about what we were building.”

Spk A “We had another investor call that Friday — I honestly can’t remember what I said.”

Spk A “My kids asked me if we were going to be okay. That’s when I knew we had to move.”

Spk A “Anyway. We made the decision on a Sunday. I think it was a Sunday.”

Assembled sequence
00:14:17 – 00:14:29
Speaker A
”We knew it was the right call, but… I’m not sure I slept that week.”
00:48:03 – 00:48:14
Speaker A
”There were nights I just couldn’t sleep. I kept thinking about what we were building.”
00:31:05 – 00:31:19
Speaker A
”My kids asked me if we were going to be okay. That’s when I knew we had to move.”

The grey lines were deleted from the transcript. The blue lines stayed. The right side shows exactly what the cut looks like — playable, exportable as FCPXML.

This is the fastest path from raw interview to a structured selection, especially for people who think in sentences rather than timecodes. You’re essentially copy-editing your way to a rough cut. No timeline, no drag and drop, no clip handles. Just read, decide, delete.

Combined with Story AI chat: ask Story AI which lines it considers strongest, let it highlight them — then use the text editor to delete everything else. Raw footage to structured sequence in minutes.


Story AI: Ask Questions, Get a Cut

The most direct path from raw footage to a structured sequence isn’t always manual. Sometimes you know roughly what story you want to tell — you just need to ask.

Story AI chat works against your analysed footage. You ask questions in natural language. It responds with specific timecodes, direct quotes, and analysis of the material — not generic AI waffle, but answers grounded in what’s actually in your files.

And when you’re ready, you can ask it to propose a sequence. It suggests an ordered list of clips based on everything it knows about the material: the transcript, the emotions, the topics, the story arc you’ve described.

🤖 Story AI — Chapter 3 · 3 files analysed
you
What’s the most emotionally intense moment across all three interviews?
AI

The peak tension moment is Speaker A at 00:31:05 – 00:31:19 in interview_A_day2.mp4 — “My kids asked me if we were going to be okay. That’s when I knew we had to move.” Emotion-Tension score hits 0.94 here, the highest across all files. It follows a quieter buildup from 00:28:40 that gives it significant contrast.

you
Suggest a 3-clip opening sequence for Chapter 3. I want it to start quietly and build to that moment.
AI

Here’s a suggested opening:

Proposed sequence — Chapter 3 opening
101:02:44 – 01:02:56Speaker B — “Nobody wanted to be the first to say it out loud.” Sets the silence before the break.
200:14:17 – 00:14:29Speaker A — “We knew it was the right call, but… I’m not sure I slept that week.” Tension building.
300:31:05 – 00:31:19Speaker A — “My kids asked me if we were going to be okay.” Peak. End on this.

You can load this directly into the sequence builder, adjust it, or export as FCPXML.

The AI isn’t editing for you — it’s doing the analytical work so you can make faster decisions. You decide what to keep. The suggested sequence is a starting point, not a verdict.

For journalists on deadline: ask Story AI to find every moment where a specific topic came up across all your interviews, and order them by emotional intensity. In thirty seconds you have a shortlist that would have taken an hour of manual review.


Building the Story: From Fragments to Structure

This is the part that used to happen on paper, in a Google Doc, or not at all until you sat down with an editor and tried to explain what you had in your head.

In AVScript, you build the sequence directly. You’ve already searched your material and found the moments that matter. Now you pull them into order.

✦ Sequence Builder · Chapter 3 — The Decision
interview_A_day2.mp4
01:04:32 · 38 segments · Speaker A
interview_B_day1.mp4
00:48:11 · 24 segments · Speaker B
interview_C_day3.mp4
00:37:55 · 19 segments · Speaker C
Sequence — drag to reorder
00:14:17
Speaker A · interview_A_day2.mp4
”We knew it was the right call, but… I’m not sure I slept that week.”
01:02:44
Speaker B · interview_B_day1.mp4
”Nobody wanted to be the first to say it out loud. We just kept looking at each other.”
00:31:05
Speaker A · interview_A_day2.mp4
”My kids asked me if we were going to be okay. That’s when I knew we had to move.”

The current version of the sequence builder works with the dialogue track — interviews, narration, presenter pieces. B-roll placement is on the roadmap. What you’re building now is the verbal spine: the words, in the order they should be heard. That’s usually the hardest part anyway.

When you’re done:

The PDF looks like this — a real document, the kind that has lived on edit suite desks for decades, now generated automatically from your sequence:

AVScript PDF script export — media bin and sequence with TC in/out

For researchers and loggers

You don’t have to build a sequence at all. AVScript’s value for you might be entirely in the search and the transcript — a fully logged, fully searchable archive of everything you’ve shot, built automatically, accessible from anywhere. You can share it with colleagues, run searches across months of material, and find any spoken word in any file without ever opening a sequence builder.


Handing Off to the Editor

If there’s an editor in the chain, here’s what changes when you use AVScript.

You, in AVScript
sequence built, PDF exported
📄
PDF script
paper cut, industry format
+
📁
FCPXML
sequence, ready to open
🎬
Editor, any NLE
opens your cut, adapts it

The PDF gives them the roadmap. The FCPXML gives them the sequence ready to open. They’re not translating your feelings into a cut — they’re refining a structure you already built.

If the editor uses Final Cut Pro and you’ve set up MetaFlow, every source clip in their Library will also carry the captions, emotion keywords, and speaker labels from Story AI. The collaboration doesn’t end at the handoff — the intelligence travels with the footage into the edit.

For documentary directors: this is the paper edit reimagined. The old workflow — Betacam tapes, timecode logs, stacks of paper — existed because you needed to think through structure before you got into the edit suite. AVScript gives you that same discipline, digitally, with AI doing the logging work and the sequence builder replacing the paper.


One Script, Any NLE

Here’s something most storytellers don’t think about until they need it: your work in AVScript isn’t locked to any editing software.

Once the script and material are in AVScript — transcribed, analysed, sequenced — the output can go to whatever NLE the editor is working in. Final Cut Pro gets FCPXML 1.13 with full MetaFlow AI metadata. Premiere Pro gets a native XMEML file (File → Import, no Translation Report) with captions on the timeline and descriptions in the project bin. DaVinci Resolve accepts both. The PDF script is software-agnostic by definition. Whatever tool is on the other side, your structural work travels there intact.

AVScript
sequence + PDF built once
🎬
Final Cut Pro
FCPXML native + MetaFlow AI metadata
.fcpxml
🎞
Adobe Premiere Pro
Native XMEML — File → Import, no Translation Report
.xml
🎛
DaVinci Resolve
FCPXML import via File → Import Timeline
.fcpxml
📄
Any editor, any tool
PDF script works regardless of NLE
.pdf

This matters in practice more than it sounds. Productions change NLEs between projects. Co-productions involve editors in different studios with different setups. A news package might go to one editor; the long-form version to another who prefers Resolve. You build the story once in AVScript. The rest is routing.


We Won’t Charge You Twice for the Same Footage

This is a principle, not just a feature.

When Story AI processes a file — transcript, emotion scores, visual tags, speaker labels — that data is stored against the media file itself, permanently, not against the project. Delete the project, the data survives. Open the same file in a new project six months later, the data is there. Use the file in ten projects over three years, the analysis runs once and costs once.

We believe you should pay for work done, not work repeated. You invested in understanding your footage. That investment belongs to the footage, not to a session or a subscription cycle.

Run Story AI on an interview today. The transcript is available every time that file appears in any project you’ll ever make — immediately, with no re-processing, at no extra cost.

This matters most in ongoing work: a documentary shot over months, a news beat where the same sources recur, an institutional archive that grows over years. Each file is analysed once. Everything that analysis produced is yours forever.


Where to Start

If you’re new to AVScript, the fastest path to value is:

1 → Create a project, add your files to the bin.
2 → Run at minimum Transcribe through Story AI → Pipeline.
3 → Open the transcript viewer, click any word to jump to it.
4 → Use search to find a moment you’ve been trying to locate.

That’s it. Everything else — Emotions AI, Visual AI, sequence builder, PDF export, FCPXML handoff — builds on top of that foundation. Add stages as you need them.

The moment that makes the story land is already in your footage. AVScript’s only job is to help you reach it faster.

→ Start with your footage · → How Story AI works · → Handing off to an FCP editor

Previous
Motion Graphics Generator — Titles, Lower Thirds and Countdowns Inside the Editor
Next
For the Editor in FCP: AI Metadata That Works With You, Not Instead of You