POST /v1/platform/videos

Creates a video request. The AI:

1. Writes a multi-scene narration script from your prompt.
2. Generates TTS audio (gpt-4o-mini-tts, voice alloy).
3. Searches Pexels and Pixabay for matching stock clips per scene.
4. Builds subtitle chunks aligned to TTS duration.
5. Saves everything as editable scenes returned in the response.

The render step (assembling the final .mp4 via Remotion Lambda) is not auto-triggered. Call POST /v1/platform/videos/edit/:editToken/render when you (or a human reviewer) are happy with the scenes.

Agent Quick Reference

| You want… | Call this | Result |
| --- | --- | --- |
| AI scene script + TTS + stock clips | `POST /v1/platform/videos` with `mode: "sync"` | Response in 30–90 s |
| Background generation | `POST /v1/platform/videos` with `mode: "async"` | Returns immediately. Poll `GET /v1/platform/videos/:id` |
| Idempotent retry safety | Add `Idempotency-Key` header | Same key returns the original response |
| To render the final mp4 | `POST /v1/platform/videos/edit/:editToken/render` | Remotion Lambda runs; poll for status |
| To swap a scene's stock clip | `POST /v1/platform/videos/edit/:editToken/scenes/:sceneId/clip/select` | Replaces clip in place |
| To regenerate TTS for one scene | `POST /v1/platform/videos/edit/:editToken/scenes/:sceneId/tts` | Re-narrates that scene only |
| Background music | `POST /v1/platform/videos/edit/:editToken/bgm/select` | Adds preset BGM track |

Request Body

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `prompt` | `string` | Yes | Topic/brief for the narrated video. Concrete prompts produce stronger scripts. |
| `title` | `string` | No | Override the AI-generated title. |
| `description` | `string` | No | Override the AI-generated description. |
| `language` | `string` | No | Output language (default: `en`). Affects script + TTS pronunciation. |
| `sceneCount` | `number` | No | Target number of scenes (1–50, default: 5). |
| `aspectRatio` | `string` | No | `16:9` (default), `9:16` (vertical/Reels), or `1:1`. |
| `targetDurationSeconds` | `number` | No | Target total duration (10–600 s, default: 60). The AI scales script length to fit. |
| `classroomId` | `string` | No | Classroom UUID to create the video in. Uses the workspace default classroom if omitted. |
| `tier` | `string` | No | Legacy request value only: `basic`, `standard`, or `advanced`. Omit for new integrations. Do not send `default`; `default` is a response/catalog tier. |
| `mode` | `string` | No | `sync` (default) or `async`. See Async Mode. |
| `idempotencyKey` | `string` | No | Prevents duplicate processing. Can also be sent as `Idempotency-Key` HTTP header. |
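The field constraints in the table can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: `build_create_payload` and its constants are hypothetical names, and the server may apply additional or different validation.

```python
from typing import Any, Dict

# Allowed values copied from the Request Body table.
ASPECT_RATIOS = ("16:9", "9:16", "1:1")
MODES = ("sync", "async")
LEGACY_TIERS = ("basic", "standard", "advanced")
# Optional string fields passed through unchanged when provided.
PASSTHROUGH = ("title", "description", "language", "classroomId", "idempotencyKey")
VALIDATED = ("sceneCount", "targetDurationSeconds", "aspectRatio", "mode", "tier")


def build_create_payload(prompt: str, **fields: Any) -> Dict[str, Any]:
    """Return a JSON-ready request body, raising ValueError on bad input.

    Hypothetical client-side helper; field names match the table above.
    """
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required")
    unknown = set(fields) - set(PASSTHROUGH) - set(VALIDATED)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")

    payload: Dict[str, Any] = {"prompt": prompt}

    def check(name: str, is_valid, message: str) -> None:
        value = fields.get(name)
        if value is None:
            return  # omitted fields fall back to the documented defaults
        if not is_valid(value):
            raise ValueError(f"{name} {message}")
        payload[name] = value

    check("sceneCount", lambda v: 1 <= v <= 50, "must be between 1 and 50")
    check("targetDurationSeconds", lambda v: 10 <= v <= 600,
          "must be between 10 and 600")
    check("aspectRatio", lambda v: v in ASPECT_RATIOS,
          f"must be one of {ASPECT_RATIOS}")
    check("mode", lambda v: v in MODES, f"must be one of {MODES}")
    # "default" is a response/catalog tier and must not be sent.
    check("tier", lambda v: v in LEGACY_TIERS, f"must be one of {LEGACY_TIERS}")

    for name in PASSTHROUGH:
        if fields.get(name) is not None:
            payload[name] = fields[name]
    return payload
```

For example, `build_create_payload("What is photosynthesis?", sceneCount=4, targetDurationSeconds=30)` returns a body with only those three keys, while `tier="default"` raises `ValueError`.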

Pricing

The Video API is billed per generated scene for the base creation flow. Render minutes and TTS regeneration are billed as separate sub-events at the endpoints that trigger them.

| Event | Endpoint | Price |
| --- | --- | --- |
| Base scene generation | `POST /v1/platform/videos` (per scene generated) | $0.04 / scene |
| TTS regeneration | `POST /v1/platform/videos/edit/:editToken/scenes/:sceneId/tts` | $0.02 / call |
| Remotion render | Charged on render-callback completion | $0.15 / video minute |

The priceSnapshot on the create response covers only base scene generation. Render-time and TTS-regen billing produces separate PlatformUsageRecord rows with their own priceSnapshot payloads. Render minutes are tracked at fractional precision (e.g. 1.4 min × $0.15 = $0.21). Partial seconds are preserved end-to-end.
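To budget a job before running it, the three price points can be combined into a rough estimate. This is a sketch under stated assumptions: `estimate_cost_usd` is a hypothetical helper, the prices are copied from the table above and may change, and no rounding is applied because the billing rounding rule is not specified here.

```python
from decimal import Decimal

# Prices copied from the Pricing table; treat them as a snapshot.
SCENE_PRICE = Decimal("0.04")              # per generated scene
TTS_REGEN_PRICE = Decimal("0.02")          # per TTS regeneration call
RENDER_PRICE_PER_MINUTE = Decimal("0.15")  # per rendered video minute


def estimate_cost_usd(scenes: int, tts_regens: int = 0,
                      render_seconds: float = 0.0) -> Decimal:
    """Estimate the total USD cost of one video (hypothetical helper).

    Render time is taken in seconds and converted to fractional minutes,
    mirroring the fractional-minute tracking described above.
    """
    if scenes < 0 or tts_regens < 0 or render_seconds < 0:
        raise ValueError("quantities must be non-negative")
    # Decimal(str(...)) avoids binary floating-point noise in the result.
    render_minutes = Decimal(str(render_seconds)) / Decimal(60)
    return (SCENE_PRICE * scenes
            + TTS_REGEN_PRICE * tts_regens
            + RENDER_PRICE_PER_MINUTE * render_minutes)
```

With four scenes, one TTS regeneration, and an 84-second (1.4 min) render, the parts are $0.16, $0.02, and $0.21.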

Legacy tier field: still accepted for backwards compatibility but has no effect on price. Omit it in new integrations. If you send it, use one of the legacy request values: basic, standard, or advanced.

Example Request

curl -X POST https://api.tutorflow.io/v1/platform/videos \
  -H "Authorization: Bearer tf_platform_..." \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "What is photosynthesis? A 30-second explainer for kids",
    "sceneCount": 4,
    "aspectRatio": "16:9",
    "targetDurationSeconds": 30
  }'

Response Fields

| Field | Type | Description |
| --- | --- | --- |
| `id` | `string` | Video request ID. |
| `videoId` | `string \| null` | Internal video content ID. `null` until generation completes. |
| `status` | `string` | `PENDING`, `PROCESSING`, `COMPLETED`, or `FAILED`. |
| `isTerminal` | `boolean` | `true` once status is `COMPLETED` or `FAILED`. |
| `title` | `string \| null` | AI-generated or overridden title. |
| `description` | `string \| null` | Short description. |
| `language` | `string \| null` | Output language. |
| `sceneCount` | `number \| null` | Number of scenes actually produced. |
| `aspectRatio` | `string \| null` | Echoed aspect ratio. |
| `targetDurationSeconds` | `number \| null` | Echoed target duration. |
| `slug` | `string \| null` | URL-friendly slug. |
| `tier` | `string \| null` | Pricing tier used. |
| `mode` | `string \| null` | `sync` or `async`. |
| `priceSnapshot` | `object \| null` | Pricing details captured at request time. |
| `shareToken` | `string \| null` | Permanent token for the public viewer. |
| `editToken` | `string \| null` | Sliding-window token for the editor and edit-time mutations. |
| `editTokenExpiresAt` | `string \| null` | ISO 8601 expiry. Extended automatically on edit-token reads. |
| `previewUrl` | `string \| null` | Editor, `/{locale}/platform/videos/edit/{editToken}`. |
| `publicUrl` | `string \| null` | Public viewer, `/{locale}/platform/videos/{shareToken}`. |
| `renderStatus` | `string \| null` | `IDLE`, `RENDERING`, `COMPLETED`, or `FAILED`. `IDLE` immediately after creation. |
| `videoKey` | `string \| null` | S3 key where the rendered mp4 will live (set when render is triggered). |
| `renderTriggerUrl` | `string \| null` | Endpoint to call to start a render. |
| `renderPollUrl` | `string \| null` | Endpoint to poll for render progress. |
| `pollAfterMs` | `number \| null` | Suggested polling interval in milliseconds (async only). |
| `idempotencyKey` | `string \| null` | Echoed idempotency key. |
| `idempotentReplay` | `boolean \| null` | `true` if this is a replay. |
| `createdAt` | `string` | ISO 8601 timestamp. |
| `completedAt` | `string \| null` | ISO 8601 timestamp of when scene generation finished. |

Example Response

{
  "id": "e41a085a-e43f-4f63-92f4-9e11250f6e63",
  "videoId": "e10b8286-94ad-4139-a946-3548b46f6d07",
  "status": "COMPLETED",
  "isTerminal": true,
  "title": "What Is Photosynthesis for Kids?",
  "description": "A 30-second explainer covering how plants make food and produce oxygen.",
  "language": "en",
  "sceneCount": 4,
  "aspectRatio": "16:9",
  "targetDurationSeconds": 30,
  "slug": "what-is-photosynthesis-for-kids-d6a02f3b",
  "tier": "default",
  "mode": "sync",
  "priceSnapshot": {
    "category": "video",
    "catalogKey": "video.default",
    "tier": "default",
    "unit": "scene",
    "unitPrice": 0.04,
    "units": 4,
    "amountUsd": 0.16,
    "currency": "USD",
    "source": "platform_pricing_catalog_v2"
  },
  "shareToken": "b64adc04eca28e709add5568af2c414c",
  "editToken": "2dce05b2f639b93ec7559d779b3db71c",
  "editTokenExpiresAt": "2026-04-25T14:15:28.628Z",
  "previewUrl": "https://tutorflow.io/en/platform/videos/edit/2dce05b2f639b93ec7559d779b3db71c",
  "publicUrl": "https://tutorflow.io/en/platform/videos/b64adc04eca28e709add5568af2c414c",
  "renderStatus": "IDLE",
  "videoKey": null,
  "renderPollUrl": "GET /v1/platform/videos/edit/2dce05b2f639b93ec7559d779b3db71c",
  "renderTriggerUrl": "POST /v1/platform/videos/edit/2dce05b2f639b93ec7559d779b3db71c/render",
  "requiresRenderTrigger": true,
  "nextSteps": {
    "triggerRender": "POST /v1/platform/videos/edit/2dce05b2f639b93ec7559d779b3db71c/render",
    "pollRender": "GET /v1/platform/videos/edit/2dce05b2f639b93ec7559d779b3db71c",
    "editVideo": "PATCH /v1/platform/videos/edit/2dce05b2f639b93ec7559d779b3db71c"
  },
  "idempotencyKey": null,
  "idempotentReplay": null,
  "createdAt": "2026-04-25T04:14:47.546Z",
  "completedAt": "2026-04-25T04:15:28.677Z",
  "pollAfterMs": null
}
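In the example response, `renderTriggerUrl`, `renderPollUrl`, and the `nextSteps` entries are strings of the form `METHOD /path` rather than plain URLs. A client that wants to follow them has to split the two parts first. The sketch below assumes that format holds for every response; `split_action` and `render_trigger` are hypothetical helper names.

```python
from typing import Any, Dict, Optional, Tuple


def split_action(action: str) -> Tuple[str, str]:
    """Split a 'METHOD /path' string into (method, path)."""
    method, _, path = action.strip().partition(" ")
    if not method or not path.startswith("/"):
        raise ValueError(f"unexpected action format: {action!r}")
    return method.upper(), path


def render_trigger(video: Dict[str, Any]) -> Optional[Tuple[str, str]]:
    """Return (method, path) for starting a render, or None if not ready.

    A render is only considered ready once scene generation has
    completed and the response carries a renderTriggerUrl.
    """
    action = video.get("renderTriggerUrl")
    if video.get("status") != "COMPLETED" or not action:
        return None
    return split_action(action)
```

Applied to the example response, `render_trigger` yields the `POST` method and the `/v1/platform/videos/edit/{editToken}/render` path for that edit token.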

Workflow

The typical agent workflow has four steps:

1. POST /v1/platform/videos             → scenes ready, renderStatus: IDLE
2. POST /v1/platform/videos/edit/:editToken/render   → render starts
3. GET  /v1/platform/videos/edit/:editToken          → poll until renderStatus: COMPLETED
4. video file is at videoKey on S3

You can hand the user the previewUrl between steps 1 and 2 so they can review the scenes before paying for a render, or go straight to step 2 if no review is needed.
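The four steps above can be sketched as a single function. This is an illustration, not a reference client: `create_and_render` is a hypothetical name, HTTP is injected as a `call(method, path, body)` function so no particular library is assumed, the create call is assumed to run in `sync` mode, and the polling interval and limit are arbitrary defaults.

```python
import time
from typing import Any, Callable, Dict, Optional

# call(method, path, json_body) -> parsed JSON response (caller-supplied).
Call = Callable[[str, str, Optional[Dict[str, Any]]], Dict[str, Any]]


def create_and_render(call: Call, payload: Dict[str, Any],
                      poll_seconds: float = 5.0, max_polls: int = 120,
                      sleep: Callable[[float], None] = time.sleep) -> str:
    """Run create -> render -> poll and return the final videoKey."""
    # Step 1: generate scenes (sync mode returns the finished request).
    video = call("POST", "/v1/platform/videos", payload)
    if video.get("status") != "COMPLETED":
        raise RuntimeError(f"scene generation ended as {video.get('status')}")

    edit_path = f"/v1/platform/videos/edit/{video['editToken']}"

    # Step 2: trigger the render explicitly; it is never auto-triggered.
    call("POST", f"{edit_path}/render", None)

    # Step 3: poll the edit endpoint until the render finishes.
    for _ in range(max_polls):
        state = call("GET", edit_path, None)
        status = state.get("renderStatus")
        if status == "COMPLETED":
            # Step 4: the mp4 lives at this S3 key.
            return state["videoKey"]
        if status == "FAILED":
            raise RuntimeError("render failed")
        sleep(poll_seconds)
    raise TimeoutError("render did not finish within the polling budget")
```

To insert a human review, split the function after step 1 and return `video["previewUrl"]` to the reviewer before continuing.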

Triggering a Render

curl -X POST https://api.tutorflow.io/v1/platform/videos/edit/{editToken}/render

The response is the full VideoEditResDto, with renderStatus: "RENDERING" and videoKey set to the S3 key where the mp4 will appear.

To poll progress:

curl https://api.tutorflow.io/v1/platform/videos/edit/{editToken}

When renderStatus === "COMPLETED", fetch the file from https://{s3-bucket}.s3.{region}.amazonaws.com/{videoKey}.

To cancel an in-progress render:

curl -X POST https://api.tutorflow.io/v1/platform/videos/edit/{editToken}/render/cancel

This sets renderStatus back to IDLE. The Lambda may finish in the background, but its callback is ignored.
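The render lifecycle described in this section reduces to a small decision on `renderStatus`, plus building the download URL once the render completes. The sketch below is illustrative: `next_render_action` and `rendered_file_url` are hypothetical helpers, the action labels are arbitrary, and the bucket and region are placeholders you must supply because this page does not state them.

```python
from urllib.parse import quote


def next_render_action(render_status: str) -> str:
    """Map a renderStatus value to the client's next move."""
    actions = {
        "IDLE": "trigger",       # not started, or reset by a cancel
        "RENDERING": "poll",     # keep polling the edit endpoint
        "COMPLETED": "download", # the mp4 is available at videoKey
        "FAILED": "report",      # surface the failure to the caller
    }
    try:
        return actions[render_status]
    except KeyError:
        raise ValueError(f"unknown renderStatus: {render_status!r}") from None


def rendered_file_url(bucket: str, region: str, video_key: str) -> str:
    """Build the S3 URL for a finished render.

    The key is percent-encoded; '/' is kept so key prefixes stay intact.
    """
    return f"https://{bucket}.s3.{region}.amazonaws.com/{quote(video_key)}"
```

Because a cancel resets the status to `IDLE`, the same mapping lets a client start a fresh render after cancelling.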

Editing Scenes

The edit token also unlocks per-scene mutations. See Edit Video for the full surface (add scene, update scene, replace clip, regenerate TTS, add BGM, upload custom clip, sync durations).

Async Mode

Set mode: "async" to queue generation as a background job. The response returns immediately with status: "PENDING". Poll GET /v1/platform/videos/:id until isTerminal is true.
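A polling loop for async mode might look like the following sketch. `wait_for_video` is a hypothetical helper, the HTTP GET is injected as `fetch(path)`, and the fallback interval and poll limit are assumptions. The loop uses `pollAfterMs` from each response when it is present.

```python
import time
from typing import Any, Callable, Dict


def wait_for_video(fetch: Callable[[str], Dict[str, Any]], video_id: str,
                   default_poll_ms: int = 2000, max_polls: int = 150,
                   sleep: Callable[[float], None] = time.sleep) -> Dict[str, Any]:
    """Poll GET /v1/platform/videos/:id until isTerminal is true."""
    path = f"/v1/platform/videos/{video_id}"
    for _ in range(max_polls):
        video = fetch(path)
        if video.get("isTerminal"):
            # Terminal means COMPLETED or FAILED; the caller checks which.
            return video
        # Prefer the server's suggested interval, else the assumed default.
        delay_ms = video.get("pollAfterMs") or default_poll_ms
        sleep(delay_ms / 1000)
    raise TimeoutError(f"video {video_id} did not reach a terminal status")
```

The returned object can still have `status: "FAILED"`, so check `status` before triggering a render.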

Idempotency

Pass idempotencyKey in the request body or send an Idempotency-Key header to prevent duplicate generation. Reusing the same key returns the original response with idempotentReplay: true.
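One way to make retries safe is to derive the key from the request body, so a retried request always carries the same key. This is a sketch of one possible scheme, not something the API prescribes: `idempotency_key_for` and `create_headers` are hypothetical helpers, and the 32-character length is an arbitrary choice.

```python
import hashlib
import json
from typing import Any, Dict


def idempotency_key_for(payload: Dict[str, Any]) -> str:
    """Derive a stable key from the request body.

    Keys are sorted before hashing so field order does not matter.
    """
    canonical = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:32]


def create_headers(api_key: str, payload: Dict[str, Any]) -> Dict[str, str]:
    """Build request headers including the Idempotency-Key header."""
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
        "Idempotency-Key": idempotency_key_for(payload),
    }
```

A content-derived key also collapses two intentionally identical requests into one. If you need two separate videos from the same prompt, add a distinguishing value (for example a job ID of your own) to the hashed input, or generate a random key per logical job instead.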
