Base URL: http://localhost:8000
Interactive docs: Swagger UI | ReDoc
Generate a scene (streaming - recommended):

```bash
curl -X POST http://localhost:8000/api/v1/timepoints/generate/stream \
  -H "Content-Type: application/json" \
  -d '{"query": "Oppenheimer Trinity test control bunker 5:29 AM July 16 1945", "preset": "hyper", "generate_image": true}'
```

Chat with a character:

```bash
curl -X POST http://localhost:8000/api/v1/interactions/{timepoint_id}/chat \
  -H "Content-Type: application/json" \
  -d '{"character": "Oppenheimer", "message": "What did you feel when the sky turned white?"}'
```

Jump forward in time:

```bash
curl -X POST http://localhost:8000/api/v1/temporal/{timepoint_id}/next \
  -H "Content-Type: application/json" \
  -d '{"units": 1, "unit": "hour"}'
```

Control the speed/quality tradeoff with presets:
| Preset | Speed | Quality | Text Model | Provider |
|---|---|---|---|---|
| hyper | ~55s | Good | google/gemini-2.0-flash-001 | OpenRouter |
| balanced | ~90-110s | Better | gemini-2.5-flash | Google Native |
| hd | ~120-150s | Best | gemini-2.5-flash (extended thinking) | Google Native |
| gemini3 | ~60s | Excellent | google/gemini-3-flash-preview | OpenRouter |
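The latency figures in the table can drive a simple client-side preset choice. A minimal sketch under the assumption that the worst-case latencies above hold; the helper is ours, not part of the API:

```python
# Approximate worst-case per-preset latencies (seconds), from the table above.
PRESET_LATENCY_S = {"hyper": 55, "gemini3": 60, "balanced": 110, "hd": 150}

def pick_preset(budget_s: float) -> str:
    """Pick the highest-quality preset whose worst-case latency fits the budget.

    Quality order follows the table (hyper < balanced < hd), with gemini3
    treated as a fast high-quality alternative. Falls back to hyper.
    """
    for preset in ("hd", "balanced", "gemini3", "hyper"):
        if PRESET_LATENCY_S[preset] <= budget_s:
            return preset
    return "hyper"

print(pick_preset(200))  # hd
print(pick_preset(70))   # gemini3
```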
Usage:
```json
{
  "query": "Turing interrogation Wilmslow February 1952",
  "preset": "hyper",
  "generate_image": false
}
```

Model Overrides:
Override preset models for custom configurations:
```json
{
  "query": "Zheng He treasure fleet Malindi harbor 1418",
  "text_model": "google/gemini-2.0-flash-001",
  "image_model": "gemini-2.5-flash-image"
}
```

Permissive Mode (Google-Free):
Use only open-weight, distillable models — zero Google API calls:
```json
{
  "query": "The signing of the Magna Carta, 1215",
  "generate_image": true,
  "model_policy": "permissive"
}
```

Text routes to DeepSeek/Llama/Qwen via OpenRouter, images route to OpenRouter (Flux/Gemini), and Google grounding is skipped. Response metadata reflects the actual models used:
```json
{
  "text_model_used": "deepseek/deepseek-chat-v3-0324",
  "image_model_used": "google/gemini-2.5-flash-image-preview",
  "model_provider": "openrouter",
  "model_permissiveness": "permissive"
}
```

Composing model_policy with explicit models:
model_policy and explicit model names are composable — explicit models take priority:
```json
{
  "query": "Apollo 11 Moon Landing, 1969",
  "model_policy": "permissive",
  "text_model": "qwen/qwen3-235b-a22b",
  "generate_image": true
}
```

This uses the specified Qwen model for text, OpenRouter for images (from the permissive policy), and skips Google grounding.
The llm_params object gives downstream callers fine-grained control over generation hyperparameters. All fields are optional — unset fields use agent/preset defaults. These parameters are applied to every agent in the 14-step pipeline.
```json
{
  "query": "Turing breaks Enigma, 1941",
  "text_model": "deepseek/deepseek-r1-0528",
  "llm_params": {
    "temperature": 0.5,
    "max_tokens": 4096,
    "top_p": 0.9,
    "system_prompt_suffix": "Keep all descriptions under 200 words. Use British English."
  }
}
```

| Parameter | Type | Range | Providers | Description |
|---|---|---|---|---|
| temperature | float | 0.0–2.0 | All | Sampling temperature. Overrides per-agent defaults (which range from 0.2 for factual agents to 0.85 for creative agents). |
| max_tokens | int | 1–32768 | All | Maximum output tokens per agent call. Preset defaults: hyper=1024, balanced=2048, hd=8192. |
| top_p | float | 0.0–1.0 | All | Nucleus sampling — only consider tokens whose cumulative probability is <= top_p. |
| top_k | int | >= 1 | All | Top-k sampling — only consider the k most likely tokens at each step. |
| frequency_penalty | float | -2.0–2.0 | OpenRouter | Penalize tokens proportionally to how often they've appeared in the output. |
| presence_penalty | float | -2.0–2.0 | OpenRouter | Penalize tokens that have appeared at all in the output so far. |
| repetition_penalty | float | 0.0–2.0 | OpenRouter | Multiplicative penalty for repeated tokens. |
| stop | string[] | max 4 | All | Stop sequences — generation halts when any of these strings is produced. |
| thinking_level | string | — | Google | Reasoning depth for thinking models: "none", "low", "medium", "high". |
| system_prompt_prefix | string | max 2000 | All | Text prepended to every agent's system prompt. Use for tone, persona, or style injection. |
| system_prompt_suffix | string | max 2000 | All | Text appended to every agent's system prompt. Use for constraints, formatting rules, or output instructions. |
Notes:
- Parameters marked "OpenRouter" are silently ignored when the request routes to Google (and vice versa for `thinking_level`).
- `system_prompt_prefix` and `system_prompt_suffix` affect all 14 pipeline agents. Use these to inject cross-cutting concerns (e.g., language, tone, verbosity constraints).
- Request-level `llm_params` override per-agent defaults. For example, if `llm_params.temperature` is set, it overrides the judge agent's default of 0.3, the scene agent's default of 0.7, etc.
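The override behavior in the notes amounts to a shallow merge where request-level values win. A sketch of that precedence (the function and the sample agent defaults are illustrative, not Flash internals):

```python
def effective_params(agent_defaults: dict, llm_params: dict) -> dict:
    """Request-level llm_params win over per-agent defaults; unset fields fall through."""
    merged = dict(agent_defaults)
    merged.update({k: v for k, v in llm_params.items() if v is not None})
    return merged

# The judge agent defaults to temperature 0.3 (per the notes above).
judge = effective_params({"temperature": 0.3, "max_tokens": 2048},
                         {"temperature": 0.5, "top_p": 0.9})
print(judge)  # {'temperature': 0.5, 'max_tokens': 2048, 'top_p': 0.9}
```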
| Category | Endpoint | Description |
|---|---|---|
| Auth | POST /api/v1/auth/apple | Apple Sign-In → JWT pair |
| Auth | POST /api/v1/auth/dev/token | Dev admin: create test user → JWT pair |
| Auth | POST /api/v1/auth/refresh | Rotate refresh token |
| Auth | GET /api/v1/auth/me | Current user profile |
| Auth | POST /api/v1/auth/logout | Revoke refresh token |
| Auth | DELETE /api/v1/auth/account | Soft-delete user account |
| Credits | GET /api/v1/credits/balance | Current credit balance |
| Credits | GET /api/v1/credits/history | Paginated transaction ledger |
| Credits | POST /api/v1/credits/admin/grant | Dev admin: grant credits to any user |
| Credits | GET /api/v1/credits/costs | Credit cost table |
| Users | GET /api/v1/users/me/timepoints | User's timepoints (paginated) |
| Users | GET /api/v1/users/me/export | Full GDPR data export |
| Users | POST /api/v1/users/resolve | Find or create user by external_id (service-key protected) |
| Generate | POST /api/v1/timepoints/generate/stream | Create a scene (streaming) - recommended |
| Generate | POST /api/v1/timepoints/generate/sync | Create a scene (blocking) |
| Generate | POST /api/v1/timepoints/generate | Create a scene (background task) |
All generation endpoints run a 14-agent pipeline with critique loop: dialog is reviewed for anachronisms, cultural errors, and voice distinctiveness, and retried if critical issues are found. Characters are capped at 6 with social register-based voice differentiation. Image prompts translate narrative emotion into physicalized body language (~77 words).
When AUTH_ENABLED=true, generation, chat, dialog, survey, and temporal endpoints require a Bearer JWT and deduct credits. Private timepoints return 403 for non-owners. See iOS Integration Guide for full details.
| Get | GET /api/v1/timepoints/{id} | Retrieve a scene |
| Chat | POST /api/v1/interactions/{id}/chat | Talk to a character |
| Time Travel | POST /api/v1/temporal/{id}/next | Jump forward |
| Time Travel | POST /api/v1/temporal/{id}/prior | Jump backward |
| Models | GET /api/v1/models/free | List free OpenRouter models |
| Eval | POST /api/v1/eval/compare | Compare model latencies |
| Eval | POST /api/v1/eval/compare/report | Compare with formatted report |
| Eval | GET /api/v1/eval/models | List eval models and presets |
Generate a scene with real-time progress updates via Server-Sent Events.
Request:
```json
{
  "query": "Oppenheimer watches the Trinity test 5:29 AM July 16 1945",
  "preset": "hyper",
  "generate_image": true
}
```

| Field | Type | Required | Description |
|---|---|---|---|
| query | string | Yes | Historical moment (3-500 chars) |
| generate_image | boolean | No | Generate AI image (default: false) |
| preset | string | No | Quality preset: hd, hyper, balanced (default), gemini3 |
| text_model | string | No | Text model ID — OpenRouter format (org/model) or Google native (gemini-*). Overrides preset. |
| image_model | string | No | Image model ID — OpenRouter format (org/model) or Google native. Overrides preset. |
| model_policy | string | No | "permissive" — selects only open-weight models (Llama, DeepSeek, Qwen) and skips Google-dependent steps. Fully Google-free. Works alongside explicit model overrides. |
| llm_params | object | No | Fine-grained LLM parameters applied to all pipeline agents. See LLM Parameters below. |
| visibility | string | No | public (default) or private — controls who can see full data |
| callback_url | string | No | URL to POST results to when generation completes (async endpoint only) |
| request_context | object | No | Opaque context passed through to response (e.g. {"source": "clockchain", "job_id": "..."}) |
Model selection priority (highest first):
1. Explicit `text_model` / `image_model`: use exactly these models
2. `model_policy: "permissive"`: auto-select open-weight models, skip Google grounding
3. `preset`: use the preset's default models
4. Server defaults
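The priority order above can be sketched as a small resolver. This is a behavioral illustration only; the permissive-mode model IDs below are taken from the earlier response example, and the actual server-side selection may differ:

```python
def resolve_models(request: dict, preset_models: dict, server_defaults: dict) -> dict:
    """Resolve text/image models using the documented priority (highest first)."""
    permissive = request.get("model_policy") == "permissive"
    # Illustrative open-weight picks for permissive mode (from the example response).
    policy_models = {"text": "deepseek/deepseek-chat-v3-0324",
                     "image": "google/gemini-2.5-flash-image-preview"}

    def pick(kind: str) -> str:
        explicit = request.get(f"{kind}_model")      # 1. explicit override
        if explicit:
            return explicit
        if permissive:                               # 2. model_policy
            return policy_models[kind]
        if request.get("preset"):                    # 3. preset defaults
            return preset_models[kind]
        return server_defaults[kind]                 # 4. server defaults

    return {"text": pick("text"), "image": pick("image"),
            "skip_google_grounding": permissive}
```

For example, combining `model_policy: "permissive"` with an explicit `text_model` yields the explicit text model, the policy's image model, and grounding skipped, matching the composability example above.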
Response: SSE stream with events:
```
data: {"event": "start", "step": "initialization", "progress": 0}
data: {"event": "step_complete", "step": "judge", "progress": 10}
data: {"event": "step_complete", "step": "timeline", "progress": 20}
data: {"event": "step_complete", "step": "scene", "progress": 30}
data: {"event": "step_complete", "step": "characters", "progress": 50}
data: {"event": "step_complete", "step": "moment", "progress": 65}
data: {"event": "step_complete", "step": "camera", "progress": 65}
data: {"event": "step_complete", "step": "dialog", "progress": 80}
data: {"event": "step_complete", "step": "image_prompt", "progress": 90}
data: {"event": "step_complete", "step": "image_generation", "progress": 100}
data: {"event": "done", "progress": 100, "data": {"timepoint_id": "abc123", "slug": "...", "status": "completed"}}
```
Note: The image_generation step only appears when generate_image: true. Without it, done follows image_prompt directly.
Generate a scene synchronously. Blocks until complete (30-120 seconds).
Request: Same as streaming endpoint.
Response: Full TimepointResponse object.
Start background generation. Returns immediately with timepoint ID.
Note: Poll GET /api/v1/timepoints/{id} for completion status. Alternatively, provide a callback_url — Flash will POST the full result to that URL when generation completes.
Request: Same as streaming endpoint. Additionally supports callback_url and request_context.
When callback_url is provided, Flash POSTs the result on completion:
```json
{
  "timepoint": { /* full TimepointResponse */ },
  "preset_used": "balanced",
  "generation_time_ms": 95000,
  "request_context": { /* echoed back from request */ }
}
```

On failure, a minimal error payload is POSTed instead:
```json
{
  "id": "550e8400-...",
  "status": "failed",
  "error": "...",
  "request_context": { /* echoed back */ }
}
```

Response:
```json
{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "status": "processing",
  "message": "Generation started for 'Oppenheimer watches the Trinity test'",
  "request_context": null
}
```

Get a completed scene.
Query Params:
| Name | Type | Default | Description |
|---|---|---|---|
| full | boolean | false | Include full metadata (scene, characters, dialog) |
| include_image | boolean | false | Include base64 image data |
Response:
```json
{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "query": "Oppenheimer watches the Trinity test",
  "status": "completed",
  "year": 1945,
  "month": 7,
  "day": 16,
  "season": "summer",
  "time_of_day": "pre-dawn",
  "location": "Control bunker S-10000, Jornada del Muerto, New Mexico",
  "has_image": true,
  "image_url": "data:image/jpeg;base64,...",
  "text_model_used": "gemini-2.5-flash",
  "image_model_used": "gemini-2.5-flash-image",
  "visibility": "public",
  "share_url": "https://timepointai.com/t/oppenheimer-trinity-abc123",
  "characters": {
    "characters": [
      {"name": "J. Robert Oppenheimer", "role": "primary", "description": "..."},
      {"name": "Kenneth Bainbridge", "role": "secondary", "description": "..."}
    ]
  },
  "dialog": [
    {"speaker": "Bainbridge", "text": "Now we are all sons of bitches."}
  ],
  "scene": {"setting": "...", "atmosphere": "..."},
  "image_prompt": "..."
}
```

List scenes with pagination. Visibility filtering is applied automatically:
- Anonymous: sees only public timepoints.
- Authenticated: sees public + own private timepoints.
- Explicit `?visibility=`: overrides the default (private is still restricted to the owner).
Query Params:
| Name | Type | Default | Description |
|---|---|---|---|
| page | int | 1 | Page number |
| page_size | int | 20 | Items per page |
| status | string | null | Filter by status (completed, failed, processing) |
| visibility | string | null | Filter by visibility (public or private) |
Update a timepoint's visibility. Owner-only (or open when AUTH_ENABLED=false).
Request:
```json
{
  "visibility": "private"
}
```

Response: Full TimepointResponse with updated visibility.
| Status | Meaning |
|---|---|
| 200 | Success |
| 400 | Invalid visibility value |
| 403 | Not the owner |
| 404 | Timepoint not found |
Delete a scene.
Chat with a character from a scene.
Request:
```json
{
  "character": "Benjamin Franklin",
  "message": "What do you think of this document?"
}
```

| Field | Type | Required | Description |
|---|---|---|---|
| character | string | Yes | Character name (case-insensitive) |
| message | string | Yes | Your message |
| session_id | string | No | Continue existing conversation |
Response:
```json
{
  "character_name": "Benjamin Franklin",
  "response": "My dear friend, this document represents our highest aspirations...",
  "emotional_tone": "thoughtful",
  "session_id": "sess_123"
}
```

Same as above, but streams the response token-by-token.
Generate more dialog between characters.
Request:
```json
{
  "num_lines": 5,
  "prompt": "They discuss the risks of signing"
}
```

Ask all characters the same question.
Request:
```json
{
  "questions": ["What do you fear most about this moment?"],
  "include_summary": true
}
```

Response:
```json
{
  "responses": [
    {
      "character_name": "John Adams",
      "response": "That we shall all hang for this...",
      "sentiment": "negative",
      "emotional_tone": "anxious"
    }
  ],
  "summary": "The founders express a mixture of fear and determination..."
}
```

Generate a scene at a later point in time, preserving characters and context.
Request:
```json
{
  "units": 1,
  "unit": "hour"
}
```

| Field | Type | Default | Options |
|---|---|---|---|
| units | int | 1 | 1-365 |
| unit | string | "day" | second, minute, hour, day, week, month, year |
Response:
```json
{
  "source_id": "550e8400-...",
  "target_id": "661f9511-...",
  "source_year": 1969,
  "target_year": 1969,
  "direction": "next",
  "units": 1,
  "unit": "hour",
  "message": "Generated moment 1 hour(s) forward"
}
```

Use GET /api/v1/timepoints/{target_id}?full=true to retrieve the generated scene.
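The units/unit pair maps naturally onto date arithmetic. A sketch of the offset for the fixed-length units only; how the server handles month and year jumps (which need calendar-aware math) is not specified here, so they are omitted:

```python
from datetime import datetime, timedelta

# timedelta equivalents for the fixed-length units; month/year need calendar math.
UNIT_DELTAS = {"second": timedelta(seconds=1), "minute": timedelta(minutes=1),
               "hour": timedelta(hours=1), "day": timedelta(days=1),
               "week": timedelta(weeks=1)}

def jump(moment: datetime, units: int, unit: str) -> datetime:
    """Offset a scene's timestamp the way /temporal/{id}/next describes (forward)."""
    if not 1 <= units <= 365:
        raise ValueError("units must be 1-365")
    return moment + units * UNIT_DELTAS[unit]

print(jump(datetime(1969, 7, 20, 20, 17), 1, "hour"))  # 1969-07-20 21:17:00
```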
Same as above, but backward in time. Response has "direction": "prior".
Get all linked scenes (prior and next). When a timepoint has multiple children from separate time-jumps, the most recently created child is followed.
Query Params:
| Name | Default | Options |
|---|---|---|
| direction | "both" | prior, next, both |
| limit | 10 | 1-50 |
Response:
```json
{
  "center": {"id": "...", "year": 1776, "slug": "declaration-signing-abc123"},
  "prior": [],
  "next": [
    {"id": "...", "year": 1776, "slug": "one-hour-after-declaration-def456"}
  ]
}
```

List available AI models.
Query Params:
| Name | Type | Default | Description |
|---|---|---|---|
| fetch_remote | boolean | false | Fetch live models from OpenRouter |
| free_only | boolean | false | Only return free models |
Get available free models from OpenRouter. The best and fastest picks are auto-selected.
Response:
```json
{
  "best": {
    "id": "qwen/qwen3-next-80b-a3b-instruct:free",
    "name": "Qwen3 Next 80B (free)",
    "context_length": 40960,
    "is_free": true
  },
  "fastest": {
    "id": "liquid/lfm-2.5-1.2b-thinking:free",
    "name": "LFM 2.5 1.2B Thinking (free)",
    "context_length": 32768,
    "is_free": true
  },
  "all_free": [...],
  "total": 30
}
```

Note: Free model availability changes frequently on OpenRouter. The best/fastest picks are determined dynamically.
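If you need a similar pick client-side from the `all_free` list, one plausible heuristic is to rank by context window. This is our guess at a reasonable selection rule, not the documented server logic:

```python
def pick_best(models: list[dict]) -> dict:
    """Heuristic 'best' pick: the free model with the largest context window.
    (The server's actual selection logic is not documented.)"""
    free = [m for m in models if m.get("is_free")]
    return max(free, key=lambda m: m["context_length"])

models = [
    {"id": "qwen/qwen3-next-80b-a3b-instruct:free", "context_length": 40960, "is_free": True},
    {"id": "liquid/lfm-2.5-1.2b-thinking:free", "context_length": 32768, "is_free": True},
]
print(pick_best(models)["id"])  # qwen/qwen3-next-80b-a3b-instruct:free
```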
Check which providers (Google, OpenRouter) are configured and their model counts.
Response:
```json
{
  "providers": [
    {
      "provider": "google",
      "available": true,
      "models_count": 3,
      "default_text_model": "gemini-2.5-flash",
      "default_image_model": "gemini-2.5-flash-image"
    },
    {
      "provider": "openrouter",
      "available": true,
      "models_count": 300,
      "default_text_model": "anthropic/claude-3.5-sonnet",
      "default_image_model": "gemini-2.5-flash-image"
    }
  ]
}
```

Compare model latency and performance across providers.
Run the same prompt across multiple models in parallel. Returns raw JSON results.
Request:
```json
{
  "query": "Kasparov Deep Blue Game 6 1997",
  "preset": "verified"
}
```

| Field | Type | Required | Description |
|---|---|---|---|
| query | string | Yes | Prompt to send to all models |
| preset | string | No | Model set: verified (default), google_native, openrouter, all |
| models | array | No | Specific model configs (alternative to preset) |
| prompt_type | string | No | Prompt type label (default: text) |
| timeout_seconds | int | No | Max time per model, 10-600 (default: 120) |
Response:
```json
{
  "query": "Kasparov Deep Blue Game 6 1997",
  "prompt_type": "text",
  "timestamp": "2026-02-05T10:47:29Z",
  "total_duration_ms": 16045,
  "models_tested": 4,
  "fastest_model": "google/gemini-3-flash-preview",
  "slowest_model": "gemini-2.5-flash",
  "success_count": 4,
  "failure_count": 0,
  "success_rate": 100.0,
  "latency_stats": {
    "min_ms": 6453,
    "max_ms": 15890,
    "avg_ms": 10150,
    "median_ms": 9240,
    "range_ms": 9437
  },
  "ranking": [
    "google/gemini-3-flash-preview",
    "google/gemini-2.0-flash-001",
    "gemini-2.5-flash",
    "gemini-2.5-flash-thinking"
  ],
  "results": [
    {
      "model_id": "google/gemini-3-flash-preview",
      "provider": "openrouter",
      "label": "Gemini 3 Flash Preview",
      "success": true,
      "latency_ms": 6453,
      "output_length": 2847,
      "output_preview": "This is a valid historical query..."
    }
  ]
}
```

Same as /compare, but returns both JSON data and a formatted ASCII report.
Request: Same as /compare.
Response:
```json
{
  "comparison": { ... },
  "report": "╔══════════════════════════════════════╗\n║ MODEL COMPARISON RESULTS ║\n..."
}
```

The report field contains a human-readable table with rankings, latencies, and success/failure indicators. Useful for CLI display or logging.
List available models and presets for evaluation.
Response:
```json
{
  "presets": {
    "verified": 4,
    "google_native": 2,
    "openrouter": 2,
    "all": 4
  },
  "models": [
    {
      "model_id": "gemini-2.5-flash",
      "provider": "google",
      "label": "Gemini 2.5 Flash"
    },
    {
      "model_id": "google/gemini-3-flash-preview",
      "provider": "openrouter",
      "label": "Gemini 3 Flash Preview"
    }
  ]
}
```

Flash supports service-key authentication for calls from other TIMEPOINT services (billing, clockchain).
Header: X-Service-Key: {FLASH_SERVICE_KEY}
Three auth paths are evaluated in order by get_current_user:
| Priority | Headers | Behavior | Use Case |
|---|---|---|---|
| 1 | X-Service-Key + X-User-ID | Validates key, looks up user by UUID or external_id | Billing relays user requests (credits deducted) |
| 2 | X-Service-Key only | Validates key, returns no user context | Clockchain system calls (unmetered) |
| 3 | Authorization: Bearer <JWT> | Validates JWT, returns authenticated user | Direct user auth (iOS app) |
When AUTH_ENABLED=false and no service key is provided, all endpoints are open-access.
Admin operations (credit grants, dev tokens) use a separate X-Admin-Key header matching ADMIN_API_KEY.
Auth endpoints are always available but only functional when AUTH_ENABLED=true.
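The three-path priority above can be sketched as a header-inspection routine. This is a behavioral illustration only; the real `get_current_user` also verifies JWT signatures, checks expiry, and looks users up in the database:

```python
def resolve_auth(headers: dict, service_key: str) -> dict:
    """Mirror the three auth paths above, evaluated in priority order.
    (A sketch: real validation of keys/JWTs is elided.)"""
    if headers.get("X-Service-Key") == service_key:
        user_id = headers.get("X-User-ID")
        if user_id:
            return {"path": 1, "user": user_id, "metered": True}   # billing relay
        return {"path": 2, "user": None, "metered": False}         # system call
    if headers.get("Authorization", "").startswith("Bearer "):
        return {"path": 3, "user": "from-jwt", "metered": True}    # direct JWT
    return {"path": None, "user": None, "metered": False}          # anonymous
```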
Create a test user (or find existing by email) and return a JWT pair. Requires X-Admin-Key header matching the ADMIN_API_KEY env var. Returns 403 if the key is missing, wrong, or ADMIN_API_KEY is not set.
On first creation, the user gets signup credits (default 50).
Request:
```json
{
  "email": "test@example.com",
  "display_name": "Test User"
}
```

| Field | Type | Required | Description |
|---|---|---|---|
| email | string | Yes | Email for the test user |
| display_name | string | No | Optional display name |
Headers:
X-Admin-Key: your-admin-key
Response: Same shape as /auth/apple.
Verify an Apple identity token and return a JWT pair. Creates a new user on first sign-in and grants signup credits.
Request:
```json
{
  "identity_token": "eyJhbGciOiJSUzI1NiIs..."
}
```

Response:
```json
{
  "access_token": "eyJhbGciOiJIUzI1NiIs...",
  "refresh_token": "abc123...",
  "token_type": "bearer",
  "expires_in": 900
}
```

Rotate a refresh token and return a new JWT pair. The old refresh token is revoked.
Request:
```json
{
  "refresh_token": "abc123..."
}
```

Response: Same shape as /auth/apple.
Return the current user's profile. Requires Bearer JWT.
Response:
```json
{
  "id": "550e8400-...",
  "email": "user@example.com",
  "display_name": null,
  "created_at": "2026-02-09T12:00:00Z"
}
```

Revoke a refresh token. Always returns 200.
Request:
```json
{
  "refresh_token": "abc123..."
}
```

Response:

```json
{
  "detail": "Logged out"
}
```

Soft-delete user account. Sets is_active=false and revokes all refresh tokens. Required for App Store compliance. Requires Bearer JWT.
Response:
```json
{
  "detail": "Account deactivated"
}
```

Current credit balance. Requires Bearer JWT.
Response:
```json
{
  "balance": 45,
  "lifetime_earned": 50,
  "lifetime_spent": 5
}
```

Paginated transaction ledger. Requires Bearer JWT.
Query Params:
| Name | Type | Default |
|---|---|---|
| limit | int | 20 |
| offset | int | 0 |
Response:
```json
[
  {
    "amount": -5,
    "balance_after": 45,
    "type": "generation",
    "description": "Scene generation (balanced)",
    "created_at": "2026-02-09T12:00:00Z"
  }
]
```

Grant credits to any user by user ID. Requires X-Admin-Key header matching the ADMIN_API_KEY env var. Returns 403 if the key is missing, wrong, or ADMIN_API_KEY is not set.
Request:
```json
{
  "user_id": "550e8400-...",
  "amount": 100,
  "transaction_type": "stripe_purchase",
  "description": "Stripe purchase: 100 credits ($9.99)"
}
```

| Field | Type | Required | Description |
|---|---|---|---|
| user_id | string | Yes | Target user UUID |
| amount | int | Yes | Credits to grant (must be > 0) |
| transaction_type | string | No | Ledger transaction type (default: admin_grant). Valid: admin_grant, apple_iap, stripe_purchase, subscription_grant, refund, signup_bonus |
| description | string | No | Ledger note (default: "Manual top-up") |
Headers:
X-Admin-Key: your-admin-key
Response:
```json
{
  "balance": 150,
  "granted": 100
}
```

Credit cost table. No auth required.
Response:
```json
{
  "costs": {
    "generate_balanced": 5,
    "generate_hd": 10,
    "generate_hyper": 5,
    "generate_gemini3": 5,
    "chat": 1,
    "temporal_jump": 2
  }
}
```

Paginated list of the authenticated user's timepoints. Requires Bearer JWT.
Query Params:
| Name | Type | Default | Description |
|---|---|---|---|
| page | int | 1 | Page number |
| page_size | int | 20 | Items per page (max 100) |
| status | string | null | Filter by status (completed, failed, processing) |
Response:
```json
{
  "items": [
    {
      "id": "550e8400-...",
      "query": "Oppenheimer Trinity test 1945",
      "slug": "oppenheimer-trinity-test-a1b2c3",
      "status": "completed",
      "year": 1945,
      "location": "Jornada del Muerto, New Mexico",
      "has_image": true,
      "created_at": "2026-02-09T12:00:00Z"
    }
  ],
  "total": 42,
  "page": 1,
  "page_size": 20
}
```

Find or create a user by external_id (Auth0 sub or other external identity provider ID). Service-key protected — requires X-Service-Key header matching FLASH_SERVICE_KEY.
Headers:
X-Service-Key: your-flash-service-key
Request:
```json
{
  "external_id": "auth0|abc123",
  "email": "user@example.com",
  "display_name": "Jane Doe"
}
```

| Field | Type | Required | Description |
|---|---|---|---|
| external_id | string | Yes | Auth0 sub or other external provider ID |
| email | string | No | User email (set on create only) |
| display_name | string | No | Display name (set on create only) |
Response:
```json
{
  "user_id": "550e8400-e29b-41d4-a716-446655440000",
  "created": true
}
```

| Status | Meaning |
|---|---|
| 200 | User found or created |
| 403 | Invalid service key |
| 503 | FLASH_SERVICE_KEY not configured |
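The find-or-create contract can be modeled in a few lines. A behavioral sketch only: an in-memory dict stands in for the database, and note that email/display_name apply on create but never overwrite an existing user:

```python
def resolve_user(db: dict, external_id: str, email=None, display_name=None) -> dict:
    """Find-or-create semantics of /users/resolve (sketch; dict stands in for the DB)."""
    if external_id in db:                       # found: never overwrite profile fields
        return {"user_id": db[external_id]["id"], "created": False}
    db[external_id] = {"id": f"user-{len(db) + 1}",  # illustrative ID scheme
                       "email": email, "display_name": display_name}
    return {"user_id": db[external_id]["id"], "created": True}
```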
Full JSON export of user data for GDPR Subject Access Request compliance. Returns profile, complete credit history, and full scene JSON for every user timepoint. Requires Bearer JWT.
Response:
```json
{
  "user": {
    "id": "550e8400-...",
    "email": "user@example.com",
    "display_name": null,
    "created_at": "2026-02-09T12:00:00Z",
    "last_login_at": "2026-02-09T12:00:00Z",
    "is_active": true
  },
  "credit_history": [...],
  "timepoints": [...]
}
```

Health check response:

```json
{
  "status": "healthy",
  "version": "2.4.0",
  "database": true,
  "providers": {
    "google": true,
    "openrouter": true
  }
}
```

TIMEPOINT Flash uses a dual-provider architecture with automatic failover:
- Primary: Google Gemini (native API)
- Fallback: OpenRouter (300+ models)
When Google API quota is exhausted or rate-limited:
- Quota exhaustion (daily limit = 0) - Immediate fallback, no retries
- Rate limiting (temporary) - Retry with exponential backoff, then fallback
| Error | Retries | Action |
|---|---|---|
| QuotaExhaustedError | 0 | Instant fallback to OpenRouter |
| RateLimitError | Up to 3 | Exponential backoff, then fallback |
| AuthenticationError | 0 | Fail with 401 |
| ProviderError | Up to 3 | Retry, then fallback |
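The retry policy in the table can be sketched as a small wrapper. The exception class names match the table, but this is an illustration of the policy, not Flash's actual implementation:

```python
import time

class QuotaExhaustedError(Exception): pass
class RateLimitError(Exception): pass

def call_with_fallback(primary, fallback, max_retries=3):
    """Policy sketch from the table above: quota exhaustion falls back
    immediately (no retries wasted); rate limits retry with exponential
    backoff before falling back."""
    for attempt in range(max_retries):
        try:
            return primary()
        except QuotaExhaustedError:
            break                      # 0 retries: instant fallback
        except RateLimitError:
            time.sleep(2 ** attempt)   # 1s, 2s, 4s backoff, then retry
    return fallback()
```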
Image generation uses a 2-tier fallback chain:
| Priority | Provider | Details |
|---|---|---|
| 1 | Google Imagen | Native API, highest quality. Quota exhaustion = instant fallback. |
| 2 | OpenRouter | Via /chat/completions with modalities: ["image", "text"]. Best available model auto-selected. |
Behavior:
- Quota exhaustion on Google = immediate fallback to OpenRouter (no retries wasted)
- In permissive mode, images route directly to OpenRouter (Google-free)
- Scene completes with image from whichever provider succeeds
All errors return:
```json
{"detail": "Error message"}
```

| Code | Meaning |
|---|---|
| 400 | Invalid request state |
| 401 | Unauthorized — missing/invalid/expired JWT (when AUTH_ENABLED=true) |
| 402 | Payment Required — insufficient credits for the operation |
| 403 | Forbidden — private timepoint and requester is not the owner |
| 404 | Not found |
| 422 | Validation error |
| 429 | Rate limit exceeded (triggers fallback internally) |
| 500 | Server error |
Rate limit: 60 requests/minute per IP.
- No API `preset: "free"` option. The API does not have a built-in "free" preset. However, free models ARE fully supported:
  - CLI: `demo.sh` has built-in free model selection (preset options 5/6) and a "RAPID TEST FREE" menu option
  - API: Use the `text_model` override with free model IDs from `/api/v1/models/free` (e.g., `google/gemini-2.0-flash-001:free`)
Last updated: 2026-03-11