Accounts & authentication
How do I authenticate API requests?
Send your API key in the Authorization header with the Key scheme: Authorization: Key <your_key>. See API keys for how to create and rotate keys.
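As a minimal sketch of the header format described above, the snippet below builds (but does not send) a request that carries the Key scheme. The base URL and path are hypothetical placeholders, not values from this documentation; only the `Authorization: Key <your_key>` shape comes from the answer above.

```python
import urllib.request

# Hypothetical base URL, for illustration only; take the real one from the API reference.
BASE_URL = "https://api.example.invalid"


def build_request(path: str, api_key: str) -> urllib.request.Request:
    """Return an unsent GET request carrying the Key-scheme Authorization header."""
    return urllib.request.Request(
        BASE_URL + path,
        headers={"Authorization": f"Key {api_key}"},
        method="GET",
    )


# The path below is a placeholder as well.
req = build_request("/requests/example-id", "my_key")
print(req.get_header("Authorization"))  # prints "Key my_key"
```

Keeping the header construction in one helper makes key rotation a one-line change on the client side.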
Can I use ModelRunner directly from the browser?
/api/modelrunner/proxy.
Can my AI assistant call ModelRunner directly?
Billing & limits
How is billing calculated?
Requests are billed only when they reach COMPLETED. Failed and cancelled requests are not billed.
Do I pay if a request fails?
A request that ends in FAILED or CANCELLED is not charged. A request whose output failed schema validation (422) is also not charged: you see the upstream error and your balance is unaffected.
What happens when I run out of credits?
The API responds with 402 Insufficient Balance. Top up credits and retry the request; no other action is needed.
Are there rate limits?
When a provider returns 429, the API passes it through so your client can back off.
Files & inputs
What file types can I upload?
What matters is whether the target model accepts the content_type you declare on the upload. Common types include image/png, image/jpeg, image/webp, video/mp4, video/webm, audio/mpeg, audio/wav, and application/pdf.
What's the largest file I can upload?
The upload_file tool caps inline payloads at 200 MiB; larger files should use the direct multipart endpoints.
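The 200 MiB cap is a binary unit, so it is 209,715,200 bytes rather than 200,000,000. The sketch below picks an upload path from a file size. It assumes the cap is inclusive, and the route names "inline" and "multipart" are labels made up for this example, not API values.

```python
MIB = 1024 * 1024
INLINE_CAP_BYTES = 200 * MIB  # 209,715,200 bytes


def upload_route(size_bytes: int) -> str:
    """Pick an upload path by size; assumes a payload of exactly 200 MiB is still allowed inline."""
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    return "inline" if size_bytes <= INLINE_CAP_BYTES else "multipart"


print(upload_route(50 * MIB))   # prints "inline"
print(upload_route(250 * MIB))  # prints "multipart"
```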
How long are uploaded files retained?
Uploaded files can be listed and deleted through the files endpoints (GET /files, DELETE /files/:id).
Requests & lifecycle
Does ModelRunner support webhooks?
Not yet. Use the SSE stream (GET /requests/stream) for push-style updates without polling, or poll status_url directly. Webhook delivery is on the roadmap.
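For readers who have not consumed Server-Sent Events before, here is a minimal sketch of parsing a text/event-stream body into event payloads, following the generic SSE wire format. It is not specific to ModelRunner: the payload structure of GET /requests/stream is not described here, so the sample lines are invented, and the parser ignores the event:, id:, and retry: fields.

```python
from typing import Iterable, Iterator


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each complete event in a text/event-stream."""
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # The format strips a single leading space after the colon.
            buffer.append(value[1:] if value.startswith(" ") else value)
        # Other fields (event:, id:, retry:) are ignored in this sketch.


# Invented sample stream; real payloads may look different.
sample = ["data: first\n", "\n", ": keep-alive\n", "data: second\n", "\n"]
print(list(iter_sse_data(sample)))  # prints ['first', 'second']
```

In practice the `lines` iterable would come from a streaming HTTP response decoded as UTF-8. An event still in the buffer when the stream ends is dropped, matching the SSE format's treatment of unterminated events.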
What's my position in the queue?
The queue_position field in status responses is currently always 0; real queue depth is not tracked yet. Use the SSE stream to receive updates the moment your request transitions.
How long can a request stay in flight?
Can I cancel a running request?
Send a GET request to the cancel_url returned when you created the request. Cancellation is immediate if the provider hasn't started and best-effort otherwise. See request lifecycle.
What happens if my client crashes mid-request?
Fetch the request with GET /requests/{requestId} or list your history.
Errors
What does `422` mean on a result fetch?
The request's output failed schema validation. Check error for the human-readable failure and details.validationErrors for per-field issues. You are not billed. See errors.
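A minimal sketch of reading those two fields from a 422 response body follows. It assumes the body is JSON and that details.validationErrors is a list. The shape of each list entry is not specified here, so the sketch prints entries as-is rather than assuming field names, and the sample body is invented.

```python
import json


def summarize_validation_failure(body: str) -> str:
    """Build a short report from the `error` and `details.validationErrors` fields."""
    payload = json.loads(body)
    lines = [str(payload.get("error", "unknown error"))]
    issues = (payload.get("details") or {}).get("validationErrors") or []
    for issue in issues:
        # Entry shape is an assumption-free pass-through: strings verbatim, anything else as JSON.
        text = issue if isinstance(issue, str) else json.dumps(issue, sort_keys=True)
        lines.append(f"  - {text}")
    return "\n".join(lines)


# Invented sample body for illustration only.
sample = json.dumps({
    "error": "Output failed schema validation",
    "details": {"validationErrors": ["images: expected array"]},
})
print(summarize_validation_failure(sample))
```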
Which errors are safe to retry?
Retry 429 (with backoff) and transient 5xx. Treat 400, 401, 402, 403, 404, and 422 as deterministic failures: retrying with the same input will produce the same error.
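The rule above can be sketched as a small status classifier plus a backoff helper. Two assumptions are baked in: every 5xx is treated as transient, since this page does not say which 5xx codes qualify, and the backoff uses full-jitter exponential delays, which is a common client-side choice rather than anything this API prescribes. The function names and default values are illustrative.

```python
import random

# Status codes listed above as deterministic failures.
DETERMINISTIC = frozenset({400, 401, 402, 403, 404, 422})


def is_retryable(status: int) -> bool:
    """Return True for 429 and, by assumption, any 5xx status."""
    if status in DETERMINISTIC:
        return False
    return status == 429 or 500 <= status <= 599


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 30.0) -> float:
    """Full-jitter backoff: a random delay in [0, min(cap, base * 2**attempt)] seconds."""
    return random.uniform(0.0, min(cap, base * (2 ** attempt)))


print(is_retryable(429))  # prints True
print(is_retryable(422))  # prints False
```

A caller would sleep for `backoff_delay(attempt)` between attempts and stop after a fixed number of tries, so a persistent outage does not turn into an unbounded retry loop.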
