Estimate embedding credits
Estimate credits + per-modality breakdown WITHOUT dispatching. Use this BEFORE a real POST /v1/embeddings to preview cost. Same DTO shape as POST /v1/embeddings minus encoding_format and user (irrelevant when no charge is made).
Estimates are upper bounds based on pre-fetch input; actual credits_charged from POST /v1/embeddings may differ slightly for URL-fetched media (image / video bytes whose server-side tokenization can be more or less aggressive than the local estimator).
Authorizations
Your team API key (starts with al_live_).
Headers
Optional API version pin (YYYY-MM-DD). Defaults to your team's pinned version, or the system default 2026-05-15 for unauthenticated requests.
^\d{4}-\d{2}-\d{2}$"2026-05-15"
Body
Public model slug (e.g. "aurous-embed-vision"). Pass exactly as listed by GET /v1/models.
"aurous-embed-vision"
Input — accepts a string OR an array of content parts ({type: "text"|"image_url"|"video_url"}) for multimodal. String-array (string[]) batch input is NOT accepted on v1 (same rules as POST /v1/embeddings).
Output vector dimensions. Most models return a fixed dimension and reject this parameter. If the model does not support dimensions, the estimate returns 400 embeddings_unsupported_dimensions — identical to the dispatch path, so an estimate and a real request fail at the same gate.
1024
Response
Estimate produced.
Always true — distinguishes this from a real /v1/embeddings response (which uses object: 'list').
true
Estimated token counts per modality + total.
Estimated credits for this request. Real credits_charged from POST /v1/embeddings may differ slightly for URL-fetched media (image / video tokenization variance), but the estimate is generally a tight upper bound.
0.19125
Per-modality credit decomposition + model echo.

