Create embeddings
Create embeddings from text and/or visual content (images, video). Multimodal input is combined into a SINGLE embedding (the underlying model concatenates parts into one document representation). For OpenAI-style N→N batch embedding, loop client-side: send one request per item.
Returns the OpenAI-compatible envelope (object: "list", data: [{ embedding, index, object }], model, usage) plus the Aurous usage extension carrying credits_charged and a per-modality breakdown (input: { text, visual, video }) so you can correlate charge to input. credits_charged is authoritative. Idempotency-Key is always honored.
Authorizations
Your team API key (starts with al_live_).
Headers
Stripe-style idempotency key. Replays return the cached response with Aurous-Idempotent-Replayed: true. Same key + different canonical body returns 409 idempotency_key_in_use. Replay window is 24 hours.
Optional API version pin (YYYY-MM-DD). Defaults to your team's pinned version, or the system default 2026-05-15 for unauthenticated requests.
^\d{4}-\d{2}-\d{2}$"2026-05-15"
Body
Public model slug (e.g. "aurous-embed-vision"). Pass exactly as listed by GET /v1/models.
"aurous-embed-vision"
Input — accepts a string OR an array of content parts ({type: "text"|"image_url"|"video_url"}) for multimodal. String-array (string[]) batch input is NOT accepted on v1: the underlying model concatenates batched text into a single embedding, so a customer expecting OpenAI-style N→N would get one combined vector. Loop client-side or pass a multimodal-parts array (returns one combined embedding).
Output vector dimensions. Most models return a fixed dimension and reject this parameter. If the model does not support dimensions, the request returns 400 embeddings_unsupported_dimensions.
1024
Vector encoding format. Accepted for OpenAI SDK compatibility (the Node SDK sends base64 by default); v1 always returns float vectors regardless of the value sent. base64 support (returning base64-encoded float buffers) is reserved for a future release.
float, base64 End-user identifier for your records (optional). Stored on the inference row; no Aurous-side behavior. Mirrors OpenAI compat.
256Response
Embedding created.
OpenAI envelope discriminator.
list Embedding items. Always a single-element array on v1.
Model slug used for this request (echoed from the request).
"aurous-embed-vision"
Token + credit accounting for this request.

