OpenAIAnthropicGooglexAIPerplexity
← Back to all updates
Published ·xAI• Minor

xAI added per-request priority scheduling for text inference endpoints, with applied tier reported in responses.

The read

Priority is request scoped, so teams can reserve higher scheduling priority for selected text calls and pay only when applied.

The frame

For builders using Chat Completions or Responses, scheduling priority becomes an input parameter with response-side confirmation.

Primary source
xAI
Keep reading
See what Nova3 builds
Related updates

This is the world we build in. It moves this fast. We move with it.

Bring us your project