OpenAIAnthropicGooglexAIPerplexity
← Back to all updatesPublished ·Anthropic• Minor
Anthropic released a research preview of Fast mode for Claude Opus 4.8 on the Claude API.
The read
Speed is the primary constraint for Opus models and this preview targets latency for high intelligence tasks.
The frame
The restriction on sampling parameters requires developers to use default model behavior while testing these faster inference speeds.
Primary source
Anthropic →Keep reading
See what Nova3 builds →Related updates
- Jun 06, 2026Anthropic updated prompt caching to automatically read from the longest previously cached prefix when breakpoints are set.
- Jun 06, 2026Anthropic optimized prompt caching support in the Message Batches API to improve cache hit rates.
- Jun 06, 2026Anthropic added a delete endpoint to the Message Batches API for managing batch processing tasks.
This is the world we build in. It moves this fast. We move with it.
Bring us your project →