OpenAIAnthropicGooglexAIPerplexity
← Back to all updates
Published ·Anthropic▲ Material

Anthropic added max_tokens for the advisor tool to cap advisor model output per call.

The read

This matters when advisor responses do not need full length, cutting latency and output token cost.

The frame

Set tools[].max_tokens on the advisor tool definition when workloads need capped advisor output.

Primary source
Anthropic
Keep reading
See what Nova3 builds
Related updates

This is the world we build in. It moves this fast. We move with it.

Bring us your project