How the math works
Within each task
Parallelism doesn’t stop at the task level. Inside each skill run, independent tool calls fire simultaneously:Concurrency controls
Parallelism is managed at two levels to protect against rate limit cascades:| Level | Limit | Reason |
|---|---|---|
| Skills per session | Max 2 concurrent | Prevents token-per-minute overflow |
| Tool calls per skill turn | Unlimited parallel | Independent ops always concurrent |
retry-after header, falls back to exponential backoff (1s → 2s → 4s), and retries up to 3 times before surfacing an error.