Skip to main content

Our approach to rate limits for the Claude API

Updated today

Your rate limit depends on your usage tier, and is currently measured in three key metrics:

  1. Requests per minute (RPM)

  2. Input tokens per minute (ITPM)

  3. Output tokens per minute (OTPM)

If you exceed any of these rate limits, you will get a 429 error describing which rate limit was exceeded, along with a retry-after header indicating how long to wait.

Rate limits are set at the organization level and are defined by usage tiers. Each tier has different spend and rate limits, with automatic tier advancement based on usage thresholds up to Tier 4.

You can view your organization's current tier and limits in the Claude Console.

More information on usage tiers and rate limits can be found in our Claude docs.

Did this answer your question?