Overview
DeepRecall enforces rate limits to ensure fair usage and maintain service quality for all users. Rate limits are applied per API key.Standard Rate Limits
All API keys have the following rate limits:Standard Limits
- 60 requests per minute
- 1,000 requests per hour
- 10,000 requests per day
- 50,000 requests per month
π Need Higher Limits?
π Need Higher Limits?
Contact us for custom quotas:
How Rate Limits Work
Rate limits are enforced using a sliding window algorithm:- Per-Minute Limit - Maximum requests in any 60-second window
- Per-Hour Limit - Maximum requests in any 60-minute window
- Per-Day Limit - Maximum requests in any 24-hour window
- Monthly Quota - Total requests allowed per calendar month
All time windows are calculated using a sliding window, not fixed periods. This prevents burst traffic at period boundaries.
Checking Your Rate Limit Status
Every API response includes rate limit information:Response Headers (Coming Soon)
Response Body
Rate Limit Exceeded
When you exceed a rate limit, youβll receive a429 Too Many Requests response:
What To Do
1
Wait
Wait until the
reset_at time before making more requests2
Implement Retry Logic
Use exponential backoff to retry failed requests
3
Optimize Requests
Use pagination and filtering to reduce API calls
4
Request Higher Limits
If you consistently hit limits, contact us for custom quotas
Best Practices
1. Implement Exponential Backoff
2. Cache Responses
Cache API responses to reduce redundant requests:3. Use Pagination Efficiently
Request only the data you need:4. Monitor Usage
Track your usage to avoid unexpected limit hits:5. Optimize Filters
Use specific filters to get only the data you need:Monthly Quota Reset
Monthly quotas reset on the 1st day of each month at 00:00 UTC.Unused quota does not roll over to the next month.
Monitoring Your Usage
API Response
Every API response includes usage information in therate_limit field:
monthly_usage- API calls made this monthremaining_credits- Credits remaining in quotamonthly_quota- Your total monthly allocation
Usage Reports
π Request Usage Reports
π Request Usage Reports
Get detailed usage reports and monitoring:
Need Higher Limits?
Contact us to discuss custom rate limits for your use case:π§ Contact Us
π§ Contact Us
Request custom rate limits and quotas:
FAQ
Do failed requests count against my quota?
Do failed requests count against my quota?
No. Only successful requests (status codes 200-299) count against your quota. Rate limit errors (429) and auth errors (401, 403) do not count.
Can I get a temporary limit increase?
Can I get a temporary limit increase?
Yes. Contact support if you have a legitimate temporary need (e.g., data migration, backfill). We can grant temporary increases on a case-by-case basis.
What happens if I exceed my monthly quota?
What happens if I exceed my monthly quota?
Your API calls will return
429 Too Many Requests until:- Your quota resets (next month)
- You contact us to request additional quota
- You purchase additional quota (enterprise plans only)
Do rate limits apply to the /health endpoint?
Do rate limits apply to the /health endpoint?
No. The
/health endpoint is public and has no rate limits. Itβs designed for monitoring services.Can I have multiple API keys with different limits?
Can I have multiple API keys with different limits?
Each API key inherits the limits from your account plan. All keys under the same account share the same monthly quota.
Need Help?
π§ Support
π§ Support
For rate limit questions or custom limits: