
Manage Token Settings

Token settings let you control how your API keys consume resources. You can set quotas and rate limits, and assign keys to token groups so that related keys are managed together.

Token Quota

Each API key can have an optional token quota — the maximum number of tokens it can consume. Once the quota is reached, requests using that key will be rejected until the quota is reset or increased.
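As a rough illustration of this behavior, the following Python sketch models a key with an optional quota. The `KeyQuota` class and its method names are hypothetical and are not part of any 3xCoder API. The sketch also assumes that a request is rejected only once recorded usage has reached the quota, which the description implies but does not spell out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class KeyQuota:
    """Hypothetical model of one API key's token quota."""
    limit: Optional[int] = None  # None means no quota is set
    used: int = 0

    def allows_request(self) -> bool:
        # Assumption: requests are rejected once usage has reached the quota.
        return self.limit is None or self.used < self.limit

    def record_usage(self, tokens: int) -> None:
        # Add the tokens consumed by a completed request.
        self.used += tokens

    def reset(self) -> None:
        # Models a quota reset: usage starts again from zero.
        self.used = 0


key = KeyQuota(limit=1000)
key.record_usage(1000)
blocked = not key.allows_request()  # quota reached, so requests are rejected
key.limit = 2000                    # increasing the quota unblocks the key
unblocked = key.allows_request()
```

A key with no quota (`limit=None`) is never blocked in this model, which matches the quota being optional.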

Setting a Quota

  1. Go to your dashboard
  2. Navigate to Token Management
  3. Select the key you want to configure
  4. Set the Token Quota value
  5. Save changes

Quota Reset

Quotas can be configured to reset on a schedule (daily, weekly, monthly) or managed manually from the dashboard.
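To make the schedule options concrete, here is a minimal sketch of how the next reset time could be computed. The function name `next_reset` is hypothetical, and the reset boundaries are assumptions: midnight UTC for daily resets, Monday for weekly resets, and the first of the month for monthly resets. The actual boundaries used by the service may differ.

```python
from datetime import datetime, timedelta, timezone


def next_reset(now: datetime, schedule: str) -> datetime:
    """Return the next reset time after `now` for the given schedule."""
    midnight = now.replace(hour=0, minute=0, second=0, microsecond=0)
    if schedule == "daily":
        # Assumption: daily quotas reset at the next midnight.
        return midnight + timedelta(days=1)
    if schedule == "weekly":
        # Assumption: weekly quotas reset on the next Monday (weekday() == 0).
        return midnight + timedelta(days=7 - midnight.weekday())
    if schedule == "monthly":
        # Assumption: monthly quotas reset on the first day of the next month.
        if now.month == 12:
            year, month = now.year + 1, 1
        else:
            year, month = now.year, now.month + 1
        return midnight.replace(year=year, month=month, day=1)
    raise ValueError(f"unknown schedule: {schedule!r}")


now = datetime(2024, 1, 31, 15, 0, tzinfo=timezone.utc)  # a Wednesday
daily = next_reset(now, "daily")      # 2024-02-01 00:00 UTC
weekly = next_reset(now, "weekly")    # 2024-02-05 00:00 UTC (Monday)
monthly = next_reset(now, "monthly")  # 2024-02-01 00:00 UTC
```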

Rate Limits

Rate limits cap how many requests a key can make, and how many tokens it can consume, within a time window.

Setting               Description
Requests per minute   Maximum API calls per minute
Tokens per minute     Maximum tokens consumed per minute
Requests per day      Maximum API calls per day
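The three settings can be pictured as independent checks that a request must all pass. The sketch below uses a sliding-window log to enforce them. This is an illustrative choice, not a description of how the service actually enforces limits, and the `RateLimiter` class and its field names are hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Deque, Tuple

MINUTE = 60.0
DAY = 86_400.0


@dataclass
class RateLimiter:
    """Hypothetical sliding-window limiter for a single API key."""
    requests_per_minute: int
    tokens_per_minute: int
    requests_per_day: int
    # Log of accepted requests as (timestamp in seconds, tokens consumed).
    _events: Deque[Tuple[float, int]] = field(default_factory=deque, repr=False)

    def allow(self, now: float, tokens: int) -> bool:
        # Forget requests that fell out of the one-day window.
        while self._events and now - self._events[0][0] >= DAY:
            self._events.popleft()
        last_minute = [(t, n) for t, n in self._events if now - t < MINUTE]

        if len(self._events) >= self.requests_per_day:
            return False  # requests-per-day limit reached
        if len(last_minute) >= self.requests_per_minute:
            return False  # requests-per-minute limit reached
        if sum(n for _, n in last_minute) + tokens > self.tokens_per_minute:
            return False  # this request would exceed tokens-per-minute
        self._events.append((now, tokens))
        return True


limiter = RateLimiter(requests_per_minute=2, tokens_per_minute=100, requests_per_day=3)
first = limiter.allow(now=0.0, tokens=10)    # accepted
second = limiter.allow(now=1.0, tokens=10)   # accepted
third = limiter.allow(now=2.0, tokens=10)    # rejected: 2 requests already in the last minute
later = limiter.allow(now=61.0, tokens=95)   # accepted: the minute window has moved on
capped = limiter.allow(now=62.0, tokens=1)   # rejected: 3 requests already today
```

Rejected requests are not logged in this sketch, so they do not count against any limit.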

Token Groups

Token groups let you organize multiple keys under a single management unit. This is useful for:

  • Team management — Group keys by team or department
  • Project isolation — Separate token budgets per project
  • Usage tracking — Monitor consumption at the group level

Creating a Token Group

  1. Go to Token Management → Groups
  2. Click Create Group
  3. Set the group name and shared quota
  4. Assign API keys to the group
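The steps above produce a group whose keys draw from one shared budget. The following sketch models that idea. The `TokenGroup` class, the key IDs, and the group name are all hypothetical. How a shared quota interacts with a key's own quota is not specified here, so the sketch tracks only the group-level budget.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class TokenGroup:
    """Hypothetical model of a token group with a shared quota."""
    name: str
    shared_quota: int
    usage_by_key: Dict[str, int] = field(default_factory=dict)

    def assign(self, key_id: str) -> None:
        # Add a key to the group; existing usage is left untouched.
        self.usage_by_key.setdefault(key_id, 0)

    def used(self) -> int:
        # Group-level consumption is the sum over all member keys.
        return sum(self.usage_by_key.values())

    def record_usage(self, key_id: str, tokens: int) -> bool:
        if key_id not in self.usage_by_key:
            raise KeyError(f"{key_id} is not assigned to group {self.name}")
        # Assumption: once the shared quota is reached, every key is blocked.
        if self.used() >= self.shared_quota:
            return False
        self.usage_by_key[key_id] += tokens
        return True


group = TokenGroup(name="backend-team", shared_quota=1000)
group.assign("key-a")
group.assign("key-b")
ok_a = group.record_usage("key-a", 600)   # accepted
ok_b = group.record_usage("key-b", 400)   # accepted, group is now at its quota
blocked = group.record_usage("key-a", 1)  # rejected for every key in the group
```

Keeping usage per key inside the group is what allows both group-level totals and a per-key breakdown from the same data.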

Modifying Token Settings

You can modify token settings at any time from the dashboard:

  • Increase/decrease quotas — Adjust based on actual usage patterns
  • Change rate limits — Tighten or relax as needed
  • Move keys between groups — Reorganize without recreating keys
  • View usage history — Check consumption trends over time

Usage Monitoring

The dashboard provides real-time usage statistics:

  • Total tokens used — Across all keys or per key
  • Request count — Number of API calls
  • Model breakdown — Token usage per model
  • Time series — Usage trends over time
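The four statistics listed above are all aggregations over the same per-request usage data. The sketch below shows one way they could be derived. The record layout (`key_id`, `model`, `day`, `tokens`), the model names, and the `summarize` function are assumptions made for illustration; the dashboard's actual data format is not documented here.

```python
from collections import Counter
from typing import Dict, List, TypedDict


class UsageRecord(TypedDict):
    """Hypothetical shape of one logged API call."""
    key_id: str
    model: str
    day: str     # ISO date, e.g. "2024-01-31"
    tokens: int


def summarize(records: List[UsageRecord]) -> Dict[str, object]:
    per_key: Counter = Counter()
    per_model: Counter = Counter()
    per_day: Counter = Counter()
    for r in records:
        per_key[r["key_id"]] += r["tokens"]
        per_model[r["model"]] += r["tokens"]
        per_day[r["day"]] += r["tokens"]
    return {
        "total_tokens": sum(per_key.values()),  # across all keys
        "request_count": len(records),          # number of API calls
        "per_key": dict(per_key),               # tokens per key
        "per_model": dict(per_model),           # model breakdown
        "per_day": dict(sorted(per_day.items())),  # time series, oldest first
    }


records: List[UsageRecord] = [
    {"key_id": "key-a", "model": "model-x", "day": "2024-01-02", "tokens": 300},
    {"key_id": "key-b", "model": "model-y", "day": "2024-01-01", "tokens": 200},
    {"key_id": "key-a", "model": "model-y", "day": "2024-01-02", "tokens": 100},
]
summary = summarize(records)
```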

TIP

Set up quota alerts to get notified before you hit your limits. This helps you avoid unexpected service interruptions.
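A quota alert is essentially a threshold check on usage relative to the quota. The sketch below shows the idea. The function name `alerts_due` and the default thresholds of 80% and 95% are assumptions chosen for illustration, not documented defaults.

```python
from typing import List, Sequence


def alerts_due(used: int, quota: int,
               thresholds: Sequence[float] = (0.8, 0.95)) -> List[float]:
    """Return the thresholds (as fractions of the quota) that usage has reached."""
    if quota <= 0:
        raise ValueError("quota must be positive")
    ratio = used / quota
    return [t for t in sorted(thresholds) if ratio >= t]


quiet = alerts_due(used=100, quota=1000)    # no threshold reached yet
warning = alerts_due(used=850, quota=1000)  # the 80% threshold has been reached
urgent = alerts_due(used=960, quota=1000)   # both thresholds have been reached
```

Using more than one threshold gives an early warning with time to raise the quota, followed by a final notice shortly before requests would start being rejected.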

3xCoder — Unified AI API Endpoint