Every AI operation now costs a defined number of credits. Credits are deducted before the streaming response begins, ensuring users cannot exceed their allocation mid-request. If a user lacks sufficient credits, the request is rejected with a clear error before any provider call is made.
Each subscription tier receives a monthly credit allocation that resets on the billing cycle. Unused credits do not roll over. The allocation scales with the tier to match expected usage patterns, from lightweight exploration on the free plan to high-volume production workloads on enterprise.
| Tier | Credits / Month |
|---|---|
| Free | 20 |
| Basic | 100 |
| Pro | 1,000 |
| Enterprise | 10,000 |
The credit cost per operation varies by model and feature. RAG queries cost more than simple chat completions because they involve embedding generation and vector retrieval in addition to the LLM call.