v0.2.1

Credit System

Usage-based billing with tier-specific credit allocations and pre-deduction before AI streaming.

Every AI operation now costs a defined number of credits. Credits are deducted before the streaming response begins, ensuring users cannot exceed their allocation mid-request. If a user lacks sufficient credits, the request is rejected with a clear error before any provider call is made.

Each subscription tier receives a monthly credit allocation that resets on the billing cycle. Unused credits do not roll over. The allocation scales with the tier to match expected usage patterns, from lightweight exploration on the free plan to high-volume production workloads on enterprise.

TierCredits / Month
Free20
Basic100
Pro1,000
Enterprise10,000

The credit cost per operation varies by model and feature. RAG queries cost more than simple chat completions because they involve embedding generation and vector retrieval in addition to the LLM call.

Contributors

Sascha RahnSascha Rahn