Architecture

Overview

PoolSwitch is built around one shared routing engine.

That core is reused in two product surfaces: embedded clients and a proxy server.

In both modes, the same components handle state and failover.

Embedded clients: direct in-process clients for Python and Node.js
Proxy server: HTTP wrapper around the same routing engine for multi-language access
Storage: in-memory, Redis, and SQLite backends for persisting key state
Strategies: algorithms like round_robin, least_used, and quota_failover
Core pool: central state transitions for success, cooldown, and key lifecycle
Quota and retry: provider-agnostic quota evaluation and exponential backoff retry policies
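
To make the strategy component more concrete, here is a minimal sketch of how round_robin and least_used selection might look. The `KeyState` record, its fields, and the `select` method are hypothetical names chosen for illustration; they are not taken from the PoolSwitch codebase.

```python
from dataclasses import dataclass


@dataclass
class KeyState:
    """Hypothetical per-key record; field names are illustrative only."""
    key_id: str
    uses: int = 0
    healthy: bool = True


def _healthy(keys: list[KeyState]) -> list[KeyState]:
    """Return the keys that are currently eligible for selection."""
    candidates = [k for k in keys if k.healthy]
    if not candidates:
        raise LookupError("no healthy keys available")
    return candidates


class RoundRobin:
    """Cycle through the healthy keys in order."""

    def __init__(self) -> None:
        self._cursor = 0

    def select(self, keys: list[KeyState]) -> KeyState:
        candidates = _healthy(keys)
        choice = candidates[self._cursor % len(candidates)]
        self._cursor += 1
        return choice


class LeastUsed:
    """Pick the healthy key with the fewest recorded uses."""

    def select(self, keys: list[KeyState]) -> KeyState:
        return min(_healthy(keys), key=lambda k: k.uses)
```

Both classes expose the same `select` method, so in this sketch a pool could swap strategies without changing its own state handling.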

1. PoolSwitch selects a healthy key.
2. It injects the provider auth header.
3. It sends the upstream request.
4. It classifies the upstream result:
   - success
   - retryable rate limit
   - quota exhaustion
   - network failure
5. It updates key state, cooldowns, failovers, and metrics.
6. It returns the final parsed result or final error.
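
The request flow above can be sketched in a few dozen lines. Everything here is an assumption made for illustration: the `Pool`, `Key`, `Response`, and `handle_request` names, the `Authorization: Bearer` header, the 30-second cooldown, and the rule that a 429 whose body mentions "quota" means quota exhaustion. Real providers signal these conditions differently.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, Optional


class Outcome(Enum):
    SUCCESS = "success"
    RATE_LIMIT = "retryable_rate_limit"
    QUOTA_EXHAUSTED = "quota_exhaustion"
    NETWORK_FAILURE = "network_failure"


@dataclass
class Response:
    status: int
    body: str = ""


@dataclass
class Key:
    key_id: str
    secret: str
    cooldown_until: float = 0.0
    exhausted: bool = False


@dataclass
class Pool:
    keys: list[Key]
    metrics: dict[str, int] = field(default_factory=dict)

    def select(self, now: float) -> Optional[Key]:
        """Return the first key that is neither exhausted nor cooling down."""
        for key in self.keys:
            if not key.exhausted and key.cooldown_until <= now:
                return key
        return None

    def record(self, key: Key, outcome: Outcome, now: float,
               cooldown: float = 30.0) -> None:
        """Update metrics and key state for one classified result."""
        self.metrics[outcome.value] = self.metrics.get(outcome.value, 0) + 1
        if outcome in (Outcome.RATE_LIMIT, Outcome.NETWORK_FAILURE):
            key.cooldown_until = now + cooldown  # assumed cooldown length
        elif outcome is Outcome.QUOTA_EXHAUSTED:
            key.exhausted = True


def classify(response: Optional[Response]) -> Outcome:
    """Assumed mapping; None means the transport itself failed."""
    if response is None:
        return Outcome.NETWORK_FAILURE
    if 200 <= response.status < 300:
        return Outcome.SUCCESS
    if response.status == 429:
        if "quota" in response.body.lower():
            return Outcome.QUOTA_EXHAUSTED
        return Outcome.RATE_LIMIT
    raise ValueError(f"status {response.status} is outside this sketch")


def handle_request(pool: Pool,
                   send: Callable[[dict[str, str]], Response],
                   now: float = 0.0) -> str:
    """Select, inject auth, send, classify, update state, return."""
    while True:
        key = pool.select(now)
        if key is None:
            raise RuntimeError("all keys are exhausted or cooling down")
        headers = {"Authorization": f"Bearer {key.secret}"}  # assumed header
        try:
            response = send(headers)
        except OSError:
            response = None
        outcome = classify(response)
        pool.record(key, outcome, now)
        if outcome is Outcome.SUCCESS:
            return response.body
        # Any other outcome sidelines this key, so the loop fails over.
```

The `send` callable is injected so the flow can be exercised without a network. Because every non-success outcome removes the current key from selection, the loop ends either with a successful body or with the final error once no key is left.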

Retries occur only before a successful upstream response is accepted by the caller.
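
A minimal sketch of that rule combined with exponential backoff follows. The `call_with_retry` function, the `RetryableError` exception, and the default delays are hypothetical; the actual retry policy parameters are not given in this section.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RetryableError(Exception):
    """Hypothetical marker for failures that are safe to retry."""


def call_with_retry(attempt_fn: Callable[[], T],
                    max_retries: int = 3,
                    base: float = 0.5,
                    factor: float = 2.0,
                    max_delay: float = 8.0,
                    sleep: Callable[[float], None] = time.sleep) -> T:
    """Call attempt_fn until it succeeds or the retries run out.

    The first successful result is returned immediately and is never
    retried. Waits grow as base * factor**attempt, capped at max_delay.
    """
    for attempt in range(max_retries + 1):
        try:
            return attempt_fn()
        except RetryableError:
            if attempt == max_retries:
                raise  # retries exhausted: surface the final error
            sleep(min(base * factor ** attempt, max_delay))
    raise AssertionError("unreachable")
```

Passing `sleep` as a parameter keeps the sketch testable: a test can record the requested delays instead of actually waiting.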

Prometheus-compatible /metrics includes: