Models — Token-as-a-Service

Model families at a glance

Unified access to every leading model family across multiple providers.

Family	Modalities	Max context	Suppliers
Chat & Completion AWS Bedrock OpenAI · Alibaba DashScope · Anthropic Corporate +10	Chat & Completion	2M	13
Vision & Multimodal DeepInfra · Mistral AI · OpenRouter +1	Vision & Multimodal	262K	4
OCR & Documents Athens OCR CPU Utilities · Athens OCR PaddleOCR-VL · Athens OCR olmOCR +2	OCR & Documents	33K	5
Document Conversion TaaS Gateway	Document Conversion	-	1
Image Generation AWS Bedrock OpenAI · OpenAI · OpenAI Corporate +1	Image Generation	-	4
Embedding CloudSigma · OTE Greece	Embedding	-	2
Reranking CloudSigma · OTE Greece	Reranking	-	2
Text-to-Speech OTE Greece	Text-to-Speech	-	1
Transcription Groq · OTE Greece	Transcription	-	2
Speaker Recognition OTE Greece	Speaker Recognition	-	1
Audio Understanding OTE Greece	Audio Understanding	-	1

Built for reliability and compliance

Every model request benefits from platform-level safeguards.

Automatic failover

When a supplier returns an error or rate-limit, TaaS instantly retries the same request against a secondary provider — completely transparent to your application.

Geo-routing & EU-only

Pass region: "EU" in any request to guarantee traffic stays within European infrastructure. Per-key region restrictions enforce this at the API level.

Supplier filtering

Prefer specific providers for cost or compliance reasons? Restrict keys to named suppliers while keeping automatic failover within your approved list.

Multi-modal in one API

Chat, vision, image generation, embeddings, speech synthesis, audio transcription, and reranking — one key, one endpoint, one bill. No per-provider credentials to manage.

OpenAI-compatible

Drop-in replacement for the OpenAI SDK. Change one base URL and start accessing every model family — no other code changes required.

Usage & cost visibility

Per-model, per-key token and cost breakdowns in real time. Set budget caps per key so a single workload can never overspend.

Data Residency

EU-only routing in a single field

Add "region": "EU" to any API call. TaaS selects only EU-hosted supplier endpoints and rejects the request if no compliant route is available — giving you a hard guarantee, not a best-effort one.

Combine with per-key allowed_regions to enforce residency at the credential level, so even misconfigured clients can't route traffic outside your approved geography.

Explore access controls

# EU-only chat request
curl https://taas.cloudsigma.com/v1/chat/completions \
  -H "Authorization: Bearer $TAAS_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "region": "EU",
    "messages": [
      {"role": "user", "content": "Summarise this contract."}
    ]
  }'

# Response header confirms routing:
# X-TaaS-Supplier: openai-eu
# X-TaaS-Region:   EU

Resilience

Multi-supplier failover — automatic and invisible

TaaS maintains live health scores for every supplier. When an upstream returns a 429, 500, or timeout, your request is retried against the next best option — typically in under 200 ms.

Failover respects your geo and supplier restrictions: if your key is locked to EU suppliers, failover only considers other EU-approved routes.

Read the developer guide

# Python — using openai SDK, zero changes needed
from openai import OpenAI

client = OpenAI(
    api_key="your-taas-key",
    base_url="https://taas.cloudsigma.com/v1"
)

# TaaS handles failover transparently
response = client.chat.completions.create(
    model="claude-sonnet-4-5",
    messages=[{"role": "user",
               "content": "Draft a privacy notice."}]
)
print(response.choices[0].message.content)

Get started today

Access every model through one API

No per-provider accounts. No credential rotation. Full multi-modal coverage with EU-ready geo-filtering from day one.

View pricing Developer guide