Access chat, vision, image generation, embeddings, speech, transcription and reranking through a single unified API — with automatic failover across multiple suppliers and full geo-filtering for data residency.
POST /v1/chat/completions
{
"model": "gpt-4o",
"region": "EU",
"messages": [{
"role": "user",
"content": "Hello!"
}]
}
EU-only routing enforced — data never leaves European infrastructure
Unified access to every leading model family across multiple providers.
| Family | Modalities | Max context | Suppliers | Geo-filter |
|---|---|---|---|---|
| Chat & Completion AWS Bedrock OpenAI · Alibaba DashScope · Anthropic Corporate +10 |
Chat & Completion | 2M | 13 | |
| Vision & Multimodal DeepInfra · Mistral AI · OpenRouter +1 |
Vision & Multimodal | 262K | 4 | |
| OCR & Documents Athens OCR CPU Utilities · Athens OCR PaddleOCR-VL · Athens OCR olmOCR +2 |
OCR & Documents | 33K | 5 | |
| Document Conversion TaaS Gateway |
Document Conversion | - | 1 | |
| Image Generation AWS Bedrock OpenAI · OpenAI · OpenAI Corporate +1 |
Image Generation | - | 4 | |
| Embedding CloudSigma · OTE Greece |
Embedding | - | 2 | |
| Reranking CloudSigma · OTE Greece |
Reranking | - | 2 | |
| Text-to-Speech OTE Greece |
Text-to-Speech | - | 1 | |
| Transcription Groq · OTE Greece |
Transcription | - | 2 | |
| Speaker Recognition OTE Greece |
Speaker Recognition | - | 1 | |
| Audio Understanding OTE Greece |
Audio Understanding | - | 1 |
Every model request benefits from platform-level safeguards.
When a supplier returns an error or rate-limit, TaaS instantly retries the same request against a secondary provider — completely transparent to your application.
Pass region: "EU" in any request to guarantee traffic stays within European infrastructure. Per-key region restrictions enforce this at the API level.
Prefer specific providers for cost or compliance reasons? Restrict keys to named suppliers while keeping automatic failover within your approved list.
Chat, vision, image generation, embeddings, speech synthesis, audio transcription, and reranking — one key, one endpoint, one bill. No per-provider credentials to manage.
Drop-in replacement for the OpenAI SDK. Change one base URL and start accessing every model family — no other code changes required.
Per-model, per-key token and cost breakdowns in real time. Set budget caps per key so a single workload can never overspend.
Add "region": "EU" to any API call. TaaS selects only EU-hosted supplier endpoints and rejects the request if no compliant route is available — giving you a hard guarantee, not a best-effort one.
Combine with per-key allowed_regions to enforce residency at the credential level, so even misconfigured clients can't route traffic outside your approved geography.
# EU-only chat request
curl https://taas.cloudsigma.com/v1/chat/completions \
-H "Authorization: Bearer $TAAS_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-4o",
"region": "EU",
"messages": [
{"role": "user", "content": "Summarise this contract."}
]
}'
# Response header confirms routing:
# X-TaaS-Supplier: openai-eu
# X-TaaS-Region: EU
TaaS maintains live health scores for every supplier. When an upstream returns a 429, 500, or timeout, your request is retried against the next best option — typically in under 200 ms.
Failover respects your geo and supplier restrictions: if your key is locked to EU suppliers, failover only considers other EU-approved routes.
Read the developer guide# Python — using openai SDK, zero changes needed
from openai import OpenAI
client = OpenAI(
api_key="your-taas-key",
base_url="https://taas.cloudsigma.com/v1"
)
# TaaS handles failover transparently
response = client.chat.completions.create(
model="claude-sonnet-4-5",
messages=[{"role": "user",
"content": "Draft a privacy notice."}]
)
print(response.choices[0].message.content)
No per-provider accounts. No credential rotation. Full multi-modal coverage with EU-ready geo-filtering from day one.