OpenAI compatible API. Attested gateway. Public status.

Phala

Phala models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

phala

Confidential

All providers

ProviderPhala
Models18 public models
Prepaid routes18
BYOK routes18
Zero data retentionyes
Confidential computeyes
Provider E2EEyes
Policy noteTracked as a confidential AI provider with provider-side attestation and encrypted prompt transport.
Policy source

Measured performance

302 samples

Continuously sampled across Phala's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT3302 ms
Throughput76 tok/s
Uptime57.95%
Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
qwen/qwen-2.5-7b-instruct 1259 ms 1256 ms 46.15% 13
z-ai/glm-4.7 2553 ms 2450 ms 72.22% 18
qwen/qwen3.5-397b-a17b 2837 ms 2733 ms 50.00% 16
openai/gpt-oss-120b 2939 ms 2937 ms 59.09% 22
google/gemma-3-27b-it 2972 ms 2868 ms 64.29% 28
qwen/qwen3-vl-30b-a3b-instruct 3002 ms 3000 ms 76.47% 17
qwen/qwen3-30b-a3b-instruct-2507 3054 ms 3051 ms 75.00% 8
z-ai/glm-4.7-flash 3184 ms 3182 ms 76 tok/s 76.00% 25
moonshotai/kimi-k2.6 3302 ms 3301 ms 64.29% 14
moonshotai/kimi-k2.5 3416 ms 3414 ms 57.14% 14
deepseek/deepseek-v3.2 3426 ms 3322 ms 9 tok/s 59.09% 22
qwen/qwen2.5-vl-72b-instruct 3592 ms 3590 ms 61.54% 13
openai/gpt-oss-20b 3734 ms 3733 ms 38.89% 18
minimax/minimax-m2.5 3905 ms 3903 ms 42.86% 21
deepseek/deepseek-chat-v3.1 4130 ms 4128 ms 52.94% 17
z-ai/glm-5 4354 ms 4353 ms 60.00% 15
z-ai/glm-5.1 4565 ms 4462 ms 41.18% 17
qwen/qwen3.5-27b 0.00% 6 probe_config_error 4

Full provider & model leaderboard.

Provider models

Models served by Phala.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model Context Endpoints Prompt Completion Routes
deepseek/deepseek-chat-v3.1
DeepSeek: DeepSeek V3.1
163,840 2 $1.155/1M $3.41/1M prepaid BYOK
deepseek/deepseek-v3.2
DeepSeek: DeepSeek V3.2
163,840 2 $0.352/1M $0.528/1M prepaid BYOK
google/gemma-3-27b-it
Google: Gemma 3 27B
131,072 2 $0.121/1M $0.44/1M prepaid BYOK
minimax/minimax-m2.5
MiniMax: MiniMax M2.5
204,800 2 $0.22/1M $1.518/1M prepaid BYOK
moonshotai/kimi-k2.5
MoonshotAI: Kimi K2.5
262,144 2 $0.66/1M $3.3/1M prepaid BYOK
moonshotai/kimi-k2.6
MoonshotAI: Kimi K2.6
262,144 2 $1.199/1M $5.06/1M prepaid BYOK
openai/gpt-oss-120b
OpenAI: gpt-oss-120b
131,072 2 $0.165/1M $0.66/1M prepaid BYOK
openai/gpt-oss-20b
OpenAI: gpt-oss-20b
131,072 2 $0.044/1M $0.165/1M prepaid BYOK
qwen/qwen-2.5-7b-instruct
Qwen: Qwen2.5 7B Instruct
131,072 2 $0.044/1M $0.11/1M prepaid BYOK
qwen/qwen2.5-vl-72b-instruct
Qwen: Qwen2.5 VL 72B Instruct
131,072 2 $0.22/1M $0.77/1M prepaid BYOK
qwen/qwen3-30b-a3b-instruct-2507
Qwen: Qwen3 30B A3B Instruct 2507
131,072 2 $0.165/1M $0.605/1M prepaid BYOK
qwen/qwen3-vl-30b-a3b-instruct
Qwen: Qwen3 VL 30B A3B Instruct
262,144 2 $0.22/1M $0.77/1M prepaid BYOK
qwen/qwen3.5-27b
Qwen: Qwen3.5-27B
262,144 2 $0.33/1M $2.64/1M prepaid BYOK
qwen/qwen3.5-397b-a17b
Qwen: Qwen3.5 397B A17B
262,144 2 $0.605/1M $3.85/1M prepaid BYOK
z-ai/glm-4.7
Z.ai: GLM 4.7
202,752 2 $0.935/1M $3.63/1M prepaid BYOK
z-ai/glm-4.7-flash
Z.ai: GLM 4.7 Flash
202,752 2 $0.11/1M $0.473/1M prepaid BYOK
z-ai/glm-5
Z.ai: GLM 5
204,800 2 $1.32/1M $3.85/1M prepaid BYOK
z-ai/glm-5.1
Z.ai: GLM 5.1
202,752 2 $1.331/1M $4.62/1M prepaid BYOK

Sign in

Choose a sign in method.