About Speed Arena

Speed Arena is a real-time LLM speed benchmark tool. Type a prompt and watch multiple AI models race to generate their responses side by side, with live timers and token counters.

Bring your own API keys to race any combination of 11 models from top providers. Taalas HC1 is always in the race as the defending champion — no key needed.

Models (11)

Taalas HC1 — Llama 3.1-8B (built-in, no key needed)
GPT-4.1 nano — OpenAI (164 tok/s)
Claude 4.5 Haiku — Anthropic (96 tok/s)
Gemini Flash — Google
GLM-4.7 Flash — Zhipu AI
Llama 4 Scout — Cerebras (2,600 tok/s)
DeepSeek V3.1 — Fireworks AI (355 tok/s)
Mistral Small — Mistral AI
Grok 4.1 Fast — xAI
MiniMax-M2.5 — MiniMax
Kimi K2.5 — Moonshot AI
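
The figures in parentheses are throughput in tokens per second (tok/s). This page does not say how Speed Arena measures them. A common definition, assumed here, is generated tokens divided by elapsed wall-clock time. The function name and the sample numbers below are hypothetical and chosen only for illustration.

```python
def tokens_per_second(token_count: int, elapsed_seconds: float) -> float:
    """Throughput as generated tokens divided by elapsed wall-clock seconds.

    Assumed definition; Speed Arena's own measurement may differ.
    """
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return token_count / elapsed_seconds


# Illustrative numbers only: 820 tokens streamed in 5 seconds.
rate = tokens_per_second(820, 5.0)
print(f"{rate:.0f} tok/s")  # 164 tok/s
```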

How it works

1. Add your API keys for the providers you want to race (keys are stored only in your browser)

2. Select which challengers to race against Taalas HC1

3. Type a prompt and hit RACE

4. Watch all models stream their responses simultaneously with live timers
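
The race in steps 3 and 4 can be sketched roughly as follows. This is a minimal simulation, not Speed Arena's actual code: `fake_stream` is a hypothetical stand-in for a provider's streaming API, and the model names and delays are invented. It shows only the general idea of starting every stream at the same moment, counting tokens per lane, and timing each finish.

```python
import asyncio
import time
from dataclasses import dataclass


@dataclass
class RaceResult:
    model: str
    tokens: int
    elapsed: float  # seconds from race start to this lane's last token


async def fake_stream(n_tokens: int, delay: float):
    """Hypothetical stand-in for a provider's streaming response."""
    for i in range(n_tokens):
        await asyncio.sleep(delay)  # simulated per-token latency
        yield f"tok{i}"


async def run_lane(model: str, stream, start: float) -> RaceResult:
    """Consume one model's stream, counting tokens and timing the finish."""
    tokens = 0
    async for _ in stream:
        tokens += 1
    return RaceResult(model, tokens, time.perf_counter() - start)


async def race(lanes: dict) -> list[RaceResult]:
    """Start every lane concurrently and return results, fastest first."""
    start = time.perf_counter()
    results = await asyncio.gather(
        *(run_lane(name, stream, start) for name, stream in lanes.items())
    )
    return sorted(results, key=lambda r: r.elapsed)


def demo() -> list[RaceResult]:
    # Invented lanes: same token count, different per-token delay.
    lanes = {
        "fast-model": fake_stream(10, 0.001),
        "slow-model": fake_stream(10, 0.03),
    }
    return asyncio.run(race(lanes))


if __name__ == "__main__":
    for r in demo():
        print(f"{r.model}: {r.tokens} tokens in {r.elapsed:.3f}s")
```

Because all lanes share one start time, each elapsed value includes any wait before the first token, which is one reason results can differ between runs.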

Links

taalas.com — Taalas AI inference hardware
@Geekissimo — Creator

Speed Arena is BYOK (Bring Your Own Key). API keys are stored locally in your browser and never logged on the server. Results may vary based on network conditions, server load, and prompt complexity.