Tokens/Second Visualizer — Free & Secure | werkzeuge

Feel how fast an LLM responds at a given token rate. Streams text live in your browser — nothing is transmitted.

100% in your browser — nothing leaves your device.

Sample text

Paste your own text

Long German compound words are split into many subword tokens — that is why they cost more than short English words.

Speed (tokens/second)

Presets:

Adjustable during playback. Typical values: local/CPU ~10, cloud models ~50–120.

Total tokens

Shown

Elapsed (s)

Effective (tok/s)

Estimated duration (s)
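The statistics above are related by simple arithmetic. The following is a minimal sketch of that arithmetic, not the tool's actual code: the function names `estimated_duration`, `effective_rate`, and `tokens_due` are hypothetical, and it assumes a constant rate for the whole playback.

```python
import math


def estimated_duration(total_tokens: int, tokens_per_second: float) -> float:
    """Seconds needed to stream all tokens at a constant rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return total_tokens / tokens_per_second


def effective_rate(shown_tokens: int, elapsed_seconds: float) -> float:
    """Observed tokens per second so far; 0.0 before any time has passed."""
    if elapsed_seconds <= 0:
        return 0.0
    return shown_tokens / elapsed_seconds


def tokens_due(elapsed_seconds: float, tokens_per_second: float, total_tokens: int) -> int:
    """How many tokens should be visible after the given elapsed time.

    Assumption: the rate stays fixed; a rate changed during playback
    would need the elapsed time tracked per rate segment instead.
    """
    return min(total_tokens, math.floor(elapsed_seconds * tokens_per_second))


if __name__ == "__main__":
    # Compare the typical rates quoted above for a 600-token answer.
    for rate in (10, 50, 120):
        print(f"{rate:>3} tok/s -> {estimated_duration(600, rate):.1f} s")
```

Under these assumptions, a 600-token answer takes 60 seconds at 10 tok/s, 12 seconds at 50 tok/s, and 5 seconds at 120 tok/s.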

Live stream

Estimate: the token split is a heuristic, so real tokenizers (OpenAI BPE, Claude, Gemini) will count differently. For exact OpenAI token counts, use the Token Counter.
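To illustrate what such a heuristic might look like, here is a rough sketch. It is not the tool's own algorithm: the name `estimate_tokens` and the rule of about four characters per token are assumptions chosen for illustration, and real tokenizers will give different counts.

```python
import math
import re

# Assumption: roughly four characters per subword token. This is a common
# rule of thumb, not the behaviour of any particular tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Rough token estimate: words cost ceil(len / 4), punctuation costs 1."""
    total = 0
    # Each match is a (word, punctuation) pair; exactly one side is non-empty.
    for word, _punct in re.findall(r"(\w+)|([^\w\s])", text):
        if word:
            total += math.ceil(len(word) / CHARS_PER_TOKEN)
        else:
            total += 1
    return total


if __name__ == "__main__":
    for sample in ("ship", "Donaudampfschifffahrtsgesellschaft"):
        print(sample, estimate_tokens(sample))
```

With this rule, the 34-letter German compound is estimated at 9 tokens while the short English word "ship" is estimated at 1, which mirrors why long compound words cost more.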
