Neural Lab · Live
Watch how an LLM finds the answer.
Ask anything — even 2 + 2. Real BPE tokens, real next-token logprobs from a hosted model, real streaming response, and the transformer architecture lighting up layer by layer as your question travels through it.
⌘/Ctrl + Enter
Ready
Idle — waiting for a question
Type a question and press Run. Real BPE tokens, real next-token logprobs, real streaming answer.
0%
Transformer architecture · 7 layers · live activation
TokenizeEmbedAttentionFFN + RetrieveDeep LayersPredictGenerate
1 · Tokens of your prompt
— · 0 tokType a question above to see real BPE tokens.
2 · Top-5 next-token predictions
—Predictions will appear once you type.
3 · Streamed answer
0 tok
The model's answer will appear here, token by token, as the network fires.
4 · Pipeline events
0 evt
Events will stream in here as the pipeline runs.