Tools
Interactive Visualization
Prompt memory budget
Step through an LLM as it opens up, from a single black box to the fully unrolled autoregressive loop. Alongside, watch how much KV-cache memory one realistic request actually costs (the prompt “Write a story” generating a ~500-word response) for a small grouped-query model and a frontier MLA model.
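For a rough sense of the arithmetic behind the memory budget, here is a minimal sketch. The layer counts, head sizes, and token count are illustrative assumptions, not the values the visualization uses, and the function names are hypothetical. It assumes 16-bit cache entries, that grouped-query attention stores one key and one value vector per KV head per layer, and that MLA stores one compressed latent plus a small positional key per layer.

```python
def gqa_kv_bytes_per_token(n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    """Grouped-query attention: a key and a value vector per KV head, per layer."""
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem


def mla_kv_bytes_per_token(n_layers, latent_dim, rope_dim, bytes_per_elem=2):
    """MLA: one shared compressed latent plus a decoupled positional key, per layer."""
    return n_layers * (latent_dim + rope_dim) * bytes_per_elem


def kv_cache_bytes(bytes_per_token, n_tokens):
    """Total cache size once n_tokens (prompt + generated) have been processed."""
    return bytes_per_token * n_tokens


# Assumed token count: a few prompt tokens plus ~500 words at roughly 0.75 words/token.
n_tokens = 4 + 670

# Assumed configurations, chosen only to illustrate the two cache layouts.
small_gqa = gqa_kv_bytes_per_token(n_layers=16, n_kv_heads=8, head_dim=64)
frontier_mla = mla_kv_bytes_per_token(n_layers=61, latent_dim=512, rope_dim=64)

for name, per_token in [("small GQA", small_gqa), ("frontier MLA", frontier_mla)]:
    total = kv_cache_bytes(per_token, n_tokens)
    print(f"{name}: {per_token} B/token, {total / 2**20:.1f} MiB for {n_tokens} tokens")
```

Under these assumptions the cache grows linearly with every generated token, which is why the budget keeps climbing as the autoregressive loop unrolls.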
From the blog: LLM Inference