Interactive Visualization
Batch-Aware SSD Scheduling
The GPU waits for the last read in a KV-cache recall batch. These small traces show how FIFO leaves foreign work ahead of that tail, and how a request-aware controller can pull the critical reads forward without changing the total work. This is intentionally a one-SSD, visible-window sketch, not a full channel-aware controller model.
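The effect can be reproduced with a few lines of Python. This is a minimal sketch under the same one-SSD assumption, not the controller behind the visualization: the `Read` type, the `batch_finish_times` and `pull_forward` helpers, the batch names, and the service times in microseconds are all hypothetical and chosen only to make the arithmetic easy to follow.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Read:
    batch: str       # which request the read belongs to (hypothetical labels)
    service_us: int  # assumed service time on the single SSD, in microseconds


def batch_finish_times(queue):
    """Serve reads one at a time in queue order.

    Returns, for each batch, the time at which its last read completes.
    """
    clock = 0
    finish = {}
    for read in queue:
        clock += read.service_us
        finish[read.batch] = clock  # overwritten until the batch's final read
    return finish


def pull_forward(queue, critical):
    """Stable reorder: reads of the critical batch first, the rest in arrival order."""
    ahead = [r for r in queue if r.batch == critical]
    behind = [r for r in queue if r.batch != critical]
    return ahead + behind


# Arrival order: KV-cache recall reads interleaved with larger foreign reads.
trace = [
    Read("kv", 10),
    Read("other", 40),
    Read("kv", 10),
    Read("other", 40),
    Read("kv", 10),
]

fifo = batch_finish_times(trace)
aware = batch_finish_times(pull_forward(trace, "kv"))

print("FIFO  kv tail:", fifo["kv"], "us")    # 110 us
print("Aware kv tail:", aware["kv"], "us")   # 30 us
print("Total work   :", max(fifo.values()), "vs", max(aware.values()), "us")  # 110 vs 110
```

In this toy trace the recall batch finishes at 30 us instead of 110 us, while the SSD is busy for the same 110 us either way. The foreign batch pays for it: its last read moves from 100 us to 110 us.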
Select a trace
Step the controller
Compare the tail