Tools
Interactive Visualization

Batch-Aware SSD Scheduling

The GPU waits for the last read in a KV-cache recall batch. These small traces show how FIFO leaves foreign work ahead of that tail, and how a request-aware controller can pull the critical reads forward without changing the total work. This is intentionally a one-SSD, visible-window sketch, not a full channel-aware controller model.

Select a traceStep the controllerCompare the tail

Loading scheduler