Paper And Artifact
DynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agents
The paper introduces SESC for calibrated event streams, SSI for
difficulty stratification, and a modular simulation-evaluation
stack for testing reactive and lookahead-based scheduling
policies under different observability regimes.
What is released
The public package exposes the benchmark generator, simulator,
evaluation helpers, visualization commands, and agent
interfaces under the single Python import dsbx.
What to read first
Start with the installation and CLI pages, then follow the
quickstart workflow to generate an instance, run a scheduler,
and inspect the resulting trajectory.
What the paper emphasizes
The benchmark is not just a dataset. Its calibration model,
observability settings, and evaluation loop are part of the
experimental design.