Commit bc94f75e authored by Brewer, Wes's avatar Brewer, Wes
Browse files

Aggregate system power for multi-partition systems. Restructure setonix...

Aggregate system power for multi-partition systems. Restructure setonix configs to config/setonix/part-*
parent 5888713d
Loading
Loading
Loading
Loading
+2 −2
Original line number Diff line number Diff line
@@ -56,7 +56,7 @@ given instead of the parquet files for more quickly running subsequent simulatio

Multi-partition systems are supported by running the `multi-part-sim.py` script, where a list of configurations can be specified using the `-x` flag as follows:

    python multi-part-sim.py -x setonix-cpu setonix-gpu
    python multi-part-sim.py -x setonix/part-cpu setonix/part-gpu

This will simulate synthetic workloads on two partitions as defined in `config/setonix-cpu` and `config/setonix-gpu`. To replay telemetry workloads from another system, e.g., Marconi100's PM100 dataset, first create a .npz snapshot of the telemetry data, e.g., 

@@ -64,7 +64,7 @@ This will simulate synthetic workloads on two partitions as defined in `config/s

This will dump a .npz file with a randomized name, e.g. ac23db.npz. Let's rename this file to pm100.npz for clarity. Note: can control-C when the simulation starts. Now, this pm100.npz file can be used with `multi-part-sim.py` as follows:

    python multi-part-sim.py -x setonix-cpu setonix-gpu -f pm100.npz --reschedule --scale 192
    python multi-part-sim.py -x setonix/part-cpu setonix/part-gpu -f pm100.npz --reschedule --scale 192

The `--reschedule` flag will use the internal scheduler to determine what nodes to schedule for each job, and the `--scale` flag will specify the maximum number of nodes for each job (generally set this to the max number of nodes of the smallest partition). 

Loading