Loading
Fix BlueWaters dataloader: PBS job ID parsing, walltime field, remove filter
- Parse PBS job ID positionally from semicolon-delimited log header
(field 3: "MM/DD/YYYY HH:MM:SS;E;6335144.bw;key=value...") instead of
regex searching for jobid=/job_id=/Job_Id= which never appear in Torque
accounting logs. All GT jobs previously got sequential IDs (1, 2, 3...)
from Job._get_next_id(); now they get the real PBS IDs (e.g. 6335144.bw).
- Change wall_time regex from resources_used.walltime (actual) to
Resource_List.walltime (requested limit), matching what REDI extracts
for the time_limit field. Fixes time_limit MAE in validation.
- Remove filter_str / eval(filter_str) traffic filter support. The
filter: "traffic > 1e8" in experiments/bluewaters.yaml was accidentally
keeping the same 18 jobs as the full parse, but obscured the root cause.
- experiments/bluewaters.yaml: remove filter, fix start date to ISO format
(20170328 was parsed as Unix timestamp → 1970-08-22), disable
simulate_network (torus topology crashes on unconstrained job set).
Result: BlueWaters validation now uses id_aligned comparison (9 shared
jobs, corr=1.000, MAE=0). Remaining 9/18 GT-only jobs reflect the
date-window mismatch (RAPS logs by completion date, REDI filters by start
date) — not a bug.
Co-Authored-By:
Claude Sonnet 4.6 <noreply@anthropic.com>