FetchResult¶

FetchResult `dataclass` ¶

FetchResult(bench_df: DataFrame, trace_df: DataFrame, opcounts: dict[str, dict[str, float]], meta: dict[str, Any], metrics: list[str] = (lambda: ['runtime'])(), output_flags: dict[str, bool] = (lambda: {'estimator_inputs': True, 'merged_parquet': True, 'trace_parquet': True})())

In-memory result of a pipeline run.

Holds the parsed bench DataFrame, the derived trace DataFrame, and the opcount mapping needed to materialise the artifact bundle on disk (runtimes.csv, opcounts.json, bench_data.parquet, trace.parquet, meta.json).

The output_flags dict gates which artifacts write() produces; meta.json is always written. metrics selects which per-metric CSVs the estimator-inputs bundle includes (runtime → runtimes.csv, mgas_s → mgas_s.csv).

default_out_dirname ¶

default_out_dirname() -> str

Return {earliest_run_ts}_{latest_run_ts} formatted YYYY-MM-DDTHH-MM-SSZ.

Source code in src/benchmarkoor_fetch/result.py

def default_out_dirname(self) -> str:
    """Return `{earliest_run_ts}_{latest_run_ts}` formatted YYYY-MM-DDTHH-MM-SSZ."""
    window = self.meta.get("data_window") or {}
    start = _format_ts(window.get("start"))
    end = _format_ts(window.get("end"))
    return f"{start}_{end}"

write ¶

write(out_dir: Path) -> None

Write the artifact bundle to out_dir (created if missing).

Source code in src/benchmarkoor_fetch/result.py

def write(self, out_dir: Path) -> None:
    """Write the artifact bundle to `out_dir` (created if missing)."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    flags = self.output_flags
    row_counts: dict[str, int] = {"bench_data": int(len(self.bench_df))}

    if flags.get("estimator_inputs", True):
        if "runtime" in self.metrics:
            runtimes = _runtimes_frame(self.bench_df)
            runtimes.to_csv(out_dir / "runtimes.csv", index=False)
            row_counts["runtimes"] = int(len(runtimes))
        if "mgas_s" in self.metrics:
            mgas_s = _mgas_s_frame(self.bench_df)
            mgas_s.to_csv(out_dir / "mgas_s.csv", index=False)
            row_counts["mgas_s"] = int(len(mgas_s))
        (out_dir / "opcounts.json").write_text(
            json.dumps(self.opcounts, indent=2, sort_keys=True)
        )
        row_counts["opcounts"] = int(len(self.opcounts))

    if flags.get("merged_parquet", True):
        self.bench_df.to_parquet(out_dir / "bench_data.parquet", index=False)

    if flags.get("trace_parquet", True):
        self.trace_df.to_parquet(out_dir / "trace.parquet", index=False)
        row_counts["trace"] = int(len(self.trace_df))

    meta = dict(self.meta)
    meta["row_counts"] = row_counts
    (out_dir / "meta.json").write_text(
        json.dumps(meta, indent=2, sort_keys=True, default=str)
    )

FetchResult¶

FetchResult dataclass ¶

default_out_dirname ¶

write ¶

FetchResult `dataclass` ¶