Skip to content

Add runtime provenance to benchmark metadata#7

Open
Genmin wants to merge 1 commit into
PolicyEngine:mainfrom
Genmin:fix/eval-runtime-provenance
Open

Add runtime provenance to benchmark metadata#7
Genmin wants to merge 1 commit into
PolicyEngine:mainfrom
Genmin:fix/eval-runtime-provenance

Conversation

@Genmin
Copy link
Copy Markdown

@Genmin Genmin commented May 2, 2026

Summary

  • add a shared runtime provenance helper for Python/package versions and committed dependency lockfile hashes
  • include runtime provenance in reference-output metadata and scenario-manifest metadata
  • include generated timestamp and runtime provenance in no-tools resume metadata

Fixes #6

Validation

  • uv run --extra dev pytest -q (189 passed)
  • uv run --extra dev ruff check policybench/provenance.py policybench/eval_no_tools.py policybench/cli.py tests/test_eval_no_tools.py tests/test_provenance.py
  • uv run --extra dev ruff format --check policybench/provenance.py policybench/eval_no_tools.py policybench/cli.py tests/test_eval_no_tools.py tests/test_provenance.py
  • git diff --check

@vercel
Copy link
Copy Markdown

vercel Bot commented May 2, 2026

@Genmin is attempting to deploy a commit to the PolicyEngine Team on Vercel.

A member of the Team first needs to authorize it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Determinism and reproducibility hardening

1 participant