fix: batch-eval docs + preserve config-bundle placeholders on AB-test promote by jariy17 · Pull Request #1638 · aws/agentcore-cli

jariy17 · 2026-06-24T20:06:52Z

Two independent bug-bash fixes, one commit each.

1. `fix(docs)`: remove non-existent `Builtin.Completeness` evaluator

The batch-evaluation docs listed Builtin.Completeness and used it in a CLI example, but the API rejects it (Builtin evaluator Builtin.Completeness does not exist). The valid builtin list lives in run-eval.ts and never included it.

Dropped the row from the evaluator table in docs/batch-evaluation.md
Switched the dataset example in docs/commands.md to Builtin.Correctness

2. `fix(ab-test)`: preserve portable component placeholders on promote

config-bundle promote fetched the winning version's components from the service — which keys them by resolved (account/region-specific) runtime ARN — and wrote them straight into agentcore.json, replacing the committed {{runtime:<name>}} placeholders with hardcoded ARNs and breaking cross-account/region portability.

Before: "{{runtime:cbagent}}" → after promote: "arn:aws:bedrock-agentcore:us-west-2:...:runtime/cbbugbash_cbagent-N5owhv3MRl"

Fix: restorePlaceholderKeys() inverts the local bundle's placeholder→ARN map (using the same resolver deploy uses) and rewrites each incoming ARN key back to the placeholder before adopting. ARNs with no matching local placeholder pass through unchanged.

Testing

promote.test.ts: added a case asserting an ARN-keyed service response is rewritten back to {{runtime:r}} and the ARN key is absent. 13/13 pass.
typecheck + prettier + eslint pass (pre-commit hooks).

Builtin.Completeness is listed in the batch-evaluation docs and used in a CLI example, but the API rejects it ('does not exist'). The valid builtin list lives in run-eval.ts. Drop it from the evaluator table and switch the dataset example to Builtin.Correctness.

config-bundle promote fetched the winning version's components from the service, which keys them by resolved (account/region-specific) runtime ARN, and wrote them straight into agentcore.json — replacing the committed {{runtime:<name>}} placeholders with hardcoded ARNs and breaking cross- account/region portability of the config. Remap the service-returned ARN keys back to the bundle's existing portable placeholders (inverting the same resolver deploy uses) before adopting them.

agentcore-devx-automation · 2026-06-24T20:07:51Z

Claude Security Review: no high-confidence findings. (run)

agentcore-cli-automation

LGTM — two well-scoped, independent fixes with appropriate test coverage.

Docs fix: Verified Builtin.Completeness is not in BUILTIN_EVALUATOR_LEVELS in src/cli/operations/eval/run-eval.ts (only Correctness, Faithfulness, Helpfulness, ResponseRelevance, Conciseness, Coherence, InstructionFollowing, Refusal, GoalSuccessRate, ToolSelectionAccuracy are listed). Removing the row and swapping the example to Builtin.Correctness is correct.

Promote placeholder fix: restorePlaceholderKeys correctly inverts the local placeholder→ARN map by reusing the same resolveComponentKeyForJsonPath resolver the deploy/recommendation paths use, so behavior stays consistent. Skipping local arn:-prefixed keys and the arn !== key guard (unresolved placeholders) are both right. Pass-through for unmatched service ARNs is a reasonable fallback.

Test quality: Good — the new test only mocks at true I/O boundaries (ConfigIO, getConfigurationBundleVersion) and exercises the real resolver. Not excessively coupled to implementation.

No telemetry needed since this is a bug fix on a path that doesn't currently emit telemetry.

github-actions · 2026-06-24T20:10:06Z

Package Tarball

aws-agentcore-0.20.2.tgz

How to install

gh release download pr-1638-tarball --repo aws/agentcore-cli --pattern "*.tgz" --dir /tmp/pr-tarball
npm install -g /tmp/pr-tarball/aws-agentcore-0.20.2.tgz

github-actions · 2026-06-24T20:12:09Z

Coverage Report

Status	Category	Percentage	Covered / Total
🔵	Lines	37.2%	13612 / 36586
🔵	Statements	36.47%	14473 / 39678
🔵	Functions	31.82%	2335 / 7337
🔵	Branches	31.14%	9013 / 28942

Generated in workflow #3822 for commit 2deb3a6 by the Vitest Coverage Report Action

jariy17 added 2 commits June 24, 2026 20:02

jariy17 requested a review from a team June 24, 2026 20:06

github-actions Bot added the size/s PR size: S label Jun 24, 2026

jariy17 temporarily deployed to e2e-testing June 24, 2026 20:07 — with GitHub Actions Inactive

github-actions Bot added the agentcore-harness-reviewing AgentCore Harness review in progress label Jun 24, 2026

agentcore-devx-automation Bot added the claude-security-reviewing Claude Code /security-review in progress label Jun 24, 2026

agentcore-devx-automation Bot removed the claude-security-reviewing Claude Code /security-review in progress label Jun 24, 2026

agentcore-cli-automation approved these changes Jun 24, 2026

View reviewed changes

github-actions Bot removed the agentcore-harness-reviewing AgentCore Harness review in progress label Jun 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: batch-eval docs + preserve config-bundle placeholders on AB-test promote#1638

fix: batch-eval docs + preserve config-bundle placeholders on AB-test promote#1638
jariy17 wants to merge 2 commits into
mainfrom
fix/batch-eval-docs-and-abtest-placeholder

jariy17 commented Jun 24, 2026

Uh oh!

agentcore-devx-automation Bot commented Jun 24, 2026

Uh oh!

agentcore-cli-automation left a comment

Uh oh!

github-actions Bot commented Jun 24, 2026

Uh oh!

github-actions Bot commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

jariy17 commented Jun 24, 2026

1. fix(docs): remove non-existent Builtin.Completeness evaluator

2. fix(ab-test): preserve portable component placeholders on promote

Testing

Uh oh!

agentcore-devx-automation Bot commented Jun 24, 2026

Uh oh!

agentcore-cli-automation left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 24, 2026

Package Tarball

How to install

Uh oh!

github-actions Bot commented Jun 24, 2026

Coverage Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1. `fix(docs)`: remove non-existent `Builtin.Completeness` evaluator

2. `fix(ab-test)`: preserve portable component placeholders on promote