ref(chat): Replace reply-generator slots with AgentRunner by dcramer · Pull Request #751 · getsentry/junior

dcramer · 2026-07-03T22:56:55Z

The chat runtime now passes agent execution through a small AgentRunner interface instead of handing around the full reply-generator function signature. Slack replies, Slack resumes, local turns, durable dispatch, continuation, handlers, and CLI composition roots all receive an explicit runner dependency.

This pulls the second commit from #748 onto main after #750. The conflict resolution keeps #750's minimal three-status AgentRunOutcome contract, preserves sandbox trace propagation precedence in createAgentRunner, and removes silent production fallback paths so queue and worker code use the runner wired by their composition boundary.

Refs #746
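For readers without the diff open, here is a minimal sketch of the seam described above. It is an illustration under assumptions, not the code in this PR: the request shape, the `completed` and `failed` status names, and `createReplyExecutor` are hypothetical. Only `AgentRunner`, `AgentRunOutcome`, the `suspended` status, and the "no silent fallback" rule come from the description.

```typescript
// Hypothetical request shape; the real context carries many more fields.
interface AgentRunRequest {
  messageText: string;
}

// Three statuses, per the description. Only "suspended" is named in the PR text;
// "completed" and "failed" are assumed names for the other two.
type AgentRunOutcome =
  | { status: "completed"; text: string }
  | { status: "suspended"; resumeVersion: number }
  | { status: "failed"; error: string };

// The small consumer-owned interface: one method, no reply-generator signature.
interface AgentRunner {
  run(request: AgentRunRequest): Promise<AgentRunOutcome>;
}

// The runner is a required dependency, so there is nothing to fall back to.
interface ReplyExecutorDeps {
  agentRunner: AgentRunner;
}

// Hypothetical consumer that only knows about the AgentRunner interface.
function createReplyExecutor(deps: ReplyExecutorDeps) {
  return {
    async reply(messageText: string): Promise<string> {
      const outcome = await deps.agentRunner.run({ messageText });
      switch (outcome.status) {
        case "completed":
          return outcome.text;
        case "suspended":
          return `suspended at resume version ${outcome.resumeVersion}`;
        case "failed":
          return `failed: ${outcome.error}`;
      }
    },
  };
}

// Scripted runner standing in for the production one.
const scriptedRunner: AgentRunner = {
  async run(request: AgentRunRequest): Promise<AgentRunOutcome> {
    return { status: "completed", text: `echo: ${request.messageText}` };
  },
};

const executor = createReplyExecutor({ agentRunner: scriptedRunner });
```

Because `agentRunner` is not optional in the sketch, a composition root that forgets to wire it fails at compile time rather than quietly running a default.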

Introduce a small consumer-owned AgentRunner interface for the runtime paths that previously accepted reply-generator injection slots. Wire it through the Slack reply executor, Slack resume, local runner, agent dispatch, continuation, handlers, and CLI composition roots.

Keep the minimal AgentRunOutcome contract from #750 while preserving sandbox trace propagation precedence and requiring concrete runner dependencies instead of silent production fallbacks.

Cherry-picked from 3a826ac and resolved onto #750.

Refs #746
Co-Authored-By: GPT-5.5 Codex <noreply@openai.com>
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Co-Authored-By: GPT-5 Codex <noreply@openai.com>

vercel · 2026-07-03T22:57:01Z

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| junior-docs | Ready | Preview, Comment | Jul 4, 2026 12:39am |

Root typecheck previously ran a hand-maintained package list that excluded @sentry/junior-evals, and CI never ran typecheck at all, so type breaks in the eval harness landed silently. Switch the root script to a recursive pnpm run so any package that defines typecheck is covered automatically, give junior-evals a typecheck script, and add the step to the CI workflow.

Refs #746
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

The eval harness still injected the removed reply-generator slots: replyExecutor.generateAssistantReply was silently ignored (evals would call the real Pi model instead of scripted replies), and the Slack continuation and scheduled-dispatch paths crashed on an undefined agentRunner. Provide the mock as replyExecutor.agentRunner and thread one AgentRunner through processEvents to both call sites.

The scripted mock branches also predate #750: they returned raw AssistantReply objects where the executor now expects an AgentRunOutcome, so wrap them with completedAgentRun.

Refs #746
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
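The wrapping fix in this commit can be sketched roughly as follows. The type shapes, `createScriptedRunner`, and the `failed` status name are hypothetical, and `completedAgentRun` here is a local stand-in named after the helper in the commit message; its real signature may differ.

```typescript
// Assumed shapes for illustration only.
interface AssistantReply {
  text: string;
}

type AgentRunOutcome =
  | { status: "completed"; reply: AssistantReply }
  | { status: "suspended"; resumeVersion: number }
  | { status: "failed"; error: string };

interface AgentRunner {
  run(request: { messageText: string }): Promise<AgentRunOutcome>;
}

// Local stand-in for the helper named in the commit message.
function completedAgentRun(reply: AssistantReply): AgentRunOutcome {
  return { status: "completed", reply };
}

// Scripted mock: hands out queued replies in order, already wrapped as outcomes,
// so the executor never sees a raw AssistantReply.
function createScriptedRunner(script: AssistantReply[]): AgentRunner {
  const queue = [...script];
  return {
    async run(): Promise<AgentRunOutcome> {
      const next = queue.shift();
      if (!next) {
        return { status: "failed", error: "script exhausted" };
      }
      return completedAgentRun(next);
    },
  };
}
```

A harness would pass the result as `replyExecutor.agentRunner`, so the scripted replies are used instead of the real model.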

The production runner was constructed independently in app.ts, production.ts, and the services fallback, so the three sites could silently diverge. Build it once in createApp, inject it through runtimeServiceOverrides.replyExecutor.agentRunner, and make createProductionConversationWorkOptions require the runner instead of silently constructing its own.

Also remove the parallel tracePropagation channel from the agent-dispatch handler and runner deps: the runner wrapper owns that default, and the explicit context write made the wrapper's option dead code on the dispatch path.

Refs #746
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
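A rough sketch of "the runner wrapper owns that default", under stated assumptions: the request shape, the string-typed `tracePropagation` field, the `defaultTracePropagation` option, and `createDispatchHandler` are all hypothetical. `createAgentRunner` is a stand-in named after the real function, whose signature is not shown in this text. The precedence used here (a value already on the request wins over the default) is also an assumption.

```typescript
// Hypothetical request shape; the real context has many more fields.
interface AgentRunRequest {
  messageText: string;
  tracePropagation?: string;
}

interface AgentRunner {
  run(request: AgentRunRequest): Promise<void>;
}

// Stand-in wrapper: the only place that knows the trace propagation default.
function createAgentRunner(
  base: AgentRunner,
  options: { defaultTracePropagation: string },
): AgentRunner {
  return {
    run(request: AgentRunRequest): Promise<void> {
      // Assumed precedence: a value already on the request wins over the default.
      const tracePropagation =
        request.tracePropagation ?? options.defaultTracePropagation;
      return base.run({ ...request, tracePropagation });
    },
  };
}

// Dispatch handler: forwards the request and never writes trace settings itself.
function createDispatchHandler(deps: { agentRunner: AgentRunner }) {
  return (messageText: string): Promise<void> =>
    deps.agentRunner.run({ messageText });
}

// Recording base runner so the effect of the wrapper is observable.
const seen: AgentRunRequest[] = [];
const recordingRunner: AgentRunner = {
  async run(request: AgentRunRequest): Promise<void> {
    seen.push(request);
  },
};
```

With a single owner for the default, the handler has no second channel that could override or shadow the wrapper's option.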

Add a shared tests fixture exporting the respond-backed harness runner and a never-run guard runner, replacing four hand-rolled stubs and two copy-pasted inline runners in the OAuth callback harnesses. The harnesses now accept an injected agentRunner, so the Slack callback integration tests inject their reply mock at the runner seam instead of module-mocking @/chat/respond.

Also extract the duplicated local-turn setup in the chat CLI into one helper, and skip the context-rewriting wrapper in createAgentRunner when no trace propagation default is configured.

Refs #746
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
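The two fixtures might look something like the sketch below. All names (`neverRunAgentRunner`, `createRespondBackedRunner`, `createCallbackHarness`) and the trimmed outcome shape are hypothetical, and it assumes "respond-backed" means the runner delegates to a respond function supplied by the test.

```typescript
// Trimmed, hypothetical shapes for illustration.
interface AgentRunRequest {
  messageText: string;
}

interface AgentRunOutcome {
  status: "completed";
  text: string;
}

interface AgentRunner {
  run(request: AgentRunRequest): Promise<AgentRunOutcome>;
}

// Guard runner for tests that must never reach the agent: any call fails loudly.
const neverRunAgentRunner: AgentRunner = {
  run(): Promise<AgentRunOutcome> {
    return Promise.reject(
      new Error("agentRunner.run was called in a test that should not run the agent"),
    );
  },
};

// Harness runner that delegates to a respond function supplied by the test.
function createRespondBackedRunner(
  respond: (messageText: string) => string,
): AgentRunner {
  return {
    async run(request: AgentRunRequest): Promise<AgentRunOutcome> {
      return { status: "completed", text: respond(request.messageText) };
    },
  };
}

// A harness takes the runner as an argument, so no module mocking is needed.
function createCallbackHarness(deps: { agentRunner: AgentRunner }) {
  return {
    handleCallback(messageText: string): Promise<AgentRunOutcome> {
      return deps.agentRunner.run({ messageText });
    },
  };
}
```

Injecting at the runner seam keeps each test's mock local to that test, and the guard runner turns an accidental agent call into an explicit failure.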

createProductionConversationWorkOptions required an agentRunner but only used it for the resume path, while the reply runtime still resolved its runner from the services overrides; a caller supplying mismatched values would silently run replies and resumes on different runners. Fold the explicit runner into the services passed to createSlackRuntime so one channel owns it.

Refs #746
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

`ReplyRequestContext` was a flat ~35-field bag where nearly every field was optional, so the type could not express which combinations were valid and call sites gave no hint of a field's role. The request is now grouped into `input`, `routing`, `policy`, `state`, `observers`, and `durability` sub-objects, and `AgentRunner.run` takes the single grouped request (`messageText` now lives in `input`).

No behavior change and no field optionality changes: every flat field maps 1:1 into exactly one group, runtime destination/requester invariant checks are unchanged, and the outcome contract from #750/#751 (`suspended` status, deferred pause handlers, `resumeVersion`) is untouched. The executor body still operates on the historical flat shape via a private `flattenReplyRequestContext` step; consuming the groups directly is deferred to the #746 Phase 5 decomposition. This ports the remaining regrouping commit from #748 onto the reworked outcome model.

Review order: start with the group interfaces and flatten step in `src/chat/respond.ts`, then the six call-site regroupings (`runtime/reply-executor.ts`, `runtime/slack-resume.ts`, `agent-dispatch/runner.ts`, `local/runner.ts`, `runtime/agent-continue-runner.ts`, the oauth handlers).

The 36 test-file changes are mechanical regroupings of existing assertions (no tests added, removed, or weakened), with a shared `flattenReplyRequestForTest` fixture replacing per-file copies of the flatten shim.

Refs #746

🤖 Generated with [Claude Code](https://claude.com/claude-code)

---------

Co-authored-by: GPT-5.5 Codex <noreply@openai.com>
Co-authored-by: Claude Fable 5 <noreply@anthropic.com>
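To make the regrouping concrete, here is a minimal sketch of a grouped request and a flatten step. Only the six group names, `messageText` living in `input`, and the name `flattenReplyRequestContext` come from the description. Every other field below is a placeholder, and the real types in `src/chat/respond.ts` will differ.

```typescript
// Grouped request. Apart from `input.messageText`, all fields are placeholders.
interface AgentRunRequest {
  input: { messageText: string };
  routing: { channelId?: string };
  policy: { maxTurns?: number };
  state: { conversationId?: string };
  observers: { onStatus?: (status: string) => void };
  durability: { checkpointKey?: string };
}

// Historical flat shape: the union of every group's fields in one bag.
type FlatReplyRequest = AgentRunRequest["input"] &
  AgentRunRequest["routing"] &
  AgentRunRequest["policy"] &
  AgentRunRequest["state"] &
  AgentRunRequest["observers"] &
  AgentRunRequest["durability"];

// Each field belongs to exactly one group, so spreading the groups cannot
// overwrite a value and the flat result is a 1:1 image of the grouped request.
function flattenReplyRequestContext(request: AgentRunRequest): FlatReplyRequest {
  return {
    ...request.input,
    ...request.routing,
    ...request.policy,
    ...request.state,
    ...request.observers,
    ...request.durability,
  };
}

const flat = flattenReplyRequestContext({
  input: { messageText: "hello" },
  routing: { channelId: "C1" },
  policy: {},
  state: {},
  observers: {},
  durability: {},
});
```

Call sites build the grouped form, which documents each field's role, while code that still expects the flat shape keeps working through the flatten step.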

vercel Bot deployed to Preview – junior-docs July 3, 2026 22:58 View deployment

dcramer and others added 5 commits July 3, 2026 16:31

vercel Bot deployed to Preview – junior-docs July 4, 2026 00:39 View deployment

dcramer marked this pull request as ready for review July 4, 2026 00:42

github-actions Bot added the risk: high label (PR risk score: high) Jul 4, 2026

dcramer merged commit 1b50e1a into main Jul 4, 2026
20 checks passed

dcramer deleted the pull-748-agent-runner-seam branch July 4, 2026 01:31

dcramer mentioned this pull request Jul 4, 2026

ref(chat): Split agent-run request context into role-scoped groups #752

Merged

dcramer merged 6 commits into main from pull-748-agent-runner-seam
