[codex] Fix train renderer pool cache by xeophon · Pull Request #1889 · PrimeIntellect-ai/verifiers

xeophon · 2026-06-27T13:24:20Z

Summary

Cache v1 TrainClient renderer pools by effective renderer model and serialized chat_template_kwargs.
Add a regression test proving identical kwargs reuse a pool while changed or absent kwargs get distinct pools.

Root Cause

Commit a8036b441 / PR #1876 threaded per-request chat_template_kwargs into create_renderer_pool, but TrainClient still cached a single self._pool. The first request could therefore lock in one renderer pool and leak its chat-template settings into later requests with different kwargs or no kwargs.
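The failure mode can be illustrated with a minimal sketch. This is not the actual TrainClient code: `SinglePoolClient` and the stand-in `create_renderer_pool` below are hypothetical simplifications, and the real signatures are not shown in this PR.

```python
from typing import Any, Dict, Optional


def create_renderer_pool(
    model: str, chat_template_kwargs: Optional[Dict[str, Any]] = None
) -> Dict[str, Any]:
    # Hypothetical stand-in: returns a plain dict recording how the pool was built.
    return {"model": model, "chat_template_kwargs": chat_template_kwargs}


class SinglePoolClient:
    """Simplified model of the pre-fix behavior: one cached pool for all requests."""

    def __init__(self, model: str) -> None:
        self.model = model
        self._pool: Optional[Dict[str, Any]] = None

    def renderer_pool(
        self, chat_template_kwargs: Optional[Dict[str, Any]] = None
    ) -> Dict[str, Any]:
        if self._pool is None:
            # Only the first request's kwargs ever reach the pool.
            self._pool = create_renderer_pool(self.model, chat_template_kwargs)
        return self._pool


client = SinglePoolClient("example-model")
first = client.renderer_pool({"enable_thinking": True})
second = client.renderer_pool({"enable_thinking": False})
third = client.renderer_pool()  # no kwargs at all
```

In this sketch all three calls return the pool built for the first request, so the second and third requests silently inherit `enable_thinking=True`.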

Impact

Training/eval paths using the v1 train renderer client can now safely mix requests with different chat-template options, such as Qwen thinking settings, without cross-request pool contamination.
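A keyed cache of the kind this PR describes might look roughly like the sketch below. The class name `KeyedPoolClient`, the stand-in `create_renderer_pool`, and the use of `json.dumps(..., sort_keys=True)` as the serialization are assumptions for illustration; the PR does not state how the kwargs are serialized.

```python
import json
from typing import Any, Dict, Optional, Tuple


def create_renderer_pool(
    model: str, chat_template_kwargs: Optional[Dict[str, Any]] = None
) -> Dict[str, Any]:
    # Hypothetical stand-in: returns a plain dict recording how the pool was built.
    return {"model": model, "chat_template_kwargs": chat_template_kwargs}


class KeyedPoolClient:
    """Simplified model of the post-fix behavior: one pool per (model, kwargs) key."""

    def __init__(self, model: str) -> None:
        self.model = model
        self._pools: Dict[Tuple[str, str], Dict[str, Any]] = {}

    def renderer_pool(
        self, chat_template_kwargs: Optional[Dict[str, Any]] = None
    ) -> Dict[str, Any]:
        # Assumed serialization: sorted-key JSON, so key order does not matter
        # and absent kwargs ("null") differ from an empty dict ("{}").
        key = (self.model, json.dumps(chat_template_kwargs, sort_keys=True))
        if key not in self._pools:
            self._pools[key] = create_renderer_pool(self.model, chat_template_kwargs)
        return self._pools[key]


client = KeyedPoolClient("example-model")
a = client.renderer_pool({"enable_thinking": True})
b = client.renderer_pool({"enable_thinking": True})   # same key as a
c = client.renderer_pool({"enable_thinking": False})  # changed kwargs
d = client.renderer_pool()                            # absent kwargs
```

Here `a` and `b` are the same pool, while `c` and `d` each get their own, which mirrors the three cases the regression test is described as covering.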

Validation

uv run pytest tests/v1/test_train_client.py
uv run ruff check --fix verifiers/v1/clients/train.py tests/v1/test_train_client.py
git diff --check

Note: repo-wide uv run ruff check --fix is currently blocked by an unrelated existing Python 3.11 f-string parse issue in verifiers/v1/cli/dashboard/eval.py:194. The pre-commit/pre-push hooks also reintroduced unrelated uv.lock exclude-newer churn, so the commit and push used the already-passing targeted checks while keeping uv.lock out of the PR.

Note

Fix `TrainClient._renderer_pool` to cache pools per model and chat template kwargs

Replaces the single shared _pool attribute in train.py with a _pools dict keyed by (renderer_model, serialized_chat_template_kwargs).
Creates a new renderer pool when the key is absent and reuses the existing pool when the key matches, so each unique combination of model and chat template kwargs gets its own pool.
Adds a unit test in test_train_client.py verifying that identical kwargs reuse the same pool, differing kwargs (e.g. enable_thinking) produce a new pool, and omitting kwargs produces a third pool.
Behavioral Change: callers that previously shared one pool across all invocations will now get distinct pools when chat_template_kwargs differ.

Macroscope summarized e8dce84.

Fix train renderer pool cache

e8dce84

xeophon added the codex label Jun 27, 2026

xeophon wants to merge 1 commit into main from codex/daily-bug-scan-20260627-train-renderer-pools
