Proxy cap, K=5 stats — regex-log 5/5; openssl's misses are not budget-related (0.80)

What was run

Ran the two signal tasks (regex-log and openssl-selfsigned-cert) × 5 attempts each (10 trials) through the reasoning-cap proxy (port 8021, cap ~4000 tokens) with qwen3.6-35b-a3b, thinking ON.

Results

Mean 0.80: regex-log 5/5, openssl-selfsigned-cert 3/5.
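
As a quick arithmetic check, the figures above can be recomputed from the per-trial PASS/FAIL outcomes. This is a minimal sketch that assumes reward is binary (1 for PASS, 0 for FAIL), which is consistent with 8 of 10 passes giving 0.80; the variable and function names are illustrative only.

```python
# Per-trial outcomes copied from the task tables in this report (1 = PASS, 0 = FAIL).
# Assumption: reward is binary per trial.
results = {
    "openssl-selfsigned-cert": [0, 1, 1, 0, 1],
    "regex-log": [1, 1, 1, 1, 1],
}


def pass_rate(outcomes):
    """Fraction of trials that passed."""
    return sum(outcomes) / len(outcomes)


per_task = {task: pass_rate(outcomes) for task, outcomes in results.items()}

# Mean reward over all trials, not the mean of per-task rates
# (the two coincide here because both tasks have 5 trials).
all_trials = [r for outcomes in results.values() for r in outcomes]
mean_reward = pass_rate(all_trials)

for task, outcomes in results.items():
    print(f"{task}: {sum(outcomes)}/{len(outcomes)} ({per_task[task]:.2f})")
print(f"mean reward: {mean_reward:.2f}")
```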

Interpretation

The cap reliably fixes the reasoning loop — regex-log went from loops-and-fails (uncapped) to 5/5.
openssl's 60% is not a budget problem: each whole trajectory is only ~1.7k–2.8k output tokens, so no single turn's reasoning can reach the 4000 cap. The 3/5 is the model's baseline on the task.
Lesson learned: judge the cap by pass rate, not by trajectory token totals. Totals sum over all turns and look scary even when each turn is bounded.
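
The token argument can be checked directly against the per-trial output totals. The sketch below assumes that reasoning tokens are counted inside the reported output tokens and treats the cap as exactly 4000; under those assumptions, a turn can only reach the cap if the whole trajectory emitted at least that many tokens. The helper name `cap_can_bind` is hypothetical.

```python
CAP = 4000  # approximate per-turn reasoning cap used in this run

# Whole-trajectory output tokens per trial, copied from the task tables in this report.
out_tokens = {
    "openssl-selfsigned-cert": [2843, 2808, 1827, 2821, 1687],
    "regex-log": [32115, 49511, 21487, 37832, 11774],
}


def cap_can_bind(trajectory_total, cap=CAP):
    """Necessary condition only: a single turn's output never exceeds the
    trajectory total, so if the total is below the cap, no turn reached it."""
    return trajectory_total >= cap


for task, totals in out_tokens.items():
    binding = sum(cap_can_bind(t) for t in totals)
    print(f"{task}: max total {max(totals)}, cap could bind in {binding}/{len(totals)} trials")
```

A trajectory total above the cap does not show that any turn hit it. It only means the totals cannot rule it out, which is why totals alone are a poor signal for judging the cap.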

Next steps

More samples for tighter confidence (K=15), and try the dense 27B model through the same proxy.
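
To give a rough sense of how much K=15 would tighten the estimate, here is a sketch using the Wilson score interval. The choice of Wilson is an assumption of this example, not a method used in the run, and the 9/15 case is purely hypothetical: it keeps the same 60% observed rate at the larger sample size and is not a result.

```python
import math


def wilson_interval(passes, trials, z=1.96):
    """Wilson score interval for a binomial pass rate (z=1.96 is roughly 95%)."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    p = passes / trials
    denom = 1 + z**2 / trials
    center = (p + z**2 / (2 * trials)) / denom
    half = z * math.sqrt(p * (1 - p) / trials + z**2 / (4 * trials**2)) / denom
    # Clamp to [0, 1] to guard against floating-point overshoot.
    return max(0.0, center - half), min(1.0, center + half)


cases = [
    ("openssl 3/5 (observed)", 3, 5),
    ("regex-log 5/5 (observed)", 5, 5),
    ("openssl 9/15 (hypothetical, same rate)", 9, 15),
]
for label, passes, trials in cases:
    lo, hi = wilson_interval(passes, trials)
    print(f"{label}: [{lo:.2f}, {hi:.2f}] width {hi - lo:.2f}")
```

At K=5 the interval for a 60% pass rate spans most of the unit range, so the openssl number is only a rough baseline until more samples are in.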

Run details

model: llama-local/qwen3.6-35b-a3b
agent: harnesses.minimal_pi:MinimalPi
thinking: on
reasoning budget: proxy · cap 4000 (see journal for the authoritative mechanism)
maxTokens / contextWindow: 32768 / 131072
agent timeout: ×2.0
trials: 10 of 10 (8 pass · 2 fail)
mean reward: 0.80
tokens (job total): 4,088,115 in / 164,705 out
started / finished: 2026-07-02T18:16 / 2026-07-02T18:52
wall clock: 36m12s

Tasks

openssl-selfsigned-cert: 3/5 passed

#	result	total time	agent time	tokens in/out
1	FAIL	1m10s	21s	39851/2843
2	PASS	1m05s	17s	50316/2808
3	PASS	1m00s	12s	25763/1827
4	FAIL	1m11s	21s	52497/2821
5	PASS	59s	12s	21653/1687

regex-log: 5/5 passed

#	result	total time	agent time	tokens in/out
1	PASS	7m57s	6m58s	889355/32115
2	PASS	7m33s	6m36s	739562/49511
3	PASS	4m36s	3m36s	491670/21487
4	PASS	6m20s	5m24s	1589957/37832
5	PASS	4m17s	3m21s	187491/11774

smoke__qwen3.6-35b-a3b__20260702-181612
