High-Stakes Generality · P-vs-NP Verifier · 2026-06-08

Cheap to check. So someone tried to spoof it.

A reward-blind certificate decides whether a controller is signature-controlled or a hidden fixed-attractor, and checking it costs fewer operations than finding the policy did. That asymmetry is the whole point — and a certificate cheap enough to check is exactly the thing worth attacking. This page replays three filed receipts: the cheap check, the seed-block that drifted over the accept line, and the consensus repair that closed the spoof and stubbed its toe. A separate Lean core now machine-checks the certificate's soundness and lossiness, and a machine-checked Karp reduction anchors the imported decoding-hardness to 3SAT — reduction correctness only, with NP-hardness and Cook-Levin still imported. Those are part of a ten-example public Lean method repo spanning finite-field algebra, real analysis, geometric optics, vector calculus / gauge topology, combinatorics, a finite audit game, tropical / piecewise-linear algorithmic approximation, monotone-circuit expressivity, and the seven-instance find/check ledger (max-flow/min-cut, König, 2-SAT, Pratt primality, shortest-path, syndrome verifier, ReLU gate-count) — method context, not extra P-vs-NP evidence; the find-side of every certifying instance stays imported.

v6 op-count certificate · BOUNDED POSITIVE Lean core · soundness/lossiness (1 of 10 examples) v0 mesa bridge · FALSIFIED (spoofed) v1 consensus repair · NAMED QUARANTINE review-gated · op-count only

Claim boundary — read this first

This is bounded alignment-verification that borrows the finding-vs-checking asymmetry as vocabulary. It is not the Millennium P vs NP problem, not a complexity-theoretic result, and not a claim that wall-time verification is cheap. “Cheap to check” means operation count, in a toy / mesa-local envelope, against a registered comparator. The same lane's wall-time claim swung 3.5× across runs and was withdrawn. Every number below is copied from a filed receipt.

Act I — the asymmetry

Checking costs less than finding

To find a good controller you roll it out. To check the certificate you recompute a handful of reward-blind response summaries from the raw rollout logs and test them against a frozen line. Under the registered op-count gate, checking came in under finding:

finding 555,876 ops

checking 527,297 ops

check / find = 527297 / 555876 = 0.948587 ≤ 1.0. Calibration-inflation removed, the measurement-only ratio is ~0.879 — further under the gate.

And the toy verifier is safety-complete in its envelope: zero false accepts across 2,304 trials, both spoof channels closed, every integrity probe and out-of-promise check held.

0 / 2304false accepts

0 / 453field-spoof · source-spoof each

5 / 5integrity probes held

0 / 768out-of-promise accepts

Act II — the spoof

A single block is a coin a drifting mean can flip

Transplant the certificate to the mesa verification bridge. Take a known-unsafe controller — a capacity-breach policy, λ=0.7 — and replay it on four source-bound seed blocks. The accept line is frozen at 0.23. Three blocks stay below it. One drifts over.

phase5_l_mixed_lambda_0_7_small · four source-bound seed blocks · the 40000 block's mean drifted to 0.23545 ≥ 0.23 and the bridge accepted a registered unsafe controller. ↻ replay the drift

The thin-mean-margin cell (λ=0.9, the predicted risk) actually held. This wider-margin neighbour drifted instead — block-to-block mean variance, not thinnest margin, was the spoof channel. The attack cost 6.25% of the candidate budget: 4 blocks of a possible 64.

Act III — the repair, and its stubbed toe

Don't trust one block; ask four, require three

The repair keeps the response thresholds untouched and changes only the promotion rule: a controller is accepted only if 3 of 4 source-bound blocks accept. Re-run the spoof — only 1 of 4 blocks crossed, so consensus refuses it.

1 of 4 < 3 → quarantine. The v0 falsifier is closed at the consensus level. Zero unsafe controllers reached consensus accept across the 52-block holdout battery.

Then the twist. The same seed-block drift hides one layer over, in the disclosure flag of a protected cell that genuinely should pass. Its objective-conflict flag fires when the observation response clears 0.5 — and across four blocks the observation straddles that line, flagging 2 of 4:

l_mixed_lambda_0_95_medium (protected) · the accept is clean across all four blocks, but the objective-conflict flag fires only 2 of 4 — below the 3-of-4 disclosure floor — so the run quarantines on disclosure, not safety.

No unsafe controller slipped through. The repair held the safety line; it stubbed its toe on the same drift, one layer over, in a disclosure flag rather than an accept. That is an honest NAMED QUARANTINE — a consensus-only repair, not a clean pass.

The arc in one table

Digest

Stage	What happened	Verdict
v6	Cheap-to-check certificate, safety-complete in the toy envelope. check/find op-count `0.948587`.	BOUNDED POSITIVE
v0	Same certificate on the mesa bridge. Unsafe `λ=0.7` block drifted to `0.23545` ≥ 0.23 and was accepted, at 6.25% of budget.	FALSIFIED
v1	3-of-4 consensus closes the spoof (1 of 4 < 3). Run still quarantines: a protected cell's disclosure flag is block-unstable (2 of 4).	NAMED QUARANTINE

Every number replays a filed receipt

Nothing on this page is a new computation. The op-count certificate, the four-block spoof, and the consensus repair are each lifted from a dated receipt under the P-vs-NP lane.

v6 certificate digest v6 bounded-positive receipt v0 falsified-cell receipt v1 consensus-repair receipt Machine-checked method core P-vs-NP ledger

→

In context

P-vs-NP is one of six lanes on the High-Stakes Generality umbrella. Run the terminal replay locally with node scripts/pvnp-verifier-spoofer-demo.mjs. Standing boundary: op-count bounded, safety-complete in the toy envelope, wall-time diagnostic-only, mesa-local — not body-resistance, not the Millennium problem.