A 2026 AI research project - on GitHub

PSDGPhilosopher's Stone Dice Game

PSDG — Philosopher's Stone Dice Game. An exact solver can lose to a worse opponent — not from noise, randomness or hidden information, but because the rules decouple placement from possession. The act of choosing is long over by the time its consequences are realized. PSDG is the smallest game where this gap is exactly measurable.

Philosopher's Stone Dice Game playmat on a wooden table with dice, books, and alchemical props. — Dice only randomize the opening position. After that, PSDG is fully deterministic and skill-based.

PSDG is a compact, exactly solved deterministic dice game with reproducible benchmarks. The board snapshot alone is not a sufficient state representation: an agent can perfectly optimize the obvious scoring while still missing the latent structure that actually determines who wins. In benchmarked deployment protocols, even a static policy derived from the exact oracle can lose to a blundering opponent about 6–9% of the time.Randomness applies only to setup (dice fix the opening position); after that, no further rolls. Commitments (Twists) land before every scoring consequence is easy to read from the tableau alone—without hidden information after setup.Learn basic play in about 3 minutes — Watch on YouTube. Tiebreak / Immortal takes a few more minutes — demo & script.

Mechanism	Role in the rules
Irrevocable early commitments	Twist locks both Phase 1 tops and Phase 2 facings before Tumble and Exchange unpack scoring.
Latent / conditional rules	Eligibility, Phase 2 after Tumble, Immortal — all visible, easy to ignore if state is “what scores now.”
Information at the Exchange	Sequential vs simultaneous Gift — best-response timing vs Nash-style node (see snapshot).

PSDG structure	Rough real-world rhyme
Static principal line vs re-solving	Fixed policy vs replanning after deviation or new context
Simultaneous Exchange	Parties cannot condition on each other’s latest move in lockstep
Latent tiebreaker / phase activation	Logic that only governs once rare preconditions fire

Audience	Page
One-page narrative (mechanisms + layers + implications)	Technical report (summary)
Adjacent papers & frameworks (non-exhaustive)	Related work
FAQ (short answers; same opening framing as home)	FAQ
Motivation — proxy vs latent rules (no prerequisites)	Mortal vs Oracle parable
Same story — runnable Q-learning / bandit demo	Mortal vs Oracle — Q-learning demo
ML / agents / evaluation	PSDG for ML
Alignment, robustness, misspecification	PSDG for AI safety
Equilibrium, commitment, extensive form	PSDG for game theory
Mechanism: static vs re-solving on a fixed open	Fixed-board blunder sweep
One-game blunder walkthrough	Blunder wins (worked example)

Solver mode	Exchange	B wins	Rate	A wins + draws (= 5000 − B)
Re-solving (optimal at Exchange)	Simul. or sequential	287	5.7%	4713
Static (A commits from principal line)	Sequential (B best-responds)	427	8.5%	4573
Static (A commits from principal line)	Simultaneous (B plays Nash)	347	6.9%	4653

If you are…	Start here	Dedicated page
Quick links after reading the snapshot	§ Get oriented	—
Methodology / how reviewers bucket the work	How to read PSDG · FAQ § 1	/how-to-read-psdg.html
New to the research thesis (not the rules)	Parable	/parable.html
Run the minimal RL / bandit demo	Q-learning / bandit demo	/qlearning-parable.html
Clone solver / reproduce benchmarks	Solver, benchmarks, and GitHub	Home § Solver
New to the game / how to play	§ Rules in brief · Rules	/rules.html
Spine: brief → unusual → claims	§ In brief · § Fastest way	Home
Training / evaluating agents	§ Snapshot · ML	/ml.html
Objectives, robustness, misspecification	§ In brief · AI safety	/aisafety.html
Equilibrium, extensive form, commitment	§ Snapshot · Game theory	/gametheory.html

PSDGPhilosopher's Stone Dice Game

In brief

Why this is unusual

Fastest way to understand PSDG

1. The parable

2. The game

3. The solver and benchmark

What PSDG is claiming

Solver, benchmarks, and GitHub

Ready, fire, aim

Rules in brief

Twist and Tumble

Core ideas

Unlike chess, the board isn’t enough

Three mechanisms (at a glance)

Poisonous System Gift

Exact enough to sharpen the diagnosis

Illustrative analogies (not identities)

Specialist audiences

Empirical snapshot

Reading the 5.7% row (re-solving)

Reading static vs simultaneous in this table

Get oriented

Audience routes

PSDGPhilosopher's Stone Dice Game

In brief ​

Why this is unusual ​

Fastest way to understand PSDG ​

1. The parable ​

2. The game ​

3. The solver and benchmark ​

What PSDG is claiming ​

Solver, benchmarks, and GitHub ​

Ready, fire, aim ​

Rules in brief ​

Twist and Tumble ​

Core ideas ​

Unlike chess, the board isn’t enough ​

Three mechanisms (at a glance) ​

Poisonous System Gift ​

Exact enough to sharpen the diagnosis ​

Illustrative analogies (not identities) ​

Specialist audiences ​

Empirical snapshot ​

Reading the 5.7% row (re-solving) ​

Reading static vs simultaneous in this table ​

Get oriented ​

Audience routes ​

In brief

Why this is unusual

Fastest way to understand PSDG

1. The parable

2. The game

3. The solver and benchmark

What PSDG is claiming

Solver, benchmarks, and GitHub

Ready, fire, aim

Rules in brief

Twist and Tumble

Core ideas

Unlike chess, the board isn’t enough

Three mechanisms (at a glance)

Poisonous System Gift

Exact enough to sharpen the diagnosis

Illustrative analogies (not identities)

Specialist audiences

Empirical snapshot

Reading the 5.7% row (re-solving)

Reading static vs simultaneous in this table

Get oriented

Audience routes