curve_depeg_demo

Forked Ethereum mainnet at block 19,000,000, 4 simulation steps (0.8 minutes of mainnet block time). Seed 42.

Horizon
4 steps
Actions recorded
8
Wall-clock
11.7s

No swap or liquidation actions executed

This run produced no successful swaps, no liquidations, and no failed swap attempts either. The scenario may have completed without firing any of the report’s currently-known action types — check the actions table in data/runs/runs.duckdb for what this run actually recorded.

Methodology

Reinforcement-learning agents

The agents in this run use the hand-coded / scripted baselines (see the Agent column above). Mayavi’s agents are RL-trainable on the same forked-mainnet stack — mayavi train --env aave|vesting|liquidator produces a PPO policy, and VestingRecipient(policy_path=…) loads one into a scenario. Trained-policy-vs-baseline evaluation results (each on a real forked mainnet, $0 marginal cost):