vesting_cliff_weth_demo

Forked Ethereum mainnet at block 19,000,000, 25 simulation steps (5.0 minutes of mainnet block time). Seed 42.

Recipients
10
Total swaps
200
WETH sold
10.0000
USDC realized
25,630.49
Avg effective price
2,563.05
USDC/WETH
Peak step
0.7628
WETH sold (step 20)
Wall-clock
21.2s

Sell pressure over time

Cumulative inventory liquidated, in WETH, by simulation step. The slope reflects the aggregate selling intensity across all recipients.

Realized USDC per step

Per-step revenue summed across all recipients. Gaps indicate steps where no recipient sold (small rounding amounts below the dust threshold).

Effective realized price

USDC received per WETH sold, per step. A declining line indicates accumulating slippage from sustained selling.

Pool tick — WETH/USDC 5bps

Per-step Uniswap V3 tick for the tracked pool, captured by the MarketSnapshotHook at the start of every scheduler step. A monotone trend signals sustained price pressure; abrupt jumps correspond to large fills landing in a single step.

Per-recipient breakdown

Agent WETH sold USDC realized Effective price Swaps
recipient-0 Hand-coded heuristic 1.0000 2,563.07 2,563.07 20
recipient-1 Hand-coded heuristic 1.0000 2,563.02 2,563.02 20
recipient-2 Hand-coded heuristic 1.0000 2,563.05 2,563.05 20
recipient-3 Hand-coded heuristic 1.0000 2,563.04 2,563.04 20
recipient-4 Hand-coded heuristic 1.0000 2,563.08 2,563.08 20
recipient-5 Hand-coded heuristic 1.0000 2,563.07 2,563.07 20
recipient-6 Hand-coded heuristic 1.0000 2,563.08 2,563.08 20
recipient-7 Hand-coded heuristic 1.0000 2,563.02 2,563.02 20
recipient-8 Hand-coded heuristic 1.0000 2,563.05 2,563.05 20
recipient-9 Hand-coded heuristic 1.0000 2,563.01 2,563.01 20

Methodology

Reinforcement-learning agents

The agents in this run use the hand-coded / scripted baselines (see the Agent column above). Mayavi’s agents are RL-trainable on the same forked-mainnet stack — mayavi train --env aave|vesting|liquidator produces a PPO policy, and VestingRecipient(policy_path=…) loads one into a scenario. Trained-policy-vs-baseline evaluation results (each on a real forked mainnet, $0 marginal cost):