# Boros Backtest Modes `demeter.boros_v4` supports multiple execution modes that should be treated as engine parameters rather than different strategy families. ## Mode Matrix | Mode | Intended Use | Fidelity | Speed | Recommendation | | --- | --- | --- | --- | --- | | `BAR_APPROX` | smoke tests, signal debugging | low | highest | dev only | | `NEXT_TRADE` | lightweight replay baseline | low to medium | high | dev and sanity checks | | `TX_REPLAY_BEST_EXEC` | stable experimental replay | medium | high | default experimental baseline | | `EVENT_REPLAY_FULL_PROTO` | highest current Boros fidelity | highest | high enough | recommended high-fidelity mode | ## Current Comparison Latest stable comparison on the real dual-market Boros dataset: | Engine | Runtime (s) | Final Net Value | Total PnL | | --- | ---: | ---: | ---: | | `TX_REPLAY_BEST_EXEC` | 209.50 | 996.1757 | -3.8243 | | `EVENT_REPLAY_FULL_PROTO` | 200.70 | 1000.8072 | +0.8072 | Interpretation: - `EVENT_REPLAY_FULL_PROTO` currently improves pnl materially. - It does so without a runtime penalty large enough to reject it. - That makes it the best candidate for the recommended high-fidelity Boros engine. ## Diagnostics The full execution mode also supports source-selection diagnostics. Current diagnostic snapshot after dust filtering and conservative split gating: - `orderbook_fill`: 155 - `amm_fill`: 25 - `only_available_quote`: 150 - `selected_best_all_in_rate`: 30 This suggests: - most executions still come from a single credible source - deeper split or matching complexity should be justified by measured gains, not by assumption ## Recommendation For public experimental release: - expose `TX_REPLAY_BEST_EXEC` as the stable default - expose `EVENT_REPLAY_FULL_PROTO` as the higher-fidelity experimental option - keep `BAR_APPROX` and `NEXT_TRADE` documented as development utilities