Portfolio Backtest: Testing Combined Strategies

1

Start from the first out-of-sample day

The first lookback days (default: 365) are used as the initial calibration window. Performance reporting starts only after this — strictly out-of-sample.

2

Optimize weights on the past window

Each day, the optimizer finds the weight vector w that maximizes the slope/volatility of the combined equity curve over the past 365 days. Constraints: weights sum to 1, all ≥ 0 (no shorting strategies).

3

Apply weights to tomorrow's returns

The weights just computed are applied to the next day's actual strategy returns. The optimizer never sees the day it's predicting — this is what makes it out-of-sample.

4

Roll forward one day and repeat

The window slides forward. Each day gets fresh weights based on the most recent 365-day history.

Panel	What it shows	What to look for
Top — Cumulative Return	Managed portfolio (green) vs. p5–p95 band of 1,000 random portfolios (grey)	Managed line should trend above the random median. Consistent outperformance > 2 years is meaningful.
Middle — Drawdown	Managed portfolio's drawdown from equity peak	MDD should be tolerable. Deep drawdowns that recover slowly suggest low diversification or strategies too correlated.
Bottom — Weight History	Each strategy's allocation % over time	Stable weights → strategies have consistent relative performance. Rapidly switching weights → the optimizer is chasing short-term noise.

Warning sign	What it likely means
Managed Sharpe < random median	Dynamic allocation is hurting, not helping. Consider equal-weight allocation instead.
Weights oscillate wildly (bottom panel)	Strategies have similar edge — the optimizer is fitting noise. Use longer lookback or add more strategies.
One strategy receives >80% weight consistently	The other strategies are not contributing. Evaluate whether they're worth running at all.
MDD exceeds 2× individual strategy MDD	Strategies are drawing down together. They may be too correlated to provide diversification.
OOS period < 180 days	Not enough data — results are statistically unreliable. Run individual strategy backtests longer first.

BlaveClaw

Portfolio Backtest: Testing Combined Strategies

The problem with backtesting strategies one by one

What the management backtest does

The random portfolio benchmark

Reading the three-panel chart

How weights are optimized: slope / volatility

Leverage and target volatility

When to run the management backtest

Warning signs in the management backtest