feedback
← Back to Learn

How to Read a Backtest

The six numbers BlaveClaw reports — what each means and which ones to trust.

A real example to anchor the concepts

The btc_sma_cross strategy (SMA45/SMA100 on BTCUSDT 1h, 2022–2026) produced these results:

Total Return
+145.8%
Benchmark (Buy & Hold)
+68.2%
Max Drawdown
−42.6%
Sharpe Ratio
0.77
Sortino Ratio
0.72
Omega Ratio
1.04

The strategy more than doubled the buy-and-hold return — but it also had a brutal −42.6% drawdown at one point. Whether that's acceptable depends entirely on your risk tolerance, and the metrics below will help you think through it.

Sharpe Ratio — risk-adjusted return

Sharpe ratio = annualized return ÷ annualized volatility (both upside and downside). It answers: how much return per unit of total risk?

SharpeInterpretation
< 0Losing money on a risk-adjusted basis
0 – 0.5Weak — barely compensates for risk
0.5 – 1.0Acceptable for trend-following (our SMA example falls here)
1.0 – 1.5Good
> 1.5Strong — treat with skepticism if backtest period is short
High Sharpe on a short backtest is a red flag. Sharpe 3.0 over 6 months = fewer than 130 data points. That's noise, not signal. Require at least 2 years before trusting a Sharpe above 1.5.

Sortino Ratio — downside-only risk

Sortino ratio is like Sharpe, but only penalizes downside volatility. Upside variance doesn't count as risk. It answers: how much return per unit of bad volatility?

For the SMA Cross, Sortino (0.72) is close to Sharpe (0.77) — meaning the strategy's volatility is fairly symmetric. A strategy where Sortino is significantly higher than Sharpe has large upside swings but controlled drawdowns, which is desirable.

Sortino vs SharpeWhat it suggests
Sortino ≈ SharpeSymmetric returns — upside and downside volatility are similar
Sortino >> SharpeReturns are positively skewed — large gains, small losses (ideal)
Sortino << SharpeReturns are negatively skewed — small gains, occasional large losses (dangerous)

Max Drawdown (MDD) — your worst moment

Max drawdown is the largest peak-to-trough decline in equity during the backtest. If equity peaked at $100,000 and dropped to $57,380 before recovering, MDD = −42.6%.

MDD matters because it measures how much pain you'd have had to endure to stay in the strategy. Most people quit well before the recovery.

Never allocate more capital than you could psychologically survive losing twice over.
If MDD is −42.6%, a $100K allocation means you may face a $42,600 paper loss. If that would cause you to stop the strategy, allocate less.

Omega Ratio — the full picture

Omega ratio = total returns above a threshold ÷ total returns below it. The threshold is usually 0 (break-even). Unlike Sharpe, Omega captures the full return distribution, not just mean and variance.

OmegaInterpretation
< 1.0More total loss than gain — unprofitable
1.0 – 1.1Marginally profitable, fragile under real-world costs
1.1 – 1.3Solid
> 1.5Strong — or possibly overfitted

The SMA Cross has Omega = 1.04 — only slightly above 1.0. After the 22% of capital paid in fees over 4 years, the real-world edge is thin. This is normal for a trend-following strategy; the Sharpe and Sortino ratios are the better primary quality metrics for this strategy type.

The five most common backtest mistakes

1

Lookahead bias: Using data that wasn't available at decision time. BlaveClaw enforces next-bar-open execution to prevent this, but custom indicator calculations can still introduce it — e.g., computing a Z-score using the full series' mean.

2

Skipping the warm-up period: Indicators like SMA(100) need 100 bars before they're meaningful. The first 100 bars should be excluded from performance measurement. BlaveClaw's WARMUP parameter handles this automatically.

3

Optimizing on the full dataset: If you scan parameters using all available data and then report performance on the same data, you're reporting training error, not generalization. Always hold out at least 20–30% of data for out-of-sample validation.

4

Ignoring fees: The btc_sma_cross paid 22% of capital in fees over 4 years. For higher-frequency strategies, fees can easily exceed alpha. Use FEE = 0.0005 (maker/taker 0.05%) as a realistic baseline on Binance.

5

Confusing total return with risk-adjusted return: A strategy that returns 200% with an MDD of −80% is not "better" than one returning 80% with an MDD of −15%. The first will cause most real users to panic-exit near the bottom.

Next step: Now that you can read a backtest, learn how BlaveClaw's proprietary Taker Intensity indicator can power signal strategies. Read Taker Intensity Guide →