Backtesting Gone Wrong
In the first part of this mini-series, we saw how backtesting can be both essential and deceptive: a valuable tool to compress decades of market history into hours of computation, but also vulnerable to biases such as look-ahead errors, survivorship distortions, and data snooping. The lesson was clear—careless backtesting can make weak strategies look strong.
In this second part, we turn to the constructive side of the story. What does it take to design backtests that genuinely inform investment decisions? We outline best practices that help separate meaningful signal from misleading noise, and discuss how to take the step from historical simulation to disciplined live implementation.
Backtesting is not about chasing the highest in‑sample Sharpe ratio. It is only meaningful if an investment strategy remains economically and statistically sound given realistic data limitations and trading frictions. The following four guidelines help to ensure that historical simulations form a sound basis for evaluating systematic investment strategies:
By following these practices, one can better recognize whether the apparent advantage of a strategy is likely due to a genuine investment edge or to the idiosyncrasies of a particular data set.
A well‑designed backtest is an important, but not sufficient prerequisite for the use of real capital in an investment strategy. The following steps are often advisable to successfully move from backtesting an investment strategy to its live version:
This step-by-step approach aligns the live implementation of the investment strategy with the expectations from the backtest and supports a disciplined implementation of the investment strategy over time.
In Part I, we saw how even a toy example with five stocks could produce wildly misleading results when biases or shortcuts crept in. To avoid such pitfalls in real-world applications, backtesting must be approached with rigor and discipline. The goal is to provide a realistic picture of the long-term performance of an investment strategy. To achieve this, the backtest of an investment strategy needs to:
Adherence to these principles enables backtesting to replay history as faithfully as possible and largely without illusion. However, even the most rigorous historical simulation is no guarantee of tomorrow’s success. Markets will always find new ways to surprise. What careful backtesting analysis does offer, however, is the well-grounded confidence that the expected performance of an investment strategy is based on evidence and not just the ghostly afterglow of an over-fitted past.
References
Arnott, R., Harvey, C. R., & Markowitz, H. (2019). A backtesting protocol in the era of machine learning. Journal of Financial Data Science, 1(1), 64 74.
Bailey, D., & López de Prado, M. (2014). The deflated Sharpe ratio: Correcting for selection bias, backtest overfitting and non normality. Journal of Portfolio Management, 40(5), 94 107.
Important Information: The content is created by a company within the Vontobel Group (“Vontobel”) for institutional clients and is intended for informational and educational purposes only. Views expressed herein are those of the authors and may or may not be shared across Vontobel. Content should not be deemed or relied upon for investment, accounting, legal or tax advice.
Results and/or use of backtesting are hypothetical in nature and use historical data without live trading results. Past performance is no guarantee of future performance, and changes in market conditions, liquidity, or execution costs could significantly impact live results. Backtesting carries the risk of overfitting, where a strategy is too closely aligned with past data and fails to adapt to new market conditions. This information is for educational purposes and does not constitute investment advice.
Vontobel makes no express or implied representations about the accuracy or completeness of this information, and the reader assumes any risks associated with relying on this information for any purpose. Vontobel neither endorses nor is endorsed by any mentioned sources.