← Back to Blog

Why We Only Use Peer-Reviewed Strategies (And Why That Matters)

Most trading signals are backtested on the same data used to discover them — a process that virtually guarantees impressive historical numbers and mediocre live results. Peer-reviewed research breaks that cycle. Here's exactly how, and why it's the only foundation worth building on.

If you've ever looked at a signal provider's backtest and thought "that seems too good," your instincts are correct. The uncomfortable truth about quantitative trading is that finding a pattern in historical data is easy. Finding one that actually persists into the future is a fundamentally different problem — and most retail signal products don't even try to solve it.

The Problem With Most Retail Signals

Here's how the typical signal gets built: a developer downloads several years of price data, runs a strategy optimizer across hundreds of parameter combinations, and publishes whichever version produced the best Sharpe ratio. The backtest looks pristine. Drawdowns are modest. The equity curve slopes upward at a satisfying angle.

This process has a name in statistics: data mining bias, sometimes called curve-fitting or overfitting. When you test enough variations of a strategy on the same dataset, you will eventually find one that fits the historical noise almost perfectly. The problem is that noise, by definition, doesn't repeat. The strategy wasn't discovering a real market inefficiency — it was memorizing random fluctuations in a specific historical window.

Three red flags that expose this approach:

Without those safeguards, a five-year backtest is essentially a five-year in-sample fit. It tells you almost nothing about the next five years.

What Peer Review Actually Means

Academic peer review is not a rubber stamp. When a researcher submits a paper claiming to have found a market anomaly, anonymous reviewers — typically other finance academics — attempt to find every possible flaw: data errors, look-ahead bias, transaction cost omissions, cherry-picked sample periods, alternative explanations that weren't considered. The paper only gets published if it survives that gauntlet.

More importantly, the editorial process requires full methodology disclosure. Every variable, every parameter, every data source must be described in enough detail that an independent researcher can reconstruct the result from scratch. This is the opposite of a black-box signal.

Why This Matters

A published, replicable strategy has already been stress-tested by researchers who were actively trying to kill it. That's a meaningfully higher bar than a proprietary backtest that no one outside the firm has ever examined.

Replication is the second layer. After publication, other research groups around the world independently attempt to reproduce the result — often on different datasets, different time periods, different markets. A strategy that survives multiple independent replications is a strategy that reflects something real about how markets work, not just a quirk in one researcher's data.

The Replication Crisis in Finance

There's a sobering body of research on how many published factors actually hold up. Campbell Harvey, Yan Liu, and Heqing Zhu catalogued the academic literature on cross-sectional return predictors and found that over 300 factors had been claimed in peer-reviewed journals by 2016.

"Most claimed research findings in financial economics are likely false."
— Harvey, Liu & Zhu (2016), Review of Financial Studies

Their argument: the conventional statistical threshold for significance (a t-statistic above 2.0) was calibrated for a world where researchers test one or two hypotheses. When you're searching across hundreds of potential factors, that threshold generates an enormous number of false positives by chance alone. To account for the multiple-testing problem across the full factor zoo, they proposed the bar should be closer to a t-statistic of 3.0 or higher.

316+
Claimed factors by 2016
>50%
Estimated false positives
3.0
Recommended min. t-stat

The message is not that academic finance is broken — it's that even peer review isn't a perfect filter, and the highest-confidence signals are those that have been replicated across independent studies, multiple asset classes, and long out-of-sample windows.

What Survives Replication

A handful of factors show up reliably enough, across enough contexts, that the academic consensus treats them as genuine market phenomena rather than statistical artifacts. These tend to share a common property: there is a plausible economic mechanism that explains why the premium should exist and why it might persist even after discovery.

These are not obscure discoveries. They have been documented in dozens of independent studies, across markets and time periods spanning more than a century in some cases. That breadth of evidence is what distinguishes them from the long tail of factors that look compelling in a single dataset but evaporate under scrutiny.

How R2S Applies This

Every strategy on R2S must clear three requirements before it gets implemented:

The current strategy library covers 20+ strategies across equities, fixed income, commodities, and currencies. Each one links to the source paper directly in the dashboard, so you can read the original research yourself rather than taking our word for it.

This Is Not a Guarantee

It would be dishonest to end here without the most important caveat: peer-reviewed does not mean risk-free, and it does not mean the strategy will work during the period you happen to be subscribed.

Even the most robustly documented factors go through extended drawdown periods. Momentum, for example, experienced sharp reversals in 2009, 2020, and other periods where market conditions shifted rapidly. Value underperformed for most of the 2010s before recovering. These are features of the strategies, not bugs — the premium exists precisely because investors face real pain holding the position during those difficult stretches.

The edge from systematic, research-backed strategies is probabilistic, not deterministic. It says: over a sufficiently long horizon, this approach has a meaningful probability of outperforming a passive benchmark. It does not say anything about any particular month or year.

What peer review buys you is not certainty. It buys you a much higher prior probability that the underlying signal reflects something real. That is worth a great deal compared to the alternative — but it should be understood clearly for what it is.

Further Reading

  • Harvey, C.R., Liu, Y., & Zhu, H. (2016). "…and the Cross-Section of Expected Returns." Review of Financial Studies, 29(1), 5–68. The definitive treatment of the multiple-testing problem in the factor zoo.
  • McLean, R.D., & Pontiff, J. (2016). "Does Academic Research Destroy Stock Return Predictability?" Journal of Finance, 71(1), 5–32. Documents how anomaly returns shrink by roughly half in the post-publication period, as arbitrageurs trade against the now-public signal.

Browse the strategy library

Every R2S strategy links to its source paper. Read the research, check the methodology, and see live signals — all in one place.

OPEN THE DASHBOARD →