The Problem That Won't Stay Quiet
Every advisor and institutional allocator faces the same recurring tension: how do you protect a balanced portfolio during the moments that matter most — the sharp, disorienting drawdowns that test client conviction and, mathematically, require disproportionate recoveries just to break even?
The 60/40 portfolio, long the canonical benchmark of balanced investing, has an admirable long-run track record. But its short- and medium-horizon biography is littered with episodes of severe pain. And that pain is not merely psychological. As Elisabetta Basilico, PhD, CFA, notes in her March 2026 Alpha Architect research summary1, the mechanics are unforgiving: "Those drawdowns matter not only behaviorally but mechanically, through volatility drag: large losses require disproportionately large gains to recover."
That single sentence encapsulates the engine behind every serious portfolio construction conversation. A 50% drawdown requires a 100% gain to recover. The behavioral and mechanical costs compound on each other. The question, then, is not whether to seek defensive strategies — it is which ones actually work. And that question, as Basilico makes clear, has been badly constrained by the brevity of available data.
The Empirical Trap: When History Is Too Short to Tell the Truth
Most practitioners evaluating defensive strategies are working from a dataset that begins around 1980 — roughly four decades of monthly returns, a handful of true crises (the 2000–02 dot-com collapse, 2008–09, the 2020 COVID shock), and a long bull market that flatters almost everything. That is not a rigorous stress test. It is a historical coincidence.
The paper Basilico synthesizes corrects for this directly. As she summarizes: "Most evaluations of defensive strategies are constrained by post-1980 data, which contains only a limited number of severe drawdowns. The authors extend the testing window to 1800 using a stitched deep-history dataset at monthly frequency. That materially increases the number of meaningful drawdown episodes and reduces the risk that conclusions are driven by a small set of modern crises."
Two centuries. Monthly frequency. This is not a backtest exercise — it is a fundamentally different epistemic standard. When you dramatically increase the number of severe drawdown episodes in your sample, strategies that appeared robust in the modern era are exposed and strategies that appeared unremarkable are validated. Basilico's central framing is unambiguous: two centuries changes the leaderboard.
This reframing — from a modern sample to a deep-history sample — is arguably the most important methodological contribution of the paper she is reviewing. For advisors and allocators who rely on peer-reviewed academic research, the implications ripple across standard assumptions about gold, put options, factor investing, and trend-following in ways that demand a careful re-read of existing portfolio construction doctrine.
The Winner's Circle: Trend-Following and DAR
The headline finding is clean. Across the worst months for a 60/40 portfolio — specifically the bottom 10% — two strategies emerge as reliably effective: trend-following and a factor-based overlay called DAR (Defensive Absolute Return).
Basilico writes: "Across the worst 60/40 months, trend-following and a defensive factor overlay called DAR (defensive absolute return) stand out as the most reliable help. In the 10% worst 60/40 months, trend-following and the original DAR design both deliver positive average returns, and they work more often than not."
Two characteristics make this finding significant. First, both strategies deliver positive average returns in the very months when 60/40 is suffering most — not merely smaller losses, not "less bad," but genuinely positive. Second, they work "more often than not," which matters for the implementation reality: a hedge that works probabilistically the majority of the time is one that advisors can defend to clients through difficult periods.
Traditional equity factors — low-risk, quality, value — also exhibit some defensive properties, but Basilico is explicit that they occupy a secondary tier: "Traditional equity factors like low-risk, quality, and value help too, but they tend to be weaker hedges than these two." For allocators already tilted toward quality or minimum-volatility equities, this is not a reason to abandon those tilts — but it is a clear signal not to conflate factor exposure with crisis protection.
Gold: The Safe Haven That Isn't (In the Long Run)
Perhaps the most provocative finding in the paper, and one with direct implications for the portfolio construction industry, concerns gold. Gold's status as a defensive asset — a "safe haven," an inflation hedge, a crisis buffer — is near-dogma in mainstream advisory practice. Tail-risk discussions almost reflexively include gold. The 2020 pandemic, the 2022 inflation shock, various geopolitical crises — modern episodes have generally been kind to gold's reputation.
The long sample is not.
Basilico summarizes the finding without hedging: "Gold's reputation as a safe haven is supported by some modern episodes, but the long sample does not support gold as a consistent defensive strategy for 60/40 drawdowns. In the authors' tests, gold underperforms in the worst 60/40 months over the full sample. This is a useful corrective for portfolio construction that treats gold as a default hedge."
"A useful corrective" is an understatement for what this evidence represents at the level of standard practice. If the two-century dataset is treated as the relevant test, gold fails the job description of a reliable portfolio hedge in a statistically meaningful way. Recency bias — a powerful force in both human cognition and financial research — has constructed a gold narrative that the deeper evidence does not support.
This does not necessarily mean eliminating gold from all portfolios. Investors with specific inflation-hedging mandates, commodity exposure goals, or currency diversification objectives may retain legitimate reasons to hold gold. But those holding gold specifically as protection against 60/40 drawdowns are relying on a story that the data — extended to its full historical depth — declines to validate.
Put Options: Effective Tail Protection, Expensive Full-Cycle Insurance
Equity index put options occupy a familiar place in the defensive toolkit. They offer convex protection — when markets crash sharply, puts deliver asymmetric gains. That is their design and their appeal.
But Basilico's synthesis of the research is unsparing about the cost. "Equity index put overlays deliver convex protection in sharp selloffs, but the long-run carry cost is meaningfully negative. The paper's conclusion is consistent with prior work: systematic option buying is effective in the left tail, but it tends to reduce long-run portfolio returns unless used selectively or funded explicitly."
The phrase "funded explicitly" is the key. A put overlay that is paid for by selling other options (collars, for instance) or by systematically harvesting volatility premium elsewhere operates under a different cost structure than one financed by simply carving into equity exposure. The research does not dismiss tail protection — it re-frames the question from "does it work in crashes?" (yes) to "what is the full-cycle cost?" (meaningfully negative unless thoughtfully funded).
For advisors considering systematic put-buying programs, this is a disciplined reminder that crash protection is never free, and that the cost of protection — across long periods that include many non-crash months — must be factored into the total value proposition.
DAR4020: Engineering a Smarter Defensive Overlay
The paper's most technically sophisticated contribution — and the one Basilico highlights as a practical enhancement — is a redesign of the DAR portfolio itself. The original DAR concept, introduced by Cavaglia et al. (2022), sorts a broad set of factor strategies by their rolling correlation to the 60/40 benchmark. It takes long positions in the most negatively correlated tercile of factors and short positions in the most positively correlated tercile.
The construction is elegant in theory but carries a structural limitation in practice: because it is perfectly balanced long/short against correlation, it produces "a negative correlation but near-zero long-run return by construction." Protection is achieved, but expected return across non-crisis periods is sacrificed almost entirely. This is the defensive overlay that costs you in bull markets — tolerable in theory, practically difficult to sustain across multi-year equity rallies without client pressure to exit.
The paper's proposed modification — DAR4020 — adjusts the construction asymmetrically. As Basilico explains: "The authors propose DAR4020: long 40% of the most negatively correlated factors and short 20% of the most positively correlated factors. This preserves much of the defensive correlation profile while creating a net long exposure to factor premia. The result is a strategy that aims to provide protection with a more durable full-cycle return profile."
This is a meaningful engineering improvement. By going long a larger proportion of the defensive factor universe and short a smaller proportion of the positive-correlation universe, the strategy retains most of its crisis-period protection while adding a positive expected return from its net long factor exposure. The hedge is not fully free — no hedge is — but the full-cycle return drag is substantially reduced relative to the symmetric original. For institutional allocators seeking to justify a defensive sleeve through extended periods of market strength, this distinction is critical.
The Complementarity Thesis: Why DAR4020 and Trend-Following Belong Together
One of the most practically useful insights in the paper — and one that reflects the kind of portfolio construction nuance that separates systematic researchers from product marketers — is that trend-following and DAR4020 are not redundant. They are structurally different animals that happen to share the same job description in a portfolio context, and their differences make them better together than either alone.
The asymmetry Basilico describes is specific and important. Trend-following is a momentum-based strategy: it needs a sustained directional move to generate the signals required to reposition. In sharp, sudden drawdowns — the kind where markets fall 15–20% over a few weeks before reversing — trend strategies may arrive late to the trade and provide limited protection precisely when it is most needed.
DAR4020 operates differently. Because it is structurally positioned with negative beta to the 60/40 benchmark through its factor correlation architecture, its protection is immediate. It does not need to "see" the drawdown to provide defense — the negative correlation is built in before the market moves.
Basilico captures this asymmetry directly: "Trend-following is often less helpful at the start of drawdowns, particularly when declines are sharp and short-lived, because signals can be slow to reposition at turning points. DAR4020, by contrast, tends to provide immediate protection because it is structurally positioned with negative 60/40 beta. Over longer drawdowns, trend-following tends to 'catch up' and often becomes highly effective."
The implication for portfolio construction is explicit: "A blended allocation improves robustness across drawdown shapes." And Basilico's final synthesis from the underlying research paper is perhaps the most distilled expression of the paper's practical message: "Our long-run evidence shows that multi-asset defensive strategies, particularly a return-enhanced version of the defensive absolute return (DAR) portfolio introduced by Cavaglia et al. (2022) and trend-following, provide the most effective downside protection. DAR and trend-following are complementary across tests by diversifying each other across stages of drawdowns."
That sentence — "diversifying each other across stages of drawdowns" — is not marketing language. It is a precise description of a structural relationship: one strategy protects in the immediate shock, the other protects in the sustained deterioration. Together, they cover the full topography of how drawdowns actually unfold.
The Design Principle: Defensiveness as a Portfolio Problem, Not a Product Decision
Throughout Basilico's synthesis runs an organizing principle that the evidence consistently reinforces: effective defensive portfolio construction is a system design question, not a product selection question.
The industry's default mode is to evaluate defensive products one at a time: "Is gold good? Should we add a put overlay? Should we run trend-following?" But the paper's evidence — and Basilico's synthesis of it — suggests that this product-by-product framing misses the larger point. The interactions between defensive strategies, their different sensitivities to the stage and shape of drawdowns, and their combined effect on the full-cycle return profile of the total portfolio are what matter.
As Basilico's editorial note summarizes: "Treat defensiveness as a portfolio design problem, not a product choice. The evidence supports combining defensive sleeves rather than relying on a single hedge. A balanced overlay of trend-following and DAR4020 can reduce drawdown depth while maintaining a positive expected return profile, improving the likelihood that clients can hold the hedge through extended bull markets."
That last clause deserves to be read again: "improving the likelihood that clients can hold the hedge through extended bull markets." It is a behavioral insight dressed in a quantitative jacket. A hedge with a significantly negative full-cycle expected return will be abandoned. Clients — and their advisors — will exit it after two or three years of watching it subtract from returns in a rising market. The behavioral durability of a defensive strategy is part of its effectiveness. A hedge you abandon before the crash arrives is not a hedge at all.
Key Takeaways for Advisors and Investors
Extend your evaluation horizon — dramatically.
If you are assessing defensive strategies using post-1980 data, you are working with an insufficient sample. The two-century evidence base changes the rankings materially. Strategies that look compelling in the modern sample may not hold up in deep history.
Trend-following and DAR deserve serious allocation consideration.
These are the two strategies with the most consistent empirical support as defenders of balanced portfolios across crisis periods. For advisors not currently including either, the evidence warrants a structured re-evaluation.
1. Gold is not a reliable 60/40 hedge — full stop.
Modern episodes support gold's defensive reputation. The long-run data does not. Advisors relying on gold as the primary hedge against equity/bond drawdowns are building on a narrative that the full sample contradicts. Gold may have other legitimate portfolio roles, but crisis protection for 60/40 investors is not one the evidence validates consistently.
2. Puts work in crashes but cost significantly over full cycles.
Systematic put buying is effective in the left tail. It is not cost-neutral. Unless the put overlay is explicitly funded — through a collar structure, a volatility premium strategy, or other means — it will materially reduce long-run returns. Use tail protection surgically, not reflexively.
3. DAR4020 is a meaningful improvement on the original DAR design.
By adjusting from a symmetric long/short construction to an asymmetric 40-long/20-short structure, the research demonstrates that much of the crisis protection is preserved while a positive expected return is restored. For institutions seeking a structurally defensive factor overlay with positive long-run expected return, DAR4020 represents a more durable design.
4. Combine trend-following and DAR — they are not substitutes.
The asymmetry in their protection profiles — DAR4020 providing immediate structural defense, trend-following providing sustained protection in extended deteriorations — means they complement rather than duplicate each other. A blended allocation is more robust across different drawdown shapes than either in isolation.
5. Think about defensiveness as a system, not a product.
The research supports a portfolio design approach to tail protection: constructing a defensive sleeve that combines strategies with complementary profiles, maintains positive full-cycle expected returns, and remains behaviourally durable enough for clients to hold through extended periods of market strength. A hedge that gets fired in a bull market is a hedge that won't be there when you need it.
6. Behavioral durability is a quantitative variable.
The likelihood that a client holds a defensive strategy through bull markets is a function of its full-cycle return profile. Strategies with severely negative expected returns in non-crisis periods will be abandoned. This is not a soft observation — it has hard consequences for whether the hedge actually functions when the drawdown arrives.
Closing Perspective
Elisabetta Basilico's synthesis of this research is a model of what rigorous, practitioner-facing academic translation looks like. She takes a complex empirical paper — spanning two centuries of data, multiple factor strategies, and a novel portfolio construction methodology — and renders its most important findings in language that is both precise and actionable.
The underlying research itself represents a genuine contribution to the defensive investing literature. By dramatically extending the evaluation window, testing strategies across a far richer set of drawdown episodes, and proposing a practical refinement to an existing framework (DAR → DAR4020), the authors produce conclusions that are more trustworthy and more useful than the post-1980 consensus has been.
For the advisory industry, the message is bracing and specific: the two strategies with the deepest empirical support — trend-following and the DAR factor overlay — are not yet mainstream allocations in most balanced portfolios. Gold, which is mainstream, lacks consistent long-run evidence as a 60/40 hedge. Put options, which are used selectively, work in crashes but carry costs that most implementations do not adequately account for.
The research does not ask advisors to abandon common sense or client-specific mandates. It asks them to subject their assumptions about defensive strategies to a longer, more demanding empirical standard. Two centuries of drawdown data is not just more data. It is a harder test — and the strategies that pass it are the ones worth building into the infrastructure of durable portfolios.
Footnote:
1 Elisabetta Basilico, Phd. "The Best Defensive Strategies: Two Centuries of Evidence." Alpha Architect, 9 Mar. 2026.