Pattern failed_double_top

Failed Double Top Breakout

Bearish reversal: price broke above resistance (double top breakout) but then falls back below the resistance level. Bulls are trapped. Failure threshold normalized by daily volatility.

Signal family

Pattern — Formal chart-pattern detectors (double tops / bottoms, failed breakouts, HH/HL structure).

Parameters

Name Description Default Range
peak_order Peak detection window 15 5–25
tolerance_zscore Tolerance (z-scores of daily vol) 1.5 0.5–3.0
failure_window Max days for failure after breakout 60 20–120

Historical context

12,186 valid triggers on 3,025 distinct tickers between 2015-07-28 and 2026-04-21. Universe: us_only · mcap ≥ $100,000,000 · price ≥ $1 (3,029 tickers). Entry at open T+1. 1d = intraday T+1; 20d = open T+1 to close T+20.

Benchmarks: spxew (S&P 500 Equal Weight — the primary benchmark here; a median-stock view that avoids the 2020+ megacap-concentration distortion), spx (S&P 500, cap-weighted), and msci (MSCI World USD). Per-stock regime: trending = ADX(14) ≥ 25, high vol = 20d ann. vol ≥ 20%.

At a glance (20d alpha vs S&P 500 Equal Weight, US-only)

Bearish (negative alpha = signal right)
-0.02%
vs random-date null: beats random (pperm=0.020)

Failed Double Top Breakout is a single-direction signal — only the bearish side is meaningful. (A failed pattern reverses the original setup, so the direction flips by construction.)

Reading this: the random-date null is: for each ticker, sample N random dates and compute the same alpha — what alpha does a signal with no information produce? If the signal's observed alpha beats the null (pperm≤0.05), it's adding real information. If it's inside or worse than the null, the signal doesn't add value over random firing — any observed alpha is either noise or a universe artifact.

How often does FAILED_DOUBLE_TOP fire in each regime?

The signal's bucket distribution is itself informative. If 50%+ of all FAILED_DOUBLE_TOP triggers fire in the "non-trending + high vol" quadrant, the signal is structurally a chop-market event — regardless of what its textbook definition claims. Bullish and bearish are shown separately; counts are across the full US-only sample after the mcap and price floor.

Failed Double Top Breakout (failed_double_top) — trigger count distribution by per-stock regime quadrant (trending/non-trending × high/low realized volatility) for , US-only universe

Per-stock regime quadrant — 20d alpha

Each trigger is tagged with the host stock's own technical regime on the trigger date: is the stock itself in a trend (ADX(14) ≥ 25) or ranging? And is its realized 20-day volatility high (≥ 20% annualized) or low? This is the textbook conditioning variable — "does this signal work better in trending stocks?" — answered at the level of the individual stock, not the market. Positive bars are good for the signal; negative bars mean alpha vanishes into the benchmark or worse.

Failed Double Top Breakout (failed_double_top) — mean 20-day alpha versus S&P 500 Equal Weight by per-stock regime quadrant,  side by side
Trending + Low vol
Stock in a clean directional move with low realized volatility. Textbook "trend-following paradise" — smooth grind with little whipsaw risk.
Trending + High vol
Violent directional moves — parabolic rallies, crisis selloffs. Trend exists but the path is noisy. Signal timing may be imprecise.
Non-trending + Low vol
Quiet chop, summer doldrums, consolidations. No directional bias but also no big swings — small edges become reliable if they exist at all.
Non-trending + High vol
Choppy and violent — the classical "whipsaw zone" for momentum signals. Crossovers and breakouts fire repeatedly without follow-through.

Sub-period check — does the signal work in every era?

A multi-year average can hide major instability. We split the sample into three non-overlapping windows: 2015–2019 (pre-COVID, normalized monetary policy), 2020–2022 (pandemic crash + recovery + rate-shock bear), and 2023+ (post-ZIRP, AI megacap rally). If a signal's alpha is positive overall but comes entirely from one era, that's a red flag — the conditions that produced it may not repeat. A robust signal shows a consistent sign across all non-empty buckets.

Failed Double Top Breakout (failed_double_top) — 20-day alpha split by historical sub-period (2015-2019, 2020-2022, 2023+) to check consistency across market regimes

↓ Bearish triggers negative alpha = signal was right (stock underperformed market)

Bench Metric 1d 5d 20d 60d 252d
spx Stock % +0.00% +0.19% +0.55% +1.82% +9.35%
Bench % +0.05% +0.33% +0.92% +2.73% +13.72%
Alpha % -0.05% -0.13% -0.36% -0.92% -4.35%
Median alpha -0.04% -0.22% -0.76% -2.48% -10.80%
Hit rate (α>0) 49.1% 47.7% 46.4% 43.5% 37.1%
p (naive) 0.0409 0.0173 0.0010 <0.001 <0.001
p (HAC) 0.0410 0.0184 0.0015 <0.001 <0.001
N 12,184 12,167 12,113 11,676 10,567
msci Stock % +0.00% +0.19% +0.55% +1.82% +9.35%
Bench % -0.02% +0.20% +0.68% +2.26% +11.22%
Alpha % +0.03% +0.00% -0.13% -0.42% -1.84%
Median alpha +0.00% -0.13% -0.56% -2.03% -8.12%
Hit rate (α>0) 50.0% 48.7% 47.5% 44.6% 40.0%
p (naive) 0.2971 0.9990 0.2456 0.0359 0.0001
p (HAC) 0.2967 0.9990 0.2613 0.0733 0.0556
N 12,166 12,124 12,046 11,598 10,517
spxew Stock % +0.00% +0.19% +0.55% +1.82% +9.35%
Bench % +0.07% +0.25% +0.57% +1.93% +9.62%
Alpha % -0.06% -0.04% -0.02% -0.07% -0.04%
Median alpha -0.06% -0.14% -0.42% -1.65% -6.60%
Hit rate (α>0) 48.6% 48.4% 47.7% 45.0% 41.3%
p (naive) 0.0225 0.4650 0.8315 0.7297 0.9251
p (HAC) 0.0225 0.4697 0.8379 0.7714 0.9623
N 12,137 12,073 12,003 11,603 10,453
Distribution of all 20d alpha outcomes for this direction. Median and winsorized mean shown.
Failed Double Top Breakout (failed_double_top) — bearish 20-day alpha histogram showing distribution of per-trigger returns
Observed 20d alpha (vertical line) against the null distribution of random-date firing. If the line is deep inside the null cloud, the signal adds no information. If it sits in a tail, the signal is doing real work in that direction.
Failed Double Top Breakout (failed_double_top) — bearish 20-day observed alpha versus random-date permutation null (200 iterations)
Permutation null detail — all horizons × both benchmarks
200-iteration null: for each ticker, sample N random dates from its history (matching observed trigger count) and compute the same alpha. The null distribution's 95% CI is where a signal with no information would land. pperm = one-sided fraction of null iters with mean ≥ observed.
Horizon Bench Observed α Null mean Null 95% CI pperm
1d spx -0.05% +0.06% [-0.07%, +0.04%] 0.109
1d msci +0.03% +0.03% [-0.10%, +0.02%] 0.980
1d spxew -0.06% +0.03% [-0.09%, +0.02%] 0.209
5d spx -0.13% +0.21% [-0.12%, +0.93%] 0.030
5d msci +0.00% +0.22% [-0.12%, +0.95%] 0.348
5d spxew -0.04% +0.24% [-0.10%, +0.97%] 0.144
20d spx -0.36% +0.41% [-0.14%, +1.73%] 0.005
20d msci -0.13% +0.53% [-0.03%, +1.88%] 0.015
20d spxew -0.02% +0.60% [+0.02%, +1.91%] 0.020
60d spx -0.92% +0.75% [-0.19%, +2.59%] 0.005
60d msci -0.42% +1.20% [+0.29%, +3.04%] 0.005
60d spxew -0.07% +1.43% [+0.49%, +3.24%] 0.005
252d spx -4.35% +1.54% [-0.77%, +4.77%] 0.005
252d msci -1.84% +3.89% [+1.55%, +7.11%] 0.005
252d spxew -0.04% +5.18% [+2.88%, +8.42%] 0.005

Example triggers on US large-caps (2023+, mcap ≥ $30B)

Six recent bearish FAILED_DOUBLE_TOP triggers on US mega-caps, filtered to |alpha| ≤ 25% to exclude catalyst-driven outliers (earnings surprises, M&A, binary events). The first three are the strongest outcomes — what the signal looks like when it works. The last three are the weakest — what the signal looks like when it fails. Each chart shows the stock's price with signal-appropriate technical overlays (e.g. MACD subpanel on MACD pages, Bollinger Bands on Bollinger pages, the 52-week trailing max line on 52w-high pages), a dot marking the trigger date, and the forward window shaded (green when the signal was right, red when it wasn't). Click any chart to open full-size.

Strongest outcomes (what FAILED_DOUBLE_TOP looks like when it works)
Weakest outcomes (what FAILED_DOUBLE_TOP looks like when it fails)
Stock-regime quadrants (2×2 per-stock, 20d alpha detail table)
Each quadrant groups triggers by the stock's own ADX(14) and RV(20) at the trigger date — the textbook conditioning variable (not market-level). Stock %, bench %, alpha %, and HAC p-value shown for each benchmark.
Quadrant N Stock % (spx) Bench % (spx) Alpha % (spx) p (HAC) Stock % (msci) Bench % (msci) Alpha % (msci) p (HAC) Stock % (spxew) Bench % (spxew) Alpha % (spxew) p (HAC)
Trending + Low vol Clean directional grind, low whipsaw 457 -0.61% +0.63% -1.32% 0.0011 -0.61% +0.39% -1.08% 0.0056 -0.61% +0.24% -0.98% 0.0046
Trending + High vol Crisis selloff or parabolic rally 5,090 +0.84% +0.94% -0.09% 0.6703 +0.84% +0.71% +0.15% 0.4776 +0.84% +0.55% +0.33% 0.1239
Non-trending + Low vol Quiet chop, summer doldrums 1,018 -0.14% +0.84% -1.04% <0.001 -0.14% +0.46% -0.69% 0.0007 -0.14% +0.28% -0.57% 0.0041
Non-trending + High vol Classical "whipsaw zone" for momentum 5,621 +0.67% +0.95% -0.26% 0.0910 +0.67% +0.73% -0.06% 0.7082 +0.67% +0.67% -0.01% 0.9389
Sub-period breakdown table (20d alpha)
Historical clustering check. If alpha concentrates in one era, the signal's robustness is questionable.
Period N Alpha % (spx) p (HAC) Alpha % (msci) p (HAC) Alpha % (spxew) p (HAC)
2015-2019 2015-01-01 → 2020-01-01 3,132 +0.06% 0.7177 +0.42% 0.0124 +0.27% 0.1318
2020-2022 2020-01-01 → 2023-01-01 3,786 -0.56% 0.0102 -0.36% 0.0995 -0.63% 0.0033
2023-2026 2023-01-01 → 2099-01-01 5,268 -0.43% 0.0304 -0.26% 0.2004 +0.29% 0.1366

Methodology and caveats

How to read. Entry at open of T+1 (one trading day after the signal fires on close of T). 20d = open T+1 to close T+20. Alpha = stock return − benchmark return over the same window (Convention A, single-sided, textbook). For bullish triggers, POSITIVE alpha = signal was right. For bearish triggers, NEGATIVE alpha = signal was right (stock underperformed market). No sign-flipping; the direction of the bet determines what "good" looks like. Per-stock regime is each stock's own ADX(14) and RV(20) at the trigger date — not market-wide state.

Three p-values, three robustness tests. (a) p_naive: scipy one-sample t-test on winsorized alphas. Optimistic because overlapping 20d windows on the same ticker inflate effective N. (b) p_hac: Newey-West HAC with lag = horizon — corrects for the overlap and is the academic-finance standard. (c) p_perm: fraction of 200 random-date null iterations with mean ≥ observed. Tests whether the signal beats random date selection at all. A signal that clears all three (pnaive, phac, pperm all < 0.05) has real information; a signal that fails pperm has zero edge even if the t-test says "significant."

Caveats. (i) Universe reflects today's active tickers; delisted losers pruned → survivorship bias. (ii) Mcap ≥ $100M filter uses today's snapshot, not point-in-time — mild lookahead on which stocks enter the sample, not on returns. (iii) Means and p-values use winsorized alphas (1/99 percentile) to prevent data errors from dominating. Medians and hit rates use raw data. (iv) Zero transaction costs assumed. Realistic bid-ask + commissions remove 20–40bps from 20d alpha on US large-caps, more on small-cap. Sub-20bps alpha is noise in practice. (v) Past performance does not predict future results.

How to use this

1 · When to reach for this signal

Neutral signal. Bullish 20d alpha —, bearish -0.36%. Neither direction beats random date selection (pperm not significant either way). Observed alpha is likely noise or universe drift rather than information about the trigger. Useful for context, not for standalone entries.

2 · When it works — the setups that drive it

  • Best bearish setup: Trending + High vol — alpha -0.09% / 20d on 5,090 historical triggers.
  • Best era for bearish: 2015-2019 — alpha +0.06% / 20d.

3 · When it fails — common false positives

  • Weakest bearish cell: Trending + Low vol — alpha -1.32% / 20d on 457 triggers.
  • Worst era for bearish: 2020-2022 — alpha -0.56% / 20d.

Signal-specific failure patterns

Clean bearish alpha across both horizons
Failed double top (price broke above the resistance then fell back below) fires 12,186 bearish triggers. α=−0.36 at 20d (p(HAC)=0.001, p_perm=0.005), widening to −0.92 at 60d (p<1e-4, p_perm=0.005). The 'failed breakout trapping late buyers' thesis produces the strongest pattern-signal alpha in the suite.
evidence: bearish 20d α=−0.36 p_perm=0.005; 60d α=−0.92 p_perm=0.005
2015-2019 was a near-null regime
Sub-period bearish 20d: 2015-2019 α=+0.06 (signal fails), 2020-2022 α=−0.57, 2023-2026 α=−0.43. The failed-breakout signal works in recent years; 2015-2019 it essentially broke even. Modern-market edge is consistent with narrow-breadth megacap leadership where failed breakouts on non-leaders are penalized.
evidence: bearish 20d by period: +0.06, −0.57, −0.43
By design, this signal catches 'trap' setups
The failed-double-top structure specifically identifies a 'bulls-trapped' event: breakout above the double-top resistance, then the close falls back below, stranding late buyers with a losing position. Textbook bearish reversal pattern. The data confirms the textbook here: 60d alpha of −0.92% is substantial.
evidence: 60d alpha −0.92 with p(HAC)<1e-4
Bullish direction does not exist by design
Despite the signal name suggesting a direction-neutral setup, 'failed double top' is definitionally bearish. In v2 backtest data a small number of bullish-tagged rows (N=10) existed as pre-wrapper-refactor legacy; those were deleted in v3. The signal is single-direction.

4 · Pairing inside a screen

The statements below describe how this signal relates to others by construction — which indicator family it belongs to, and where same-family redundancy might reduce the independence of evidence inside a Daily Report. These are taxonomic classifications drawn from standard technical-analysis texts; they are not pairing backtests. A multi-signal convergence backtest is planned but not yet run.

Sequential with completed pattern

Failed-double-top and double-top-breakout fire on the same underlying pattern structure at different points: the breakout signal fires when price breaks the neckline; failed_double_top fires when the pattern fails to complete and price reverses (Edwards & Magee, Technical Analysis of Stock Trends, 11th ed. 2018; Bulkowski, Encyclopedia of Chart Patterns, 3rd ed. 2021). They are sequential rather than concurrent — one signal replacing the other as the setup evolves, not two independent pieces of evidence.

What would likely rescue this signal

This block calls out the data or conditions that could turn a technically weak signal into a usable one in a composite screen. Based on signal mechanics and the observed failure patterns above; individual combinations are not yet backtested.

  • Hold to 60dAlpha compounds 20d → 60d significantly. Time stop rather than tight invalidation price stop.
  • Volume-gate the failure dayFailures on heavy volume are distribution events; on light volume they're noise that often reverts. Vol filter may concentrate the −0.36/−0.92 alpha.

See also Why technical-only signals don't survive on their own for the broader argument.

5 · Before you act — a 5-point checklist

  1. Normal trading day? Rule out earnings (within ±3 days), ex-dividend, or known corporate-action dates — the signal is almost certainly reading noise, not momentum, in those windows.
  2. Where is price vs its own 50 / 200 DMA? Pattern signals carry their own structural context; check that the implied support/resistance levels have historical relevance, not just the most-recent 3-month range.
  3. What's the sector breadth doing? An isolated signal in a broadly down-trending sector is a lower-confidence setup than one firing with the rest of its peer group.
  4. Is ADV20 enough for your size? If the trigger is on a $500M name and you want to move $1M notional, you're the tape. Consider adv20d ≥ 5% of your intended position.
  5. What invalidates you? Define a price level (for longs: a close below the trigger-day low; for shorts: close above the trigger-day high) and honor it. The backtest alpha is an average; any one trade can be at either tail.

Execution notes

Strong bearish edge. Hold the full 60d window — alpha doubles from 20d to 60d. Entry open T+1. Skip 2015-2019 sample in calibration (regime-different).