Physics-Derived Market Intelligence

Two systems. Both built from physics. Both predict volatility.

James Lawrence • CTO @ Strykr AI • Forward Deployed Engineer

System 1 classifies volatility regimes on daily data — walk-forward validated across 20 years and 53 assets. System 2 measures structural fragility in real-time order books. Together they form a complete macro risk framework.

72.1%

Regime Accuracy (Daily)

1.93x

Fragility Signal (L2)

29,912

OOS Signals

Independent Systems

Two Systems, One Framework

Strategic bias from regime classification. Tactical timing from structural analysis.

Both systems are built from physical science — one operates on daily price data (macro), the other on real-time order book depth (micro). Neither uses standard financial indicators. Together they answer: "Is the market fragile?" and "Is it about to break?"

System 1 — Regime Classifier (Daily)

13 proprietary physics-derived features. Classifies any asset into one of 6 empirically discovered volatility regimes. Walk-forward validated: 72.1% accuracy, 38.6pp spread, 29,912 out-of-sample signals across 20 years.

System 2 — Structural Fragility (Real-Time)

Computes a proprietary fragility metric from real-time order book depth. Measures how resilient the market's liquidity structure is to perturbations. When it drops, the market is structurally fragile. Validated: 1.93x volatility prediction on institutional-grade L2 data.

Price Data

Daily bars, 53 assets

→

Physics Engine

13 proprietary features

→

Regime Classifier

6 empirical regimes

→

Strategic Bias

Long vol / Short vol

Order Book L2

Real-time depth

→

Proprietary Simulation

Structural analysis

→

Fragility Index

Resilience metric

→

Tactical Trigger

Execute / Hold / Eject

The #1 feature: Primary Structural Signal (59.8% of classifier power)

A proprietary measurement adapted from physics that studies how systems behave near critical thresholds. It detects structural instability before it becomes visible in price or implied volatility. Specification available under NDA.

The Proof

Global Stress Index vs actual VIX (daily, out-of-sample 2020–2026)

Composite Stress Index (red) vs actual VIX (blue dotted). This chart uses the 2020–2026 fold from the daily walk-forward validation. The model was trained only on pre-2020 data.

CURRENT: ELEVATED

Global Regime Distribution

What percentage of all 53 assets are in each regime over time

Cyan = assets in Safe Zone (sell vol). Red = assets in Stress Zone (buy vol). Gray = Neutral. When red dominates, systemic vol expansion is likely.

Signal Accuracy

Every SELL VOL signal vs what VIX actually did 90 days later

Green dot = system said "sell vol" and VIX fell (correct). Red X = system said "sell vol" and VIX rose (wrong). Green-to-red ratio = accuracy.

Asset Deep Dives

Per-asset regime history with price overlay

Background shading shows the regime the asset was classified in at each point. Cyan = Safe Zone (sell vol). Red = Stress Zone (buy vol). Gray = Neutral. Green triangles mark SELL VOL signals.

SPY (S&P 500 ETF)

Gold

EUR/USD

Tesla

NVIDIA

Resolution Analysis

Daily is the proven resolution. Hourly is a research lead.

The daily signal is walk-forward validated across 20 years. The hourly signal showed promise in preliminary (non-walk-forward) testing but does not survive rigorous walk-forward due to limited data history (~2 years via yfinance).

Resolution	Horizon	Accuracy	Spread	Signals	Status
Daily	90 days	72.1%	38.6pp	29,912	Walk-forward validated (7 folds, 20yr)
Hourly	90 days	69.0%*	27.6pp	1,459	Preliminary (3 folds, 2yr, BUY VOL only)

Daily short-vol is the core signal

The 7-fold walk-forward on daily data is the primary validated result. Hourly data (limited to ~730 days via public sources) showed a 69% BUY VOL signal in one favorable fold but didn't generalize. With longer hourly history (e.g., from a data vendor), the complementary long-vol edge could be validated — but we won't claim it until it survives walk-forward.

Walk-Forward Validation — Daily Resolution

7 folds, 20 years, retrained from scratch each fold

The model is retrained every 3 years on daily data using only the past. No future information leaks into any fold. This is the gold standard for strategy validation.

Period	SELL VOL	BUY VOL	Combined	Spread	Signals
2005–2008	56.7%	83.8%	71.8%	43.4pp	2,903
2008–2011	78.1%	39.5%	73.6%	25.9pp	5,263
2011–2014	66.8%	66.7%	66.8%	34.9pp	3,671
2014–2017	57.1%	55.5%	56.5%	37.4pp	2,774
2017–2020	66.6%	61.8%	64.6%	38.8pp	4,426
2020–2023	85.3%	54.0%	73.0%	45.8pp	6,216
2023–2026	71.2%	53.5%	62.5%	44.2pp	4,659
Aggregate	72.1%	60.1%	67.8%	38.6pp	29,912

Crisis alpha: the system gets better when it matters

The two highest-accuracy folds are crisis periods — 83.8% BUY VOL before the GFC, 85.3% SELL VOL after COVID. Most quant strategies degrade during crises. This one improves. The physics features detect structural instability that is invisible to price-based models.

System 2 — Structural Fragility

A proprietary fragility metric predicts volatility from order book depth

System 2 computes a real-time "resilience score" from L2 order book data. When the score drops, future volatility increases. The methodology is proprietary. Results below are on institutional-grade Nasdaq L2 data.

Asset	Correlation	Fragility Ratio	Result
SPY (S&P 500)	-0.42	1.93x	Validated — fragility predicts volatility
AAPL	+0.17	0.98x	No signal — insufficient depth
MSFT	+0.01	1.06x	No signal — insufficient depth
AMZN	-0.16	1.05x	No signal — insufficient depth
SPY (Placebo)	+0.17	0.83x	Signal vanished — not an artifact of volume

Works on deep markets. Fails on thin ones. Validated by placebo.

The fragility metric produces a 1.93x volatility signal on SPY (the deepest order book in equities). It fails on single stocks where depth is insufficient. A placebo test (randomized book structure) destroyed the signal, confirming it captures genuine structural properties — not just aggregate volume. Applicable to any deep liquid market: indices, FX, crypto majors. Methodology available under NDA.

What Drives the Signal

Feature importance — what drives the classifier

13 proprietary physics-derived features. The top 3 account for 78.0% of total predictive power (averaged across 7 walk-forward folds). Feature names are obfuscated; specifications available under NDA.

Short-Vol Accuracy by Asset Class

Regime Transitions

Last 60 days — which assets are transitioning

Each cell = one asset on one day. Color = regime. When rows shift color simultaneously, a systemic regime transition is in progress.

Validation

Head-to-head against the industry standard

Same 53 assets, daily resolution. Same binary question: "Will realized vol be higher or lower 90 days from now?" Our system uses 7-fold walk-forward; GARCH uses expanding window.

Metric	GARCH(1,1)	This System (v2)
SELL VOL accuracy	58.4%	72.1%
BUY VOL accuracy	—	60.1%
Best-worst regime spread	—	38.6pp
Out-of-sample signals	—	29,912
Test methodology	Single split	7-fold walk-forward
Test period	2020–2026	2005–2026 (20 years)
Assets / classes	53 / 15	53 / 15

Known limitations

72% directional accuracy does not guarantee P&L — short-vol strategies carry asymmetric tail risk. All results are historical simulations, not live trading. BUY VOL accuracy (60.1%) is weaker than SELL VOL. 90-day horizon is medium-term — not suitable for intraday or HFT. Execution and position sizing remain open problems.

Technical

Built to run, not just to publish

SYSTEM 1 — REGIME CLASSIFIER
Proprietary features	13 (physics-derived, proprietary)
Discovery method	Gaussian Mixture Model on 381,264 bars
Classifier	Gradient-boosted ensemble + isotonic calibration
Regimes	6 (discovered empirically, not predefined)
Validation	7-fold walk-forward on daily bars (2005–2026)
Out-of-sample signals	29,912 across 53 assets and 15 classes
Anti-overfitting	Retrained 7x; signal holds in every fold (56–85%)
SYSTEM 2 — STRUCTURAL FRAGILITY
Method	Proprietary simulation on L2 order book depth
Metric	Proprietary fragility index (resilience score)
Signal	-0.42 correlation with future volatility (SPY)
Fragility ratio	1.93x (fragile vs resilient regimes)
Placebo validated	Signal vanishes on randomized book data
Applicability	Deep liquid markets (indices, FX, crypto majors)
SHARED
Stack	Python, NumPy, pandas, SciPy, scikit-learn, gradient boosting

FAQ

Is this just overfitting? +

No. The system is walk-forward validated across 7 separate 3-year periods from 2005 to 2026, retrained from scratch each fold. The combined accuracy (67.8%) never drops below 56.5% in any fold. The 38.6pp regime spread held stable across 20 years of different market conditions — bull markets, crises, low-vol eras, rate cycles.

Why not just use VIX? +

VIX is a reaction measure — implied vol as priced by the options market right now. This system is predictive — it identifies structural fragility that precedes the VIX spike, not the spike itself. In testing, the Stress Index led VIX moves by days to weeks.

What about transaction costs? +

The backtest includes 5bps transaction costs per trade. The strategy is medium-term (90-day horizon) with low turnover (18% time in market), so costs are not the primary driver of P&L. The main risk is the asymmetric payoff of short volatility, not slippage.

Is this a black box? +

No. Every feature has a specific physical interpretation and measures a distinct structural property of market dynamics. We know exactly why the model classifies each regime, unlike deep learning approaches. Full feature specifications available under NDA.

Can a fund replicate this from the website? +

No. The website shows results and obfuscated feature names — not formulas, thresholds, window sizes, or classifier hyperparameters. The 13 feature definitions, training methodology, and model configuration are proprietary.

Why physics instead of standard quant factors? +

Standard factors (momentum, mean-reversion, carry) describe what happened. Physics features describe the structural state of market dynamics — properties orthogonal to traditional signals that price-based indicators miss entirely.

Which asset classes work best? +

Bonds (94.8%), emerging markets (90.8%), and European equities (85.5%) show the strongest classification accuracy. Commodities and FX are in the 65–72% range. Credit and single-name tech equities are weakest (~49%). The system works best on assets with well-defined volatility cycles and sufficient history.

How would a fund deploy this? +

Three options: (1) Overlay — run the classifier daily on a vol-selling book and skip trades when the system flags Stress Zone, reducing tail risk. (2) Signal generation — feed regime labels into an existing risk model as an additional factor. (3) Standalone — sell vol when Safe Zone, buy vol when Stress Zone, flat otherwise. The 90-day horizon suits systematic macro and vol-arb desks.

Interested?

Two ways to work together.

License the Signal

Daily regime classifications for your asset universe. Integrate as an overlay on existing vol strategies. Covers equities, FX, commodities, bonds, and crypto.

Hire Me

Quant research or applied engineering role. Physical sciences background, production ML experience, and a working prototype with 36 years of validated results.

LinkedIn • GitHub

Full methodology, feature specifications, and live demo available under NDA.