CH(AI)OS THEORY 2026

Chaos Engineering Reimagined for AI-Native Systems

Break your AI in staging so it doesn't break in production

AI Chaos Engineering is the discipline of deliberately injecting faults into AI-integrated systems to expose weaknesses before they cause real damage. Not model benchmarks. Not adversarial ML papers. Controlled chaos experiments at the integration boundaries where AI meets operational reality -- with concrete playbooks for fault injection, semantic observability, and resilience verification.

Chaos Engineering: proactive fault injection on your schedule
Adversarial ML: every interface is an attack surface
SRE: error budgets, SLOs, continuous monitoring
Cybersecurity: threat models, trust boundaries, red teams
7 Experiment Domains · 8 Interactive Vectors · 21+ Fault Injections · 4 Disciplines Fused

Why AI Needs Its Own Chaos Engineering

Traditional chaos engineering assumes deterministic components. Adversarial ML targets model weights in isolation. SRE tooling monitors infrastructure, not semantics. None of them cover what happens when an AI system meets the real world. An LLM doesn't crash -- it degrades. It hallucinates with certitude. It returns a 200 OK while producing operationally catastrophic outputs.

AI Chaos Engineering closes that gap. You design controlled experiments that inject faults at integration boundaries -- sensor layers, confidence pipelines, tool-use chains, human-AI feedback loops -- and measure whether the system degrades gracefully or fails silently. You run these experiments continuously, in staging and in production, before your users discover the failure for you.
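
Concretely, every experiment in this report follows the same loop as classic chaos tooling: verify a steady-state hypothesis, inject the fault, measure semantic metrics rather than status codes, and roll back. A minimal sketch of that loop, assuming nothing beyond Python's standard library; Experiment and run are illustrative names, not any framework's API:

```python
# Minimal chaos-experiment loop. Illustrative only: "Experiment" and "run"
# are not a real framework's API; wire in your own probes and faults.
from dataclasses import dataclass
from typing import Callable, Dict

@dataclass
class Experiment:
    name: str
    steady_state: Callable[[], bool]           # hypothesis that should survive the fault
    inject: Callable[[], Callable[[], None]]   # start the fault, return a rollback handle
    measure: Callable[[], Dict[str, float]]    # semantic metrics, not HTTP status codes

def run(exp: Experiment) -> Dict[str, float]:
    assert exp.steady_state(), f"{exp.name}: unhealthy before injection, aborting"
    rollback = exp.inject()
    try:
        metrics = exp.measure()
        # Graceful degradation means the hypothesis still holds under the fault.
        metrics["graceful"] = float(exp.steady_state())
    finally:
        rollback()  # always restore the system, even if measurement raised
    return metrics
```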

7 Chaos Experiment Domains
Vec. 1 · Data Poisoning · Severity: Critical

State Corruption Under Intermittent Connectivity

Edge-cloud divergence when connectivity drops. Models continue inferring, state reconciliation silently fails, and nobody tests the sync layer for semantic correctness.

Inject Protocol: network partitions of escalating duration, with model updates pushed during the blackout (sketched below)
Measure: state reconciliation latency, inference divergence rate, silent conflict count
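A minimal sketch of the silent-conflict measurement, assuming a toy last-writer-wins sync layer; Node, the write probabilities, and the conflict rule are illustrative stand-ins for a real edge-cloud stack:

```python
import random

# Toy edge-cloud partition: both sides keep writing during the blackout,
# then we count the conflicts the sync layer would resolve silently.
class Node:
    def __init__(self):
        self.state = {}                          # key -> (value, version)

    def write(self, key, value):
        _, version = self.state.get(key, (None, 0))
        self.state[key] = (value, version + 1)

def silent_conflicts(edge, cloud):
    """Same version number, different value: last-writer-wins picks one
    side arbitrarily and reports success -- the silent conflict we count."""
    conflicts = 0
    for key in set(edge.state) & set(cloud.state):
        e_val, e_ver = edge.state[key]
        c_val, c_ver = cloud.state[key]
        if e_ver == c_ver and e_val != c_val:
            conflicts += 1
    return conflicts

random.seed(0)
edge, cloud = Node(), Node()
for step in range(600):                          # a 10-minute blackout, 1 write/s
    if random.random() < 0.7:
        edge.write(f"track_{step % 20}", f"edge@{step}")
    if random.random() < 0.7:
        cloud.write(f"track_{step % 20}", f"cloud@{step}")
print("silent conflicts on reconnect:", silent_conflicts(edge, cloud))
```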
Vec. 2 · Adversarial Evasion · Severity: High

Cascading Confidence Collapse

Multi-stage agentic pipelines where each handoff launders uncertainty. Low-confidence outputs wrapped in high-confidence formats cascade through the chain unchecked.

Inject Protocol: confidence score corruption at each handoff point (sketched below)
Measure: end-to-end calibration error, confidence laundering index
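One way to sketch the laundering effect, assuming a toy three-stage pipeline where an honest stage reports its true accuracy as its confidence; the stage accuracies and the 0.95 confidence floor are invented numbers:

```python
import random

# Three handoffs; each stage is right with probability p and, when honest,
# reports exactly p as its confidence. The fault wraps every handoff in a
# high-confidence format. All numbers here are illustrative.
STAGE_ACCURACIES = (0.9, 0.8, 0.7)

def run_pipeline(n=10_000, launder=False):
    total_gap = 0.0
    for _ in range(n):
        ok, conf = True, 1.0
        for p in STAGE_ACCURACIES:
            ok = ok and (random.random() < p)
            reported = max(p, 0.95) if launder else p
            conf *= reported
        total_gap += conf - ok               # positive = end-to-end overconfidence
    return total_gap / n

random.seed(0)
print("honest handoffs, calibration gap:", round(run_pipeline(), 3))                 # ~0.0
print("laundered handoffs, calibration gap:", round(run_pipeline(launder=True), 3))  # ~0.35
```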
Vec. 3 · Model Extraction · Severity: High

Adversarial Operational Context

Not adversarial inputs to the model -- adversarial conditions around it. Sensor feeds degraded by electronic warfare, spoofed telemetry, operator fatigue altering interaction patterns.

Inject Protocol: environmental degradation at the sensor layer plus operator behavior anomalies (sketched below)
Measure: output stability curves, detection latency, false action rate
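A toy version of the sensor-layer injection; the detector, the noise and dropout model, and the 0.5 action threshold are assumptions standing in for a real perception stack:

```python
import random

# Degrade a clean sensor feed with jamming-style dropouts and Gaussian noise,
# then measure how often the decision flips relative to the clean feed.
def detector(reading):
    return reading > 0.5                         # "take action" above threshold

def degrade(reading, noise=0.3, dropout=0.1):
    if random.random() < dropout:
        return 0.0                               # jammed sample reads as silence
    return reading + random.gauss(0, noise)

random.seed(0)
clean = [random.uniform(0, 1) for _ in range(10_000)]
flips = sum(detector(degrade(r)) != detector(r) for r in clean)
print("false action rate under degradation:", flips / len(clean))
```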
Vec. 4 · Inversion & Privacy · Severity: Medium

Mode Flapping

Systems oscillating between AI and rules-based fallback 40 times a minute. A confidence threshold calibrated in the lab falls apart in the field, and operator trust collapses in 90 seconds.

Inject Protocol: confidence oscillation at frequencies from once per hour to 60x per minute (sketched below)
Measure: mode switch frequency, operator override rate, time-to-distrust
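The flapping is easy to reproduce in miniature, and a hysteresis band (distinct enter and exit thresholds) is the standard mitigation; a sketch assuming a sine-modulated confidence stream and invented thresholds:

```python
import math

# Confidence oscillates just around the 0.7 switchover; a single threshold
# flaps between AI and fallback modes, a hysteresis band does not.
def mode_switches(confidences, enter_ai=0.7, exit_ai=0.7):
    ai_mode, switches = True, 0
    for c in confidences:
        if ai_mode and c < exit_ai:
            ai_mode, switches = False, switches + 1
        elif not ai_mode and c >= enter_ai:
            ai_mode, switches = True, switches + 1
    return switches

# One minute of 1 Hz samples oscillating between 0.65 and 0.75 (10 s period).
stream = [0.7 + 0.05 * math.sin(2 * math.pi * t / 10) for t in range(60)]
print("single threshold:", mode_switches(stream))                              # flaps
print("hysteresis band:", mode_switches(stream, enter_ai=0.75, exit_ai=0.65))  # 0
```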
Vec. 5 · Reward Hacking · Severity: High

Silent Model Drift

The AI equivalent of a slow gas leak. Accuracy degrades 12% over six months, outputs still look reasonable, and the validation suite isn't running in production. Nothing melts down.

Inject Protocol: synthetic distribution shift, with feature means drifting 1% per day over 60 days (sketched below)
Measure: detection lag, degradation curve shape, downstream decision impact
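A sketch of the drift injection paired with a naive rolling z-test detector; the feature distribution, sample sizes, and 3-sigma trigger are invented, and the output to watch is the lag between day 1 and the alarm:

```python
import random, statistics

# Feature mean compounds 1% of drift per day; detection lag is the metric.
random.seed(0)
baseline = [random.gauss(100, 20) for _ in range(5_000)]
mu = statistics.mean(baseline)
sigma = statistics.stdev(baseline)

for day in range(1, 61):
    drifted_mean = 100 * 1.01 ** day              # 1% compounding drift per day
    daily = [random.gauss(drifted_mean, 20) for _ in range(100)]
    z = (statistics.mean(daily) - mu) / (sigma / len(daily) ** 0.5)
    if abs(z) > 3:                                # naive 3-sigma drift alarm
        print(f"drift detected on day {day}, z = {z:.1f}")
        break
else:
    print("drift never detected within 60 days")
```

Every individual day looks plausible on its own; the days between injection and alarm are where downstream decisions quietly degrade.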
Vec. 6 · Distribution Shift · Severity: Critical

Tool Use Hallucination in Agentic Chains

Models calling tools that don't exist, fabricating parameters, inventing plausible responses. Downstream systems process hallucinated results as ground truth. Phantom side effects.

Inject Protocol: tool availability faults, latency injection, malformed response returns (sketched below)
Measure: hallucinated call rate, fabricated response rate, downstream contamination spread
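A chaos proxy over the agent's tool dispatch makes all three faults injectable; the registry, fault rates, and truncated-JSON payload below are illustrative stand-ins for a real tool layer:

```python
import json, random, time

# Wrap tool dispatch so every call can fail, stall, or return garbage.
TOOLS = {"get_weather": lambda city: {"city": city, "temp_c": 21}}

def chaotic_call(name, *args, p_missing=0.1, p_slow=0.05, p_malformed=0.1):
    if name not in TOOLS or random.random() < p_missing:
        raise LookupError(f"tool {name!r} unavailable")     # availability fault
    if random.random() < p_slow:
        time.sleep(0.05)                                    # latency injection (shortened for demo)
    result = TOOLS[name](*args)
    if random.random() < p_malformed:
        payload = json.dumps(result)
        return payload[: len(payload) // 2]                 # malformed response
    return result

# Measure: does anything downstream validate, or is garbage taken as ground truth?
random.seed(0)
contaminated = 0
for _ in range(1_000):
    try:
        out = chaotic_call("get_weather", "Oslo")
        if not isinstance(out, dict):
            contaminated += 1              # truncated JSON accepted without validation
    except LookupError:
        pass                               # visible failure: the agent can react
print("downstream contamination rate:", contaminated / 1_000)
```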
Vec. 7 · Algorithmic Bias · Severity: Medium

Feedback Loop Poisoning in Human-AI Teaming

Operators trust the model and only correct the obvious errors. Subtly wrong outputs get approved, and human and AI co-create a degraded standard of correctness neither would reach alone.

Inject Protocol: deliberate subtle errors seeded at a known rate, with subtlety varied over time (sketched below)
Measure: catch rate by severity, trust calibration drift, feedback contamination rate
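A sketch of the error-seeding protocol, assuming a simple operator model in which catch probability falls with subtlety; the rates are invented, and the point is how approved-but-wrong outputs accumulate in the feedback set:

```python
import random

# Seed errors at a known rate, simulate the operator review, and measure
# both the catch rate by severity and the contamination of the feedback set.
random.seed(1)
CATCH_P = {"obvious": 0.95, "moderate": 0.6, "subtle": 0.15}

approved_wrong = 0
catches = {sev: [0, 0] for sev in CATCH_P}       # severity -> [caught, seeded]
for _ in range(10_000):
    if random.random() < 0.05:                   # seed errors at a known 5% rate
        severity = random.choice(list(CATCH_P))
        catches[severity][1] += 1
        if random.random() < CATCH_P[severity]:
            catches[severity][0] += 1            # operator corrected it
        else:
            approved_wrong += 1                  # wrong output enters the feedback loop

for sev, (caught, seeded) in catches.items():
    print(f"{sev}: catch rate {caught / seeded:.0%}")
print("feedback contamination rate:", approved_wrong / 10_000)
```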
Vec. 8 · Report Index

Complete Vector Reference

Cross-reference guide and vector index for the full CH(AI)OS THEORY technical report.