// MODEL VALIDATION
Three tiers of validation.
The OWI is a structured model where AI is used as a data ingestion and signal extraction tool. It is not an AI system generating analysis. It is validated at three levels: structural completeness, internal logical consistency, and real-world model accuracy against ground truth data. Explore this page to simulate, test, and verify each tier.
Does every output field exist? Are all sections populated? Is the model output structurally complete? This is the baseline - the model must produce all expected outputs before deeper validation is meaningful.
| Check | Result | |
|---|---|---|
| OWI score present | 25.46 | |
| OWI in 0–100 range | 25.46 | |
| All 5 sub-indices present | 5 present | |
| Trajectory label present | Ahead of Schedule | |
| Trajectory color present | #38BFFF | |
| Momentum factor present | 1.1500 | |
| History series present | 18 months | |
| News data present | 4 headlines | |
| News signal summary present | Present | |
| Live scores present | 5 sub-indices | |
| Calculated_at timestamp | 2026-04-14 02:26:23 UTC | |
| Milestone store accessible | OK | |
| AI Capability: score field | 30.53 | |
| AI Capability: weight field | 28% | |
| AI Capability: color field | #38BFFF | |
| Humanoid Robotics: score field | 30.40 | |
| Humanoid Robotics: weight field | 22% | |
| Humanoid Robotics: color field | #A78BFF | |
| Economic Abundance: score field | 11.92 | |
| Economic Abundance: weight field | 18% | |
| Economic Abundance: color field | #2DFFA0 | |
| Labor Market Shift: score field | 16.84 | |
| Labor Market Shift: weight field | 20% | |
| Labor Market Shift: color field | #FFCB47 | |
| Wealth Distribution: score field | 11.58 | |
| Wealth Distribution: weight field | 12% | |
| Wealth Distribution: color field | #FF7096 |
Does the reasoning contradict itself? Do conclusions follow from inputs? This tier verifies that every number in the model is internally coherent, the weighted sum actually produces the OWI, sub-index scores follow from their components, trajectory matches the computed ratio, and no formula is silently violated.
| Check | Result | |
|---|---|---|
| Weights sum to 1.0 | 1.0000 | |
| Momentum 1.0 ≤ M ≤ 1.15 | 1.1500 | |
| OWI = Σ(score×weight)×momentum | Recomputed: 25.46 vs Reported: 25.46 | |
| Sub-indices = live+bonus+news | 4/5 consistent | |
| Live score direction consistent | No inversions | |
| All news adjustments within ±5 | OK | |
| Composite news signal ±5 | 1.00 | |
| All milestone bonuses 0–8 | OK | |
| All sub-index scores 0–100 | OK | |
| Trajectory matches OWI/time ratio | Ratio: 1.115 → AHEAD (got: AHEAD) | |
| Headlines have non-zero scores | 3/4 non-zero | |
| No NaN/null in sub-index scores | Clean |
Formula Trace - Live Recomputation
Is the model correct relative to reality? Does it track against ground truth? Does it outperform a naive baseline? Are the feedback loops working? This is the hardest tier - it requires external reference points and cannot be satisfied by internal checks alone.
| Check | Result | |
|---|---|---|
| Unemployment trend → labor score direction | Cannot verify - raw unemployment value not in debug | |
| OWI outperforms naive time baseline | OWI: 25.46 vs naive: 22.84% (time elapsed) | |
| Time progress 10yr accurate | Computed: 22.83% vs expected: 22.84% | |
| Momentum reflects recent confirmations | Confirmations: 8 → expected M: 1.15, actual: 1.1500 | |
| Score history accumulating | 26 daily records in last 30 days | |
| News signal direction consistent with OWI position | News: 1.00 (positive) · OWI vs naive: above | |
| No scores inflated without live data | Verified | |
| Milestone feedback loop active | 84 total confirmed events | |
| OWI within plausible historical range | 25.46 (range: 0–100.00) | |
| Trajectory key↔label consistent | AHEAD → Ahead of Schedule |
Ground Truth 1 - OWI vs Naive Time-Elapsed Baseline
Ground Truth 2 - Milestone Confirmation Feedback Loop
Ground Truth 3 - Score History Accumulation (last 30 days)
Per Sub-Index Weighted Contribution
Each sub-index score = live data components average + milestone bonus + news signal. Every live component is sourced from a named public data series. If a component has no live data, it contributes 0 rather than a fabricated value.
AI Capability - 28% weight - Score: 30.53
| Series | Raw Value | Score |
|---|---|---|
| - | - | |
| - | - | |
| - | - |
Humanoid Robotics - 22% weight - Score: 30.40
| Series | Raw Value | Score |
|---|---|---|
| - | - | |
| - | - |
Economic Abundance - 18% weight - Score: 11.92
| Series | Raw Value | Score |
|---|---|---|
| - | - | |
| - | - | |
| - | - | |
| - | - |
Labor Market Shift - 20% weight - Score: 16.84
| Series | Raw Value | Score |
|---|---|---|
| - | - | |
| - | - | |
| - | - |
Wealth Distribution - 12% weight - Score: 11.58
| Series | Raw Value | Score |
|---|---|---|
| - | - | |
| - | - | |
| - | - |
The OWI is a model-backed system with an AI interpretation layer. Real economic data forms the base score. Claude scores each headline −5 to +5 for its relevance to the optional work prediction, nudging each sub-index by up to ±5 points. This section shows exact prompt sent to Claude.
You are the OWI scoring engine for optionalwork.com, tracking Elon Musk's prediction that AI/robotics will make work optional within 10-20 years.
TASK 1 - Score each headline -5 to +5 for OWI impact:
+5 = Major acceleration (AGI achieved, mass robot deployment, UBI enacted)
+3 = Moderate positive signal
+1 = Weak positive
0 = Neutral
-1 = Minor setback
-3 = Moderate setback
-5 = Strong negative (ban, major failure)
Categories: ai_capability | robotics | economic_abundance | labor_displacement | wealth_distribution
TASK 2 - Detect if any headline CONFIRMS a watched target (high confidence only):
WATCHED TARGETS:
[live target list injected at runtime - 241 targets across 5 categories]
Return ONLY valid JSON, no markdown:
{
"scored_headlines": [{"headline":"...","score":3,"category":"ai_capability","reasoning":"...","url":"..."}],
"composite_signal": 1.4,
"signal_summary": "2-3 sentence summary of today's OWI signal",
"milestone_detections": [{"target_label":"...","category":"...","source_name":"...","source_url":"...","date":"YYYY-MM-DD","confidence":"high"}]
}
Confirmed milestones contribute a log-scaled bonus to each sub-index. Every confirmed event requires a source URL - no unverified claims affect the score.
| Category | Confirmed | Calculation | Bonus |
|---|---|---|---|
| AI Capability | 41 | min(8.0, log₂(41+1) × 2.8) | +8.00 |
| Humanoid Robotics | 20 | min(8.0, log₂(20+1) × 2.8) | +8.00 |
| Economic Abundance | 8 | min(8.0, log₂(8+1) × 2.8) | +8.00 |
| Labor Market Shift | 9 | min(8.0, log₂(9+1) × 2.8) | +8.00 |
| Wealth Distribution | 6 | min(8.0, log₂(6+1) × 2.8) | +7.86 |
Multiple output analyses to characterize whether the OWI behaves as a constrained model (bounded, deterministic, rule-enforced) or as structured reasoning with checks (logic gates, consistency enforcement, interpretive layers). A well-designed index should exhibit both properties at different layers.
Analysis 1 - Constraint Enforcement (20/20 active)
Hard constraints are rules that the model enforces unconditionally - floors, ceilings, caps. A constrained model cannot produce outputs outside these bounds regardless of input magnitude.
| Status | Constraint | Current Value | Enforcement Mechanism |
|---|---|---|---|
| ✓ | OWI hard ceiling (≤ 100) | 25.46 | min(100, adjusted) enforced in engine |
| ✓ | OWI hard floor (≥ 0) | 25.46 | max(0, ...) enforced in engine |
| ✓ | AI Capability: score ∈ [0, 100] | 30.53 | Enforced by min(100, max(0, ...)) |
| ✓ | AI Capability: news adj ∈ [−5, +5] | 0.00 | max(-5, min(5, ...)) enforced before application |
| ✓ | AI Capability: milestone bonus ∈ [0, 8] | 8.00 | min(8.0, log₂(n+1)×2.8) - log scale caps growth |
| ✓ | Humanoid Robotics: score ∈ [0, 100] | 30.40 | Enforced by min(100, max(0, ...)) |
| ✓ | Humanoid Robotics: news adj ∈ [−5, +5] | 0.70 | max(-5, min(5, ...)) enforced before application |
| ✓ | Humanoid Robotics: milestone bonus ∈ [0, 8] | 8.00 | min(8.0, log₂(n+1)×2.8) - log scale caps growth |
| ✓ | Economic Abundance: score ∈ [0, 100] | 11.92 | Enforced by min(100, max(0, ...)) |
| ✓ | Economic Abundance: news adj ∈ [−5, +5] | 0.00 | max(-5, min(5, ...)) enforced before application |
| ✓ | Economic Abundance: milestone bonus ∈ [0, 8] | 8.00 | min(8.0, log₂(n+1)×2.8) - log scale caps growth |
| ✓ | Labor Market Shift: score ∈ [0, 100] | 16.84 | Enforced by min(100, max(0, ...)) |
| ✓ | Labor Market Shift: news adj ∈ [−5, +5] | 0.00 | max(-5, min(5, ...)) enforced before application |
| ✓ | Labor Market Shift: milestone bonus ∈ [0, 8] | 8.00 | min(8.0, log₂(n+1)×2.8) - log scale caps growth |
| ✓ | Wealth Distribution: score ∈ [0, 100] | 11.58 | Enforced by min(100, max(0, ...)) |
| ✓ | Wealth Distribution: news adj ∈ [−5, +5] | 0.00 | max(-5, min(5, ...)) enforced before application |
| ✓ | Wealth Distribution: milestone bonus ∈ [0, 8] | 7.86 | min(8.0, log₂(n+1)×2.8) - log scale caps growth |
| ✓ | Momentum factor ∈ [1.0, 1.15] | 1.1500 | 1 + min(0.15, n×0.025) - hard cap prevents runaway multiplier |
| ✓ | Daily micro-signal ∈ [−0.8, +0.8] | Deterministic seed | mt_rand seeded by date+key - same input always produces same output |
| ✓ | Weights sum exactly to 1.0 | 1.0000 | Hardcoded distribution - 0.28+0.22+0.18+0.20+0.12 = 1.0 |
Analysis 2 - Structured Reasoning Checks (6/6 verified)
Structured reasoning checks verify that the model's logic gates are functioning - outputs follow from inputs, not from hardcoded values or arbitrary lookups. This distinguishes a reasoning system from a lookup table.
| Status | Check | Result | What This Proves |
|---|---|---|---|
| ✓ | Score > 0 only when live data present | 5 sources active | Engine returns 0 for sub-index if no live data and no milestone bonus |
| ✓ | Trajectory derived from OWI÷time ratio | Ratio: 1.115 → AHEAD | Not a label lookup - computed from (owi / time_progress_10) |
| ✓ | Milestone bonus is monotonically increasing (log scale) | n=1→2.80, n=5→7.24, n=10→8.00 | Each additional confirmation adds less than the last - diminishing returns enforced |
| ✓ | News scores use graduated scale (not binary) | 3 distinct score values in current batch | Claude produces nuanced −5 to +5 scoring, not just positive/negative flags |
| ✓ | Sub-indices without live data not inflated | Verified | All sub-indices have live data |
| ✓ | OWI independent of time (not locked to elapsed %) | OWI: 25.46 vs time: 22.83% | OWI is data-driven - it can exceed or fall below naive time-elapsed baseline |
Analysis 3 - Sensitivity Analysis (10% perturbation per sub-index)
If each sub-index score were 10% higher, how much would the OWI change? This reveals which categories have the most leverage over the final score - and confirms the model responds proportionally to input changes, not in binary jumps.
| Sub-Index | Weight | Current Score | OWI Δ if +10% | Leverage |
|---|---|---|---|---|
| AI Capability | 28% | 30.53 | +0.855 | |
| Humanoid Robotics | 22% | 30.40 | +0.669 | |
| Labor Market Shift | 20% | 16.84 | +0.337 | |
| Economic Abundance | 18% | 11.92 | +0.215 | |
| Wealth Distribution | 12% | 11.58 | +0.139 |
Verdict - Constrained model with structured reasoning
Four deterministic test scenarios with hand-calculable expected outputs. Each scenario uses fixed, simple inputs so the result can be verified with a calculator before running the model. If the model produces the correct output for all four scenarios, the formula implementation is confirmed correct.
Scenario A - Zero State ✓ PASS
All live scores = 0, no milestones, no news signal, no recent confirmations. OWI must equal exactly 0.
| Sub-Index | Live | +Bonus | +News | Score | ×Weight | Contrib |
|---|---|---|---|---|---|---|
| Ai Capability | 0.00 | +0.00 | +0.00 | 0.00 | ×0.28 | 0.0000 |
| Robotics | 0.00 | +0.00 | +0.00 | 0.00 | ×0.22 | 0.0000 |
| Economic Abundance | 0.00 | +0.00 | +0.00 | 0.00 | ×0.18 | 0.0000 |
| Labor Displacement | 0.00 | +0.00 | +0.00 | 0.00 | ×0.20 | 0.0000 |
| Wealth Distribution | 0.00 | +0.00 | +0.00 | 0.00 | ×0.12 | 0.0000 |
| Raw sum × momentum | 0.0000 × 1.0000 = 0.00 | |||||
Scenario B - Uniform 50, No Bonuses ✓ PASS
All live scores = 50, no milestones confirmed, no news, no recent confirmations. OWI must equal 50.00.
| Sub-Index | Live | +Bonus | +News | Score | ×Weight | Contrib |
|---|---|---|---|---|---|---|
| Ai Capability | 50.00 | +0.00 | +0.00 | 50.00 | ×0.28 | 14.0000 |
| Robotics | 50.00 | +0.00 | +0.00 | 50.00 | ×0.22 | 11.0000 |
| Economic Abundance | 50.00 | +0.00 | +0.00 | 50.00 | ×0.18 | 9.0000 |
| Labor Displacement | 50.00 | +0.00 | +0.00 | 50.00 | ×0.20 | 10.0000 |
| Wealth Distribution | 50.00 | +0.00 | +0.00 | 50.00 | ×0.12 | 6.0000 |
| Raw sum × momentum | 50.0000 × 1.0000 = 50.00 | |||||
Scenario C - Milestone Bonus Only ✓ PASS
All live scores = 0, 5 milestones confirmed per category, no news, no recent confirmations. Verify bonus = min(8, log₂(6)×2.8) = 7.22.
| Sub-Index | Live | +Bonus | +News | Score | ×Weight | Contrib |
|---|---|---|---|---|---|---|
| Ai Capability | 0.00 | +7.24 | +0.00 | 7.24 | ×0.28 | 2.0272 |
| Robotics | 0.00 | +7.24 | +0.00 | 7.24 | ×0.22 | 1.5928 |
| Economic Abundance | 0.00 | +7.24 | +0.00 | 7.24 | ×0.18 | 1.3032 |
| Labor Displacement | 0.00 | +7.24 | +0.00 | 7.24 | ×0.20 | 1.4480 |
| Wealth Distribution | 0.00 | +7.24 | +0.00 | 7.24 | ×0.12 | 0.8688 |
| Raw sum × momentum | 7.2400 × 1.0000 = 7.24 | |||||
Scenario D - Momentum Multiplier ✓ PASS
All live scores = 40, no milestones, no news, 4 recent confirmations. Momentum = 1.0+min(0.15, 4×0.025) = 1.10. OWI = 40×1.10 = 44.00.
| Sub-Index | Live | +Bonus | +News | Score | ×Weight | Contrib |
|---|---|---|---|---|---|---|
| Ai Capability | 40.00 | +0.00 | +0.00 | 40.00 | ×0.28 | 11.2000 |
| Robotics | 40.00 | +0.00 | +0.00 | 40.00 | ×0.22 | 8.8000 |
| Economic Abundance | 40.00 | +0.00 | +0.00 | 40.00 | ×0.18 | 7.2000 |
| Labor Displacement | 40.00 | +0.00 | +0.00 | 40.00 | ×0.20 | 8.0000 |
| Wealth Distribution | 40.00 | +0.00 | +0.00 | 40.00 | ×0.12 | 4.8000 |
| Raw sum × momentum | 40.0000 × 1.1000 = 44.00 | |||||
Enter raw data values directly and run the OWI formula manually. Every field maps to a real data series. Use actual published values from the sources linked below each field to verify the model produces the expected output.
Pre-loaded with live data - edit any field to simulate
Simulation Output
| Sub-Index | Live | +Bonus | +News | Score | ×Weight | Weighted |
|---|
The transition to optional work is a complex adaptive system - not a linear progression. This simulation models the five OWI stocks with their reinforcing (R) and balancing (B) feedback loops using Euler integration over a 20-year horizon. Adjust the parameters to test how sensitive the prediction is to initial conditions and loop strengths.
Feedback Loop Architecture
Simulation Parameters - Adjust Loop Strengths
Each parameter controls how strongly a feedback loop operates. Values are dimensionless multipliers. The model uses Euler integration with monthly time steps over 20 years from January 2024.
20-Year OWI Trajectory - System Dynamics Projection
Loop Dominance Analysis
Which feedback loops are dominant under current parameters? Dominant loops determine whether the system accelerates, stabilises, or oscillates. This updates live as you adjust parameters above.