Systems/Warehouse
The Warehouse is not a dashboard or a database overview. It is the system that turns raw market data into normalized, reproducible intelligence — and then reasons over that structure to describe how the market is actually behaving.
01
Ingestion
raw feeds in
02
Standardization
canonical schema
03
Interpretation
structured reasoning
04
Outputs
derived datasets
I/Purpose
One governed record that downstream systems read from, instead of many divergent copies.
Raw observations are converted into consistent internal representations before use.
A dataset defined today resolves to the same record when queried later.
01
Ingestion
raw feeds in
02
Standardization
canonical schema
03
Interpretation
structured reasoning
04
Outputs
derived datasets
II/Ingestion layer
Inbound feeds are validated, deduplicated, and converted to internal formats on arrival.
Continuous capture of external market data as it is published.
Malformed and duplicate records are removed before they can propagate.
Every stream is mapped to consistent internal representations on arrival.
III/Standardization layer
A stable instrument identity that survives vendor and symbol changes.
Ordering and timestamp rules that keep history aligned across sources.
Point-in-time pinning so historical queries never silently change.
| Field | Type | Rule |
|---|---|---|
| instrument_id | canonical | stable across vendors |
| event_time | utc_nanos | monotonic per stream |
| symbol | normalized | corporate-action aware |
| source_rev | lineage | point-in-time pinned |
IV/Interpretation layer
Flow interpretation
Regime detection
Probabilistic state, not a directional call.
How depth forms and thins, and how order-flow imbalance shifts around levels.
Inference of accumulation, distribution, and forced versus chosen activity.
Probabilistic classification of trend, volatility expansion, and mean reversion.
V/Analytical outputs
{ "symbol": "SPY", "as_of": "2026-06-03T13:45:11Z", "regime": "compressed_vol", "flow_imbalance": 0.34, "classification": "accumulation", "source_rev": "a3f9c1" }
Derived signals and condition classifications for downstream evaluation — not order routing.
Computed measures produced from the normalized record.
Market states labeled for consistent downstream consumption.
Inputs for evaluating trade decisions — without executing them.
VI/System philosophy
The same inputs and definitions yield the same outputs, every time.
Observation and inference are stored and versioned as distinct layers.
Every interpreted value resolves back to the source revision behind it.
Raw observation and structured interpretation are different things, and the Warehouse never lets them blur. Observation is preserved exactly as received; interpretation is a separate, versioned layer built on top of it. That separation is what makes a conclusion defensible — you can always walk it back to the data that produced it.