Flight Delay Prediction: Temporal & Network Perspectives
  • Home
  • About
  • Paper
  • ML Workflow
    • ML Worflow Overview
    • Phase 1: Baseline
    • Phase 2
  • Aero Flux

Aero Flux

Welcome to Aero Flux!

Feel free to browse the site below or scroll down to read more about what Aero Flux is. For a test run click on the flight number and enter in the following

  • Flight: 2685
  • Date: 2019-11-11
  • Origin: RDM
  • Dest: AZA

What is AeroFlux?

AeroFlux is a predictive intelligence system for U.S. domestic flight delays. Given a specific flight — identified by tail number, date, origin, and destination — it predicts whether that flight will arrive delayed by 15 minutes or more, and estimates the delay in minutes. It does this by modeling the full operational context of the aircraft: where it came from, how the previous two legs performed, the turnaround time, cumulative delays accumulated during the day, and weather conditions at departure.

The output is a full prediction report: a probability score, a delay estimate, a comparison against the actual recorded outcome, a geographic map of the aircraft’s rotation chain, and a breakdown of the key features that drove the prediction.

A precursor to a digital twin

AeroFlux is not yet a digital twin. A true digital twin would ingest live operational data in real time, continuously update its state, and support simulation — answering questions like what happens to the next four flights on this tail if this departure slips 45 minutes?

AeroFlux is the immediate precursor. The machine learning pipeline, feature engineering, model inference layer, and interactive interface are all in place. What it currently lacks is the live data feed connecting it to real-time flight state. That connection is the primary objective for the next phase of development. The progression looks like this:

  • Phase 1 (complete) — Historical proof of concept: models trained, pipeline built, interactive app deployed
  • Phase 2 (next) — Live data integration: connect to FAA SWIM or a commercial flight data API, replace historical lookups with real-time inference, add live weather
  • Phase 3 — Introduce more sophisticated modeling: adding more spatiotemporal and network-aware complexity through ST-GCN and attention-based Transformer models, physics-informed neural networks using telemetry data, richer datasets regarding passenger delays aided by LLM-parsed natural language feeds and much more.
  • Phase 4 — Digital twin: a continuously updated bidirectional model of the domestic flight network with full scenario simulation

How it was built

The data engineering layer uses Polars, a Rust-based DataFrame library that leverages Apache Arrow columnar memory — substantially more efficient than pandas for this scale of data. The pipeline engineers lag features (previous one and two flights on the same tail), turnaround times, cumulative aircraft-day delays, route traffic volumes, and holiday proximity from raw BTS parquet files.

A slim cached parquet is written on first run and loaded directly on subsequent starts to reduce resource demand.

The application is a Plotly Dash app served by Gunicorn on AWS Lightsail.

The data

Due to reasource limitations (4 GB of RAM) and cost ($12/month and much more for more resources) the prototype focuses on November 2019 data to However, the models were trained on over 30 million flight records. See our paper for more details.

The models

Each model consists of a paired classifier (predicts the probability that arrival delay will be 15 minutes or more) and a regressor (predicts the delay in minutes). Two models are currently deployed:

XGB Full uses 43 engineered features across five categories: propagation features (previous leg delays, turnaround time, rotation continuity), aircraft-day features (leg number, cumulative delays, relative leg position), schedule features (departure hour, weekday, month, time bucket, distance), weather features (temperature, wind speed, ceiling height at origin and destination), and network features (route frequency, airport volume, holiday proximity).

XGB Full Aircraft extends this with four additional aircraft characteristics: year of manufacture, number of engines, number of seats, and aircraft age. This allows the model to account for fleet-specific delay patterns.

These models were trained with the ML Pipeline (see the docs the ML Pipeline) before the app was deployed.

Limitations

AeroFlux is a research prototype and has several important limitations to keep in mind.

All predictions are run against historical 2019 data. There is no live feed — you are seeing what the model would have predicted for a flight that has already happened, compared to what actually occurred. This is intentional for validation but is the primary gap for operational use.

The system covers domestic U.S. carrier operations only. International routes, cargo, and regional carriers not reporting to BTS are excluded. There is no modeling of international disruptions cascading into domestic operations.

AeroFlux predicts a single flight. It does not yet propagate a predicted delay forward to downstream flights on the same tail or to connecting passengers. That simulation layer is the core of the digital twin roadmap.

Infrastructure

The app runs on AWS Lightsail (4GB RAM, 2 vCPU). Flight data and model artifacts are stored in Amazon S3 and downloaded to the instance on startup. The domain aeroflux.duckdns.org.

Source code is at github.com/jonathanwilsonami/aero-flux.