Home
  • Project Info
  • Data & Methods
  • Results & Discussion
    • Major Causes
    • Time Pattern
    • Airline Delay Time
  • Reference
  • Full Report

Data and Approaches


Data

Source

We used database “Airline On-Time Performance Data” coming from Bureau of Transportation Statistics (BTS), which has records of the on-time performance of domestic flights operated by large air carriers. The database is accessed from here.

Overview

We selected 23 out of 109 variables of our interest, and chose time ranges from 2012 January to 2016 December. In total, there are 29,722,792 observations of 23 variables in our dataset. Each observation represents one flight. Variables fall into three categories based on variable types: Indicator variables, Continuous variables, Categorical variables, including summary information on the flights performance and descriptive information about the flights.

Definition of flight delay

A flight is counted as “delayed” if it arrived 15min later than scheduled arrival.

5 Causes of delay

  • Carrier Delay: including Aircraft cleaning, Maintenance, Late crew, etc.
  • Weather Delay: including Below minimum conditions, Thunder Storm, Tornado, etc.
  • National Air System Delay (nas_delay): including Air Traffic Control (ATC), Bird strikes, Closed Runways etc.
  • Security Delay: including Lines at screening area that exceed standard time, Bomb threat, Inoperative screening equipment - TSA etc.
  • Late Arriving Aircraft Delay (late_aircraft_delay) means a previous flight with the same aircraft arrived late which caused the present flight to depart late.

Approaches

  • Average delay minute per delayed flight was used as a measurement of delay performance.
  • T-test was used to test if the difference was significant.

Analysis Outline

  • Which cause has the greatest impact on flight delay?
  • Would holiday seasons possibly related to flight delay?
  • Is there a time pattern for flight delay?
  • Which airlines have the most delays?