Evaluation of the causal effect of disruption on delay as a benchmarking tool for public transport network performance

While black-box models are becoming increasingly accurate for prediction problems, the desire to master and understand the relationships between factors and the dependent variables is driving the need for the development of Causal Inference, which is gaining momentum. Urban mass transit systems typically generate large volumes of data on various aspects of operations. Statistical analyses can be used to summarise and present such data, drawn from within and between systems, to understand the drivers of performance. This empirical analysis leverages large-scale publicly available data from the General Transit Feed Specification (GTFS) feed of the Washington DC metro network to characterise the performance of transit systems, focusing on train on-time performance and the space-time propagation of disruption causing subsequent delays.

Project Overview

The project aims to leverage large datasets from public transport networks to assess and benchmark their resilience under disruption scenarios. The study applies the Synthetic Control method to create a synthetic benchmark day, allowing comparison of typical and disrupted service days.

Key Concepts

Synthetic Control Method (SC): Used to create a counterfactual scenario in the absence of disruptions, providing a robust measure for the causal effect of disruptions on delay.
Causal Inference: Addresses the confounding factors in network disruptions to isolate and measure their impact accurately.
Time Unit Adjustments: Uses R scripting to handle and preprocess time data for accuracy in delay and disruption analysis.

Database architecture

Requirements

Synth R package: to install this package, first ensure that devtools is installed with

install.packages("Synth")

GTFS Data: The research uses General Transit Feed Specification (GTFS) data; access to similar data is required for replication or adaptation.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
Data_processing		Data_processing
Synthetic_control		Synthetic_control
Tables		Tables
.gitignore		.gitignore
README.md		README.md
Research Paper.pdf		Research Paper.pdf
poster.pdf		poster.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evaluation of the causal effect of disruption on delay as a benchmarking tool for public transport network performance

Project Overview

Key Concepts

Database architecture

Requirements

About

Releases

Packages

Languages

Guillaume-amann/MScDissertation

Folders and files

Latest commit

History

Repository files navigation

Evaluation of the causal effect of disruption on delay as a benchmarking tool for public transport network performance

Project Overview

Key Concepts

Database architecture

Requirements

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages