liu.seSearch for publications in DiVA
Operational message
There are currently operational disruptions. Troubleshooting is in progress.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Robust Real-Time Delay Predictions in a Network of High-Frequency Urban Buses
Linköping University, Department of Computer and Information Science, The Division of Statistics and Machine Learning. Linköping University, Faculty of Arts and Sciences.ORCID iD: 0000-0001-9025-6701
Linköping University, Department of Computer and Information Science, The Division of Statistics and Machine Learning. Linköping University, Faculty of Arts and Sciences. Department of Statistics, Stockholm University, Stockholm, Sweden.ORCID iD: 0000-0003-2786-2519
2022 (English)In: IEEE Transactions on Intelligent Transportation Systems, ISSN 1524-9050, E-ISSN 1558-0016, Vol. 23, no 9, p. 16304-16317Article in journal (Refereed) Published
Abstract [en]

Providing transport users and operators with accurate forecasts on travel times is challenging due to a highly stochastic traffic environment. Public transport users are particularly sensitive to unexpected waiting times, which negatively affect their perception on the system's reliability. In this paper we develop a robust model for real-time bus travel time prediction that departs from Gaussian assumptions by using Student-t errors. The proposed approach uses spatiotemporal characteristics from the route and previous bus trips to model short-term effects, and date/time variables and Gaussian processes for long-run forecasts. The model allows for flexible modeling of mean, variance and kurtosis spaces. We propose algorithms for Bayesian inference and for computing probabilistic forecast distributions. Experiments are performed using data from high-frequency buses in Stockholm, Sweden. Results show that Student-t models outperform Gaussian ones in terms of log-posterior predictive power to forecast bus delays at specific stops, which reveals the importance of accounting for predictive uncertainty in model selection. Estimated Student-t regressions capture typical temporal variability between within-day hours and different weekdays. Strong spatiotemporal effects are detected for incoming buses from immediately previous stops, which is in line with many recently developed models. We finally show how Bayesian inference naturally allows for predictive uncertainty quantification, e.g. by returning the predictive probability that the delay of an incoming bus exceeds a given threshold.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2022. Vol. 23, no 9, p. 16304-16317
Keywords [en]
Predictive models; Uncertainty; Real-time systems; Spatiotemporal phenomena; Delays; Data models; Probabilistic logic; Intelligent transportation systems; bus arrival time predictions; spatiotemporal networks; probabilistic modeling; robustness
National Category
Transport Systems and Logistics
Identifiers
URN: urn:nbn:se:liu:diva-183002DOI: 10.1109/tits.2022.3149656ISI: 000758176500001Scopus ID: 2-s2.0-85124827488OAI: oai:DiVA.org:liu-183002DiVA, id: diva2:1638547
Funder
Swedish Research Council, 2020-02846
Note

Funding: Wallenberg AI, Autonomous Systems and Software Program (WASP) by the Knut and Alice Wallenberg Foundation; Swedish Research CouncilSwedish Research CouncilEuropean Commission [2020-02846]

Available from: 2022-02-17 Created: 2022-02-17 Last updated: 2025-08-28Bibliographically approved
In thesis
1. Bayesian Models for Spatiotemporal Data from Transportation Networks
Open this publication in new window or tab >>Bayesian Models for Spatiotemporal Data from Transportation Networks
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Bayesianska modeller för spatiotemporal data från transportnätverk
Abstract [en]

Urbanization has caused a historical transformation at a global scale, and humanity is moving towards a fully connected society where cities will concentrate population, infrastructure and economic activity. A key element in the cities’ infrastructure is the transportation system, as it facilitates the mobility of people and goods. Transportation systems are constantly generating data from, e.g., GPS, sensors and cameras, and the statistical modeling is challenging due to the complex structure and dynamics of the system, and the inherent uncertainty. In this thesis, we develop Bayesian models with applications to transportation. We specifically focus on models that can be trained on spatiotemporal data coming from transport networks to make predictions on, e.g., bus delays or the actual network topology. Special attention has been given to model scalability issues and uncertainty quantification. We have used real-world data from transportation systems in every study to keep a balance between statistical rigor, novelty, and applicability. 

The thesis consists of four papers. The first study presents a state-of-the-art probabilistic latent network model to forecast multilayer dynamic graphs. The model uses stochastic blockmodeling to reduce the computational burden, and is illustrated on a sample of 10-year data from four major airlines within the US air transportation system. In the second paper, we develop a robust model for real-time bus travel time prediction that departs from Gaussian assumptions by using Student-t errors, and show how Bayesian inference naturally allows for predictive uncertainty quantification in a highly stochastic environment. Experiments are performed using data from high-frequency buses in Stockholm, Sweden. The third paper shows the potential of multi-output Gaussian processes to tackle network-wide travel time prediction in an urban area. We develop a responsive online model based on a coregionalized covariance and test its accuracy on real data from GPS-equipped taxis. Finally, we propose a novel regularization strategy for the vector autoregressive model that is based on a graphical spike-and-slab prior, and present a case study with real airline delay data to assess its predictive performance and analyze network patterns related to the propagation of delays across airports. 

Abstract [sv]

Urbaniseringen har orsakat en historisk förändring på en global skala, och mänskligheten går mot ett uppkopplat globalt nätverkssamhälle där städer kommer att koncentrera befolkning, infrastruktur och ekonomisk aktivitet. Ett nyckelelement i städernas infrastruktur är transportsystemet, eftersom det underlättar rörligheten av människor och varor. Transportsystem genererar ständigt data från tex. GPS, sensorer och kameror, och den statistiska modelleringen är utmanande på grund av systemets komplexa struktur och dynamik, samt dess naturliga osäkerheter.

I denna avhandling utvecklar vi Bayesianska modeller med tillämpningar för transporter. Vi fokuserar specifikt på modeller som kan tränas på spatiotemporala data från transportnätverk för att göra prediktioner av t ex. bussförseningar eller verklig nätverkstopologi. Särskild uppmärksamhet har ägnats åt modellskalbarhetsfrågor och kvantifiering av osäkerhet. Vi har använt data från riktiga transportsystem i varje studie för att skapa en balans mellan statistisk korrekthet, praktiskt tillämpbarhet och vetenskaplig höjd. Avhandlingen består av fyra artiklar. Den första artikeln presenterar en probabilistisk latent nätverksmodell för att prognostisera dynamiska grafer med multipla lager. Modellen använder stokastisk blockmodellering för att minska beräkningsbördan, och illustreras på ett datamaterial bestånde av tio års data från fyra stora flygbolag inom det amerikanska lufttransportsystemet. I den andra artikeln utvecklar vi en robust modell för realtidsprognoser av bussförseningar genom att använda Student-t fördelning och vi visar hur Bayesiansk inferens ger en naturlig kvantifiering av osäkerhet i en mycket stokastisk miljö. Experiment utförs med hjälp av högfrekventa data från bussar i Stockholm. Den tredje artikeln visar potentialen hos fler-dimensionella Gaussiska processer för att generera nätverksövergripande prediktioner av trafikflöden i en tätortsmiljö. Vi utvecklar en responsiv onlinemodell baserad på en co-regionaliserad kovariansstruktur och utvärderar prognosförmåga på verkliga data från GPS-utrustade taxibilar. Slutligen föreslår vi en ny regularisering av den vektorautoregressiva modellen via en nätverksbaserad variabelsselektionsprior, och presenterar en fallstudie på verkliga data över förseningar i kommersiell flygtrafik där vi utvärderar prediktiv förmåga och analyserar nätverksmönster för hur förseningar sprids mellan flygplatser.  

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2023. p. 38
Series
Linköping Studies in Arts and Sciences, ISSN 0282-9800 ; 848Linköping Studies in Statistics, ISSN 1651-1700 ; 17
Keywords
Bayesian statistics, Transportation networks, Spatiotemporal data, Machine learning, Bayesiansk statistik, Transportnätverk, Spatiotemporal data, Maskininlärning
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:liu:diva-191153 (URN)10.3384/9789180750363 (DOI)9789180750356 (ISBN)9789180750363 (ISBN)
Public defence
2023-02-17, Ada Lovelace, Building B, Campus Valla, Linköping, 13:15 (English)
Opponent
Supervisors
Note

Funding agencies: This work was partially supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation, Sweden. The computations were enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC), partially funded by the Swedish Research Council through grant agreement no. 2018-05973.

Available from: 2023-01-20 Created: 2023-01-20 Last updated: 2023-02-15Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Rodriguez Déniz, HéctorVillani, Mattias
By organisation
The Division of Statistics and Machine LearningFaculty of Arts and Sciences
In the same journal
IEEE Transactions on Intelligent Transportation Systems
Transport Systems and Logistics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 309 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf