SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.
RSS Feed

HELP: Tutorials | FAQ
CONTACT US: Contact info

Search Results

Journal Article

Citation

Schlögl M, Stütz R, Laaha G, Melcher M. Accid. Anal. Prev. 2019; 127: 134-149.

Affiliation

Institute of Applied Statistics and Scientific Computing, University of Natural Resources and Life Sciences, Vienna, Austria.

Copyright

(Copyright © 2019, Elsevier Publishing)

DOI

10.1016/j.aap.2019.02.008

PMID

30856396

Abstract

One of the main aims of accident data analysis is to derive the determining factors associated with road traffic accident occurrence. While current studies mainly use variants of count data regression to achieve this aim, the problem can also be considered as a binary classification task, with the dichotomous target variable indicating events (accidents) and non-events (no accidents). The effects of 45 variables - describing road condition and geometry, traffic volume and regulations, weather, and accident time - are analyzed using a dataset in high temporal (1 h) and spatial (250 m) resolution, covering the whole highway network of Austria over the period of four consecutive years. A combination of synthetic minority oversampling and maximum dissimilarity undersampling is used to balance the training dataset. We employ and compare a series of statistical learning techniques with respect to their predictive performance and discuss the importance of determining factors of accident occurrence from the ensemble of models.

FINDINGS substantiate that a trade-off between accuracy and sensitivity is inherent to imbalanced classification problems.

RESULTS show satisfying performance of tree-based methods which exhibit accuracies between 75% and 90% while exhibiting sensitivities between 30% and 50%. Overall, this analysis emphasizes the merits of using high-resolution data in the context of accident analysis.

Copyright © 2019 Elsevier Ltd. All rights reserved.


Language: en

Keywords

Accident analysis; Binary classification; Imbalanced data; Road safety; Statistical learning

NEW SEARCH


All SafetyLit records are available for automatic download to Zotero & Mendeley
Print