SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.
RSS Feed

HELP: Tutorials | FAQ
CONTACT US: Contact info

Search Results

Journal Article

Citation

Marchegiani L, Newman P. IEEE Trans. Intel. Transp. Syst. 2022; 23(10): 17087-17096.

Copyright

(Copyright © 2022, IEEE (Institute of Electrical and Electronics Engineers))

DOI

10.1109/TITS.2022.3158076

PMID

unavailable

Abstract

This paper is about acoustic event detection and sound source localisation in urban scenarios. Specifically, we are interested in detecting and localising horns and sirens of emergency vehicles. Urban scenarios, though, can be characterised by copious, unstructured and unpredictable traffic noise, which can severely compromise the performance and effectiveness of traditional filtering techniques. By analysing the spectrograms of incoming stereo signals as images, we can leverage image processing techniques and obtain a demonstrably robust system. Indeed, image processing methods, such as convolutional neural networks, which do not operate locally, offer interesting mechanisms for background foreground separation. When applied to spectrograms, those mechanisms allow using the entire context of the soundscape to discover and learn correlations both in the time and frequency domains, de facto implementing noise detection through semantic segmentation. In a multi-task learning scheme, together with signal denoising, we perform acoustic event classification to identify the nature of the alerting sound. Lastly, we use the denoised signals to localise the acoustic source on the ground plane, by regressing the direction of arrival of the sound. Our experimental evaluation shows an average classification rate of 94%, and a median absolute error on the localisation of 7.5° when operating on audio frames of 0.5 s, and of 2.5° when operating on frames of 2.5 s. The system offers excellent performance in particularly challenging scenarios, where the noise level is remarkably high.


Language: en

Keywords

Acoustic event classification; Acoustics; deep learning; Deep learning; Direction-of-arrival estimation; Image segmentation; Multitasking; semantic segmentation; Signal to noise ratio; siren detection; smart vehicles; Spectrogram

NEW SEARCH


All SafetyLit records are available for automatic download to Zotero & Mendeley
Print