SAFETYLIT WEEKLY UPDATE

Journal Article

Citation

Liao Y, Yu G, Chen P, Zhou B, Li H. Transportmetrica A: Transp. Sci. 2024; 20(1): e2035846.

Copyright

(Copyright © 2024, Informa - Taylor and Francis Group)

DOI

10.1080/23249935.2022.2035846

PMID

unavailable

Abstract

To adapt to human driving habits, this study develops a personalised car-following model using a memory-based deep reinforcement learning approach. Specifically, Twin Delayed Deep Deterministic Policy Gradients (TD3) is integrated with a long short-term memory (LSTM) network; the combined method is abbreviated as LSTM-TD3. Using the NGSIM dataset, unsupervised learning-based clustering and data feature analyses are performed. The driving characteristics related to safety, efficiency and comfort are extracted for three driving styles: aggressive, common and conservative. Reward functions are then constructed for each driving style by incorporating its driving characteristics. Using the TD3 policy within a recurrent actor-critic framework, LSTM-TD3 optimises the car-following behaviour through trial-and-error interactions guided by these reward functions.
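
For readers unfamiliar with TD3, the two ingredients that distinguish it from DDPG can be sketched in a few lines: the critic target takes the minimum of two target-critic estimates, and the target action is perturbed with clipped noise. The sketch below is the generic TD3 update for a single scalar transition, not the authors' LSTM-TD3 implementation. The function names, the action bounds and the default values (gamma, noise_std, noise_clip) are assumptions chosen for illustration; the abstract does not give them.

```python
import random


def smoothed_target_action(action, noise_std=0.2, noise_clip=0.5,
                           low=-1.0, high=1.0, rng=random):
    """Target policy smoothing: add clipped Gaussian noise, then clip to bounds.

    `action` stands in for the target actor's output at the next state.
    All default values here are illustrative assumptions.
    """
    noise = rng.gauss(0.0, noise_std)
    noise = max(-noise_clip, min(noise_clip, noise))
    return max(low, min(high, action + noise))


def td3_target(reward, done, q1_next, q2_next, gamma=0.99):
    """Clipped double-Q target: bootstrap from the smaller of two critic values.

    `q1_next` and `q2_next` stand in for the two target critics evaluated
    at the next state and the smoothed target action.
    """
    return reward + gamma * (1.0 - done) * min(q1_next, q2_next)


# Hypothetical numbers for one transition.
next_action = smoothed_target_action(0.9)
target = td3_target(reward=1.0, done=0.0, q1_next=2.0, q2_next=3.0, gamma=0.5)
```

In the recurrent variant described in the abstract, the actor and critics would presumably take a history of observations rather than a single state, but the target computation itself would keep this form.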

Results show that, compared with LSTM-DDPG and DDPG, LSTM-TD3 reproduces personalised car-following behaviour with desirable convergence speed and reward. The results also indicate that LSTM-TD3 can reflect the essential differences in safety, efficiency and comfort requirements among driving styles.


Language: en

Keywords

autonomous driving; car-following; driving styles; long short-term memory; twin delayed deep deterministic policy gradients
