End-to-end autonomous driving decision method based on improved TD3 algorithm in complex scenarios

Xu, Tao; Meng, Zhiwei; Lu, Weike; Tong, Zhongwen

doi:10.3390/s24154962

SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.

RSS Feed

HELP: Tutorials | FAQ

CONTACT US: Contact info

Search Results

Journal Article

End-to-end autonomous driving decision method based on improved TD3 algorithm in complex scenarios
Citation	Xu T, Meng Z, Lu W, Tong Z. Sensors (Basel) 2024; 24(15).
Copyright	(Copyright © 2024, MDPI: Multidisciplinary Digital Publishing Institute)
DOI	10.3390/s24154962
PMID	39124010
PMCID	PMC11315049
Abstract	The ability to make informed decisions in complex scenarios is crucial for intelligent automotive systems. Traditional expert rules and other methods often fall short in complex contexts. Recently, reinforcement learning has garnered significant attention due to its superior decision-making capabilities. However, there exists the phenomenon of inaccurate target network estimation, which limits its decision-making ability in complex scenarios. This paper mainly focuses on the study of the underestimation phenomenon, and proposes an end-to-end autonomous driving decision-making method based on an improved TD3 algorithm. This method employs a forward camera to capture data. By introducing a new critic network to form a triple-critic structure and combining it with the target maximization operation, the underestimation problem in the TD3 algorithm is solved. Subsequently, the multi-timestep averaging method is used to address the policy instability caused by the new single critic. In addition, this paper uses Carla platform to construct multi-vehicle unprotected left turn and congested lane-center driving scenarios and verifies the algorithm. The results demonstrate that our method surpasses baseline DDPG and TD3 algorithms in aspects such as convergence speed, estimation accuracy, and policy stability. Language: en
Keywords	autonomous driving; complex scenarios; intelligent decision-making; multiple critics; reinforcement learning

BACK TO RESULTS

NEW SEARCH
Download this record to:
RIS | BibTeX | EndNote

All SafetyLit records are available for automatic download to Zotero & Mendeley

Print
Email

Find full text at...

- Direct link (DOI)
- Publisher website
PubMed Central
- Google Scholar
- Inter-Library Document Request Form (pdf)