SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.
RSS Feed

HELP: Tutorials | FAQ
CONTACT US: Contact info

Search Results

Journal Article

Citation

Xia Z, Zhang B, Gao D. China Saf. Sci. J. 2022; 32(12): 53-62.

Copyright

(Copyright © 2022, China Occupational Safety and Health Association, Publisher Gai Xue bao)

DOI

10.16265/j.cnki.issn1003-3033.2022.12.2727

PMID

unavailable

Abstract

In order to solve the problems faced by deep learning model in text mining of safety inspection minutes, such as small data set size, uneven distribution of sample data and poor effect of NER, a new data enhancement method for NER was proposed. First of all, the named entities in the data set were separated and the same kind of named entities were replaced randomly, which could not only avoid the damage of data enhancement technology to the information of named entities, but also make the distribution of named entities more uniform. Then, by optimizing the noise data and scale parameters of other parts, the effect of NER was further improved. Finally, the separated data was automatically labeled and recombined to avoid the disadvantage of manually marking a large amount of data. The results show that this method can quickly solve the problems such as the small amount of data and the uneven distribution of named entities in the dataset. Compared with the latest AEDA (An Easier Data Augmentation) method, this method achieves better recognition results on data sets such as HSE inspection minutes, and improves the comprehensive evaluation index of the model on one⁃fold expanded data from 92. 83% to 97. 23%. At the same time, the spatial distribution and strong association rules of safety hazards in construction process can be obtained. © 2022 China Safety Science Journal. All rights reserved.


Language: zh

NEW SEARCH


All SafetyLit records are available for automatic download to Zotero & Mendeley
Print