An open dataset for intelligent recognition and classification of abnormal condition in longwall mining

Yang, Wenjuan; Zhang, Xuhui; Ma, Bing; Wang, Yanqun; Wu, Yujia; Yan, Jianxing; Liu, Yongwei; Zhang, Chao; Wan, Jicheng; Wang, Yue; Huang, Mengyao; Li, Yuyang; Zhao, Dian

doi:10.1038/s41597-023-02322-9

Data Descriptor
Open access
Published: 27 June 2023

Wenjuan Yang^1,2,
Xuhui Zhang^1,2,
Bing Ma¹,
Yanqun Wang¹,
Yujia Wu¹,
Jianxing Yan¹,
Yongwei Liu³,
Chao Zhang¹,
Jicheng Wan¹,
Yue Wang¹,
Mengyao Huang¹,
Yuyang Li¹ &
Dian Zhao¹

Scientific Data volume 10, Article number: 416 (2023)

A Publisher Correction to this article was published on 06 August 2024

This article has been updated

Abstract

Underground coal production at fully mechanized mining faces suffers from many problems, such as a poor operating environment and a high accident rate. Intelligent autonomous coal mining is gradually replacing the traditional mining process. Artificial intelligence is an active research area and is expected to identify and warn of abnormal underground conditions in intelligent longwall mining. Such methods depend on datasets, but no public underground dataset currently exists. This work develops an image dataset of the underground longwall mining face (DsLMF+), which consists of 138004 images annotated in 6 categories: mine personnel, hydraulic support guard plate, large coal, towline, miners’ behaviour and mine safety helmet. All labels of the dataset are publicly available in YOLO format and COCO format. The availability and accuracy of the dataset were reviewed by experts in the coal mining field. The dataset is open access and aims to support further research on the intelligent identification and classification of abnormal conditions in underground mining.

Background & Summary

Coal will remain the dominant energy source worldwide for decades to come¹. Autonomous coal mining machines at the longwall mining face can assist or replace humans in dangerous mining work and help achieve safe and efficient coal production, although some complex tasks still require human participation. Underground coal excavation at the fully mechanized longwall mining face suffers from many problems, such as a poor operating environment, high disaster risk and a high accident rate. Intelligent mining has become one of the important ways to address high-risk underground work and to achieve safe and efficient underground production². With the rapid development of artificial intelligence, abnormal conditions of equipment, environment and personnel are expected to be detected accurately and in real time.

In a fully mechanized working face, the hydraulic support is indispensable to safe production. As the core equipment of fully mechanized coal mining, the hydraulic support provides a safe working space and advances the scraper conveyor and shearer along the working face³. It also supports the mine roof reliably, isolates mined-out areas and prevents waste rock from entering the working face. In the coal mining process of the fully mechanized face, if the guard plate of the hydraulic support is not in place or not fully retracted during operation, the hydraulic support and the shearer may interfere with each other's movement. Hence, it is necessary to detect the state of the hydraulic support guard plate in time and respond accordingly. At the fully mechanized longwall mining face, large coal blocks easily cause blockage, retention and other abnormal states of the scraper conveyor. It is therefore necessary to identify and track large coal automatically, so that abnormal states caused by large coal can be judged and warned of in time. The towline is used in the fully mechanized mining face to ensure the power supply and stable operation of the shearer. During operation, however, the stacking of cable clamps may break the traction cable or pull it out of the cable slot, and the cable may be torn off, resulting in underground electric leakage, which may eventually lead to electric shock, gas or coal dust explosion, fire and other major coal mine accidents. Therefore, real-time status monitoring and intelligent analysis of the towline are needed to ensure that towline faults are detected and handled in time.

To protect personnel at the fully mechanized mining face, it is necessary to identify and track mine personnel and judge whether they are in a safe area. Personnel entering a dangerous area should be detected and located in time, a voice reminder should be issued, and the corresponding equipment should be stopped at the same time. Apart from entering dangerous areas, coal miners adopt a variety of postures during work. In the complex working environment, unsafe behaviours of miners also tend to increase coal mine safety accidents, so abnormal behaviour of underground staff needs continuous attention. The safety helmet is safety equipment that coal miners must wear at all times during work. Extracting the coal seam transfers pressure from the hydraulic support to the coal wall, which may increase the pressure on the coal wall and eventually cause coal wall spalling. Coal falling from the roof and collisions between personnel and equipment may cause injuries. Hence, safety helmets are essential to the safety of coal miners at the fully mechanized mining face, and helmet wearing by coal mine staff also needs real-time monitoring.

The states of the hydraulic support guard plate, large coal and towline, together with mine worker detection, personnel behaviour and the wearing of safety helmets, are the key targets of abnormal condition detection and identification at the fully mechanized longwall mining face. Monitoring video from the fully mechanized mining face is voluminous and updated quickly. In the traditional production process, specialized personnel judge abnormal conditions of the working face through real-time video surveillance, so abnormal conditions may not be found in time because of visual fatigue during long working hours. Therefore, it is of great significance to apply artificial intelligence to the analysis, identification and warning of abnormal states involving the hydraulic support guard plate, large coal, towline, miners’ behaviour and the wearing of safety helmets. Data-driven object detection depends on datasets, and a large number of samples is required for training to achieve good generalization^4,5,6. Hence, an image dataset is needed to identify and warn of abnormal underground conditions at the fully mechanized longwall mining face. Since no public underground dataset currently exists, this work constructs the image dataset DsLMF+ for intelligent recognition of abnormal conditions at the underground longwall mining face, which covers the hydraulic support guard plate, large coal, towline, mine safety helmet, coal miners and miners’ behaviour in the fully mechanized face.

Datasets are currently widely used in autonomous driving, object detection, face recognition, natural language processing, text detection, medicine and other fields^7,8,9,10. Some widely used object detection datasets are: (1) the COCO datasets, whose detection targets are large-scale commonly used items^11,12,13; (2) the VOC datasets, whose targets are people, common animals, traffic vehicles and indoor furniture^14,15,16; (3) the DOTA dataset, whose targets are airplanes, ships, storage tanks, baseball stadiums, tennis courts, basketball courts, ground runways, ports and bridges^17,18,19; (4) the TT100K dataset, whose targets are common vehicles^20,21,22; (5) the WIDER FACE dataset, whose targets vary in facial expression, illumination and posture^23,24,25; and (6) YOLO format datasets dedicated to object detection^26,27,28. In addition to these common datasets, a custom dataset can be defined through the PyTorch framework, but custom dataset formats are complex, diverse and hard to share²⁹. Because no public underground dataset currently exists, the compatibility and practicability of a coal mine dataset should be taken into consideration in order to construct an image dataset of the fully mechanized face and promote its application in intelligent coal mining.

Based on an analysis of the formats and production methods of the commonly used object detection datasets above, the dataset in this work was produced by personnel who are familiar with the fully mechanized mining face. The LabelImg software was used to annotate the dataset in YOLO format³⁰, which makes it convenient to use with the currently popular YOLO series of object detection networks. To extend the application range of the dataset, the labels were also converted into COCO format with a label format conversion script, so that the dataset can be used with the currently popular COCO-based object detection methods. Beyond the COCO and YOLO label formats, other label formats can also be obtained through a label conversion script.

The image dataset of the fully mechanized longwall mining face (DsLMF+) is of great significance for applying data-driven object detection in coal mining. It is expected to support the identification and warning of abnormal underground conditions, help address dangerous and inefficient underground work, and thus accelerate the development of intelligent coal mines.

Methods

The construction process of the image dataset of underground longwall mining face (DsLMF+) is shown in Fig. 1, which is mainly divided into the following three steps: (1) Image data collection; (2) Image data filtering; (3) Data labeling.

Image data collection

The original underground monitoring videos of the fully mechanized coal mining face were provided by several coal mines in Shaanxi Province, China, and were then screened and classified according to target object. We signed a scene authorization agreement with Shaanxi Coal and Chemical Industry Group Sunjiacha Longhua Mining Co., Ltd. to ensure that the dataset could be disclosed. The agreement also covers authorization to disclose the portraits of mine personnel, ensuring that the miners photographed in the coal mine scenes were aware of the disclosure of the dataset. The image acquisition equipment consists of an IVG-G5A network HD camera and an Openmv IMX335 (1/2.8”) lens. The lens focal length is 2.02 mm, and the field of view is 119.8° (D), 105.2° (H) and 87.2° (V). The camera acquires images at a maximum resolution of 5 megapixels and a frame rate of 1 to 30 FPS, and the video formats used are Flash Video (FLV) and MPEG-4. The FFmpeg video processing software was used to process the classified videos³¹ and to extract images at different frame rate settings. The DsLMF+ dataset built in this work consists of 6 categories: coal miners, large coal, towline, mine safety helmet, hydraulic support guard plate and miners’ behaviors. Some of the original images contain no target object to annotate, that is, they do not include mine personnel, large coal, towline, hydraulic support guard plate or any other target category. These image frames were therefore removed, the remaining images were sorted by category, and the resulting images serve as the original image source of the DsLMF+ dataset.
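The frame extraction step can be sketched as follows. This is a minimal illustration, not the authors' script: the function name, the output file pattern and the JPEG quality setting are assumptions, and only the use of FFmpeg with a chosen frame rate comes from the text. It builds the argument list for FFmpeg's `fps` video filter, which samples a fixed number of frames per second from the input video.

```python
from pathlib import Path


def build_ffmpeg_command(video_path: str, output_dir: str, fps: float) -> list[str]:
    """Build an FFmpeg argument list that samples `fps` frames per second.

    The output pattern and quality flag are illustrative assumptions.
    """
    pattern = str(Path(output_dir) / "frame_%06d.jpg")
    return [
        "ffmpeg",
        "-i", video_path,      # input video (the paper mentions FLV and MPEG-4)
        "-vf", f"fps={fps}",   # fps filter: keep this many frames per second
        "-q:v", "2",           # JPEG quality (assumed setting)
        pattern,               # numbered output images
    ]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    print(" ".join(build_ffmpeg_command("towline_scene01.mp4", "frames", 2)))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` on a machine where FFmpeg is installed. A lower `fps` value suits slowly changing scenes, while faster-moving targets may need a higher value.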

Image data filtering

The original image source of the DsLMF+ dataset was then screened. The DsLMF+ dataset mainly covers mine personnel, large coal, hydraulic support guard plate, towline, mine safety helmet and miners’ behaviors. Some of the original images contain no target, contain an incomplete target, or are of such poor quality that the target is difficult to identify, so all such abnormal images should be removed.

The abnormal images that need to be processed mainly arise in the following situations: (1) When the fully mechanized mining face is affected by severe environmental factors such as heavy dust and water mist, it is difficult to identify the coal miners, large coal, hydraulic support guard plate, towline, mine safety helmet and miners’ behaviors in the collected images. (2) Because of the limited field of view of the camera or occlusion, the target is captured incompletely, so the acquired images contain only local features of the target. (3) When the fully mechanized mining face has stopped working, the camera continues to collect images, resulting in a large number of repeated images. (4) The target objects in the underground videos are moving. When these videos are converted into pictures, a reasonable frame rate should be chosen according to the moving speed, but if the target moves too fast, the extracted pictures will inevitably be blurred. (5) Because of the underground environment and the distance between the target and the camera, distant targets are difficult to distinguish from other equipment.

All of the above abnormal images need to be eliminated manually or automatically during dataset production. To make the dataset reproducible, we used ResNet50 to build a three-class automatic filtering model that handles low-quality images affected by underground environmental factors such as heavy dust, water mist and motion blur. High dust and water mist images, defocused and motion-blurred images, and clear images were selected from the collected raw image data to construct an image filtering dataset for training and validating the three-class automatic filtering model. The resulting model can remove invalid images automatically, which increases the reproducibility of our dataset and makes it easier for other researchers to work with it. The three-class automatic filtering model is provided along with the dataset, and its usage is described in the attached README file. In addition, the structural similarity index (SSIM) can be used to detect and automatically filter out duplicate or highly similar images. For the remaining cases, because image screening is easily affected by personal subjective factors, multiple people jointly reviewed the controversial images when removing images from the dataset, especially images that are difficult to distinguish.
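The idea of SSIM-based duplicate filtering can be illustrated with a small sketch. It makes several simplifying assumptions: it computes a single global SSIM value over the whole image instead of the usual average over local windows, it treats each image as a flat list of grayscale intensities, and the similarity threshold of 0.95 is a hypothetical choice. It is not the filtering code released with the dataset.

```python
def global_ssim(x, y, dynamic_range=255.0):
    """Single-window SSIM between two equally sized grayscale images.

    `x` and `y` are flat sequences of pixel intensities. Standard SSIM
    averages this statistic over local windows; this simplified version
    uses one window covering the whole image.
    """
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("images must be non-empty and of equal size")
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    c1 = (0.01 * dynamic_range) ** 2  # stabilizing constants from the SSIM definition
    c2 = (0.03 * dynamic_range) ** 2
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def drop_near_duplicates(frames, threshold=0.95):
    """Keep a frame only if it differs enough from the last kept frame."""
    kept = []
    for frame in frames:
        if not kept or global_ssim(kept[-1], frame) < threshold:
            kept.append(frame)
    return kept
```

Comparing each frame only with the last kept frame suits the case described above, where a stationary working face produces long runs of nearly identical consecutive frames.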

Data labeling

Finally, the filtered images were annotated and labeled using the LabelImg software; an official open source download link for LabelImg is https://github.com/heartexlabs/labelImg. Researchers can set the label format to YOLO, VOC or CreateML and annotate the images according to the official instructions. When labeling the different kinds of datasets, the label order needs to be determined in advance. Once the label order is determined, it must not be changed when the software is opened for the next labeling session. If the order is changed, the label order of the dataset is automatically changed to the current order and the existing annotations are interpreted in that order, resulting in label confusion in the dataset. LabelImg was used to annotate the training set and validation set in YOLO format, and we also converted the YOLO datasets into COCO datasets with script files and retained both. This work includes the datasets of mine personnel, towline, mine safety helmet and large coal with single-label annotation, as well as the hydraulic support guard plate and miners’ behaviors with multi-label annotations. Figure 2 shows the label annotations of coal miners, large coal, towline, mine safety helmet, miners’ behavior and the supporting state of the hydraulic support guard plate.
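The label-order issue arises because a YOLO annotation stores only a class index, and the index is resolved against an ordered list of class names. The sketch below shows how the same index maps to different names under two orders, and how an index remapping could repair files written under the old order. The class orders shown are hypothetical; the actual order used in DsLMF+ is not stated in the text.

```python
def build_index_remap(old_order, new_order):
    """Map each class index under `old_order` to its index under `new_order`."""
    new_index = {name: i for i, name in enumerate(new_order)}
    return {old_i: new_index[name] for old_i, name in enumerate(old_order)}


# Hypothetical label orders, for illustration only.
old_order = [
    "hydraulic_support_guard_plate_00",
    "hydraulic_support_guard_plate_90",
    "Shearer",
]
new_order = [
    "Shearer",
    "hydraulic_support_guard_plate_00",
    "hydraulic_support_guard_plate_90",
]

# An annotation written as class index 0 under the old order...
assert old_order[0] == "hydraulic_support_guard_plate_00"
# ...would be read as a different class if the order changes.
assert new_order[0] == "Shearer"

remap = build_index_remap(old_order, new_order)
```

Applying `remap` to the first field of every label line would restore the intended class names after a reorder, although keeping the order fixed from the start avoids the problem entirely.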

The single-label datasets of large coal, mine safety helmet, towline and mine personnel are named large_coal, mine_safety_helmet, towline and coal miner, respectively. To judge whether the shearer’s operation and the guard plate interfere with each other, the images are labeled according to the unfolding angle of the hydraulic support guard plate, so as to obtain the support state of the hydraulic support at the fully mechanized mining face. The label types cover all angles of the hydraulic support guard plate. To ensure the accuracy of angle labeling, this work uses the built-in sensor of the hydraulic support to detect and extract the angle of the guard plate in real time. The extracted angle information is used both to annotate the guard plate images in the dataset and to verify that the annotated angle types are reasonable. According to the unfolding angle, the supporting states of the hydraulic support guard plate are divided into eight types, named hydraulic_support_guard_plate_00, hydraulic_support_guard_plate_00_30, hydraulic_support_guard_plate_30_60, hydraulic_support_guard_plate_60_90, hydraulic_support_guard_plate_90, hydraulic_support_guard_plate_90_abnormal, hydraulic_support_guard_plate_90_120 and hydraulic_support_guard_plate_abnormal. In addition, in images where the shearer passes under the hydraulic support guard plate, the shearer is labeled Shearer, so that motion interference between the guard plate and the shearer can be judged. The dataset labels of the hydraulic support guard plate states are shown in Fig. 3.

Among them, hydraulic_support_guard_plate_00 is the state in which the guard plate is fully retracted and does not interfere with shearer operation. The numbers separated by the underscore in hydraulic_support_guard_plate_00_30, hydraulic_support_guard_plate_30_60 and hydraulic_support_guard_plate_60_90 give the angle range of the unfolded guard plate. In these three states, the guard plate interferes with the shearer in operation. In the hydraulic_support_guard_plate_90 state, the unfolding angle is 90° and the guard plate is close to the coal wall, so it supports the coal wall well and effectively prevents coal wall spalling accidents at the fully mechanized mining face. In the hydraulic_support_guard_plate_abnormal state, the structure of the guard plate is faulty and the plate should be replaced in time. In the hydraulic_support_guard_plate_90_abnormal state, the unfolding angle is 90° but there is a small gap between the guard plate and the coal wall, so the support strength is insufficient. In the hydraulic_support_guard_plate_90_120 state, the unfolding angle is too large, leaving too large a gap between the guard plate and the coal wall, so the support strength is again insufficient.
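The eight guard plate states described above can be summarized as a lookup table, which may be convenient when turning detector output into warnings. The label names are taken from the text; the description strings paraphrase it, and the helper function and its return structure are illustrative assumptions, not part of the dataset.

```python
# Label names come from the text; descriptions paraphrase it.
GUARD_PLATE_STATES = {
    "hydraulic_support_guard_plate_00": "fully retracted, no interference with the shearer",
    "hydraulic_support_guard_plate_00_30": "unfolded in the 0 to 30 degree range",
    "hydraulic_support_guard_plate_30_60": "unfolded in the 30 to 60 degree range",
    "hydraulic_support_guard_plate_60_90": "unfolded in the 60 to 90 degree range",
    "hydraulic_support_guard_plate_90": "unfolded to 90 degrees, close to the coal wall",
    "hydraulic_support_guard_plate_90_abnormal": "90 degrees with a small gap to the coal wall",
    "hydraulic_support_guard_plate_90_120": "angle too large, large gap to the coal wall",
    "hydraulic_support_guard_plate_abnormal": "structural fault, plate should be replaced",
}

# States the text says interfere with the shearer in operation.
INTERFERES_WITH_SHEARER = frozenset({
    "hydraulic_support_guard_plate_00_30",
    "hydraulic_support_guard_plate_30_60",
    "hydraulic_support_guard_plate_60_90",
})

# States the text says provide insufficient support strength.
INSUFFICIENT_SUPPORT = frozenset({
    "hydraulic_support_guard_plate_90_abnormal",
    "hydraulic_support_guard_plate_90_120",
})


def describe_state(label):
    """Return a small summary for a guard plate label (hypothetical helper)."""
    if label not in GUARD_PLATE_STATES:
        raise KeyError(f"unknown guard plate label: {label}")
    return {
        "description": GUARD_PLATE_STATES[label],
        "interferes_with_shearer": label in INTERFERES_WITH_SHEARER,
        "insufficient_support": label in INSUFFICIENT_SUPPORT,
    }
```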

To ensure the generality and compatibility of the dataset, we collected the images of mine personnel, large coal, towline, mine safety helmet, miners’ behaviors and hydraulic support guard plate from multiple scenes. The mine personnel images came from 58 different scenes, the large coal images from 18, the guard plate images from 159, the towline images from 65, the mine safety helmet images from 85 and the miners’ behavior images from 67. The DsLMF+ datasets are divided into a training set and a validation set at a ratio³² of 8:2. There are 30704 mine personnel images (24563 training, 6141 validation), 21017 large coal images (16813 training, 4204 validation), 21412 towline images (17129 training, 4283 validation), 20117 mine safety helmet images (16093 training, 4024 validation), 24709 miners’ behavior images (19767 training, 4942 validation) and 20045 hydraulic support guard plate images (16036 training, 4009 validation). Tables 1–7 describe the datasets of mine safety helmet, towline, coal miners, miners’ behavior, large coal and guard plate across the different scenarios.
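An 8:2 split can be sketched as below. The shuffling, the seed and the function name are assumptions, since the text does not describe how images were assigned to each set. The reported counts are, however, consistent with taking the floor of 80% of each category as the training size, which the table at the end of the block checks.

```python
import random


def split_train_val(items, train_parts=8, total_parts=10, seed=0):
    """Shuffle `items` and split them at floor(len * train_parts / total_parts)."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # fixed seed (assumed) for repeatability
    n_train = len(shuffled) * train_parts // total_parts  # integer arithmetic, no float rounding
    return shuffled[:n_train], shuffled[n_train:]


# (total, training, validation) counts as reported in the text.
REPORTED_COUNTS = {
    "mine personnel": (30704, 24563, 6141),
    "large coal": (21017, 16813, 4204),
    "towline": (21412, 17129, 4283),
    "mine safety helmet": (20117, 16093, 4024),
    "miners' behavior": (24709, 19767, 4942),
    "hydraulic support guard plate": (20045, 16036, 4009),
}

if __name__ == "__main__":
    for name, (total, n_train, n_val) in REPORTED_COUNTS.items():
        train, val = split_train_val(range(total))
        print(name, len(train) == n_train, len(val) == n_val)
```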

Table 1 The summary of the training set and validation set for mine safety helmet.

Table 2 The summary of the training set and validation set for towline.

Table 3 The summary of the training set and validation set for coal miner.

Table 4 The summary for the datasets of miners’ behavior.

Table 5 The summary of the training set and validation set for large coal.

Table 6 The summary for the datasets of hydraulic support guard plate from scenario 1 to scenario 108.

Table 7 The summary for the datasets of hydraulic support guard plate from scenario 109 to scenario 159.

Data Records

The DsLMF+ dataset of coal mine images from the fully mechanized longwall mining face is publicly available at the figshare data repository³³. Annotations are provided in YOLO format and COCO format. The image and label files in YOLO format are stored in data2023_yolo, whose dataset folders are named coal_miner_data2023_yolo, large_coal_data2023_yolo, mine_safety_helmet_data2023_yolo, towline_data2023_yolo, miner_behavior_data2023_yolo and hydraulic_support_guard_plate_data2023_yolo. Each folder contains an image folder and a label folder, named images and labels, which store the image data and label data respectively. These folders in turn contain training set and validation set folders. The label data mainly comprises the data type, the number of labels and the label coordinates.
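A YOLO label file can be read with a few lines of code. The sketch assumes the common YOLO text layout, one object per line as `class_id x_center y_center width height` with coordinates normalized to the image size; the type and function names are illustrative, and the exact layout of the DsLMF+ label files should be checked against the released data.

```python
from typing import NamedTuple


class YoloBox(NamedTuple):
    class_id: int
    x_center: float  # normalized to [0, 1] by image width
    y_center: float  # normalized to [0, 1] by image height
    width: float
    height: float


def parse_yolo_labels(text: str) -> list[YoloBox]:
    """Parse the contents of one YOLO label file into a list of boxes."""
    boxes = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        parts = line.split()
        if not parts:
            continue  # skip blank lines
        if len(parts) != 5:
            raise ValueError(f"line {line_number}: expected 5 fields, got {len(parts)}")
        x, y, w, h = (float(p) for p in parts[1:])
        boxes.append(YoloBox(int(parts[0]), x, y, w, h))
    return boxes
```

For example, `parse_yolo_labels(Path(label_file).read_text())` would return one `YoloBox` per annotated object in that image, where `label_file` is a path inside one of the labels folders.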

The image and label files in COCO format are stored in data2023_coco, whose dataset folders are named coal_miner_data2023_coco, large_coal_data2023_coco, mine_safety_helmet_data2023_coco, towline_data2023_coco, miner_behavior_data2023_coco and hydraulic_support_guard_plate_data2023_coco. Each of these folders contains a training set image folder, a validation set image folder and a label folder, named train2017, val2017 and annotations respectively, which store the training images, validation images and label files. The COCO label files record the file name, image width and height, label category, label coordinates and related fields.
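The relationship between the two label formats comes down to a coordinate change: YOLO stores a normalized box centre and size, while a COCO annotation stores the top-left corner and size in pixels. The sketch below shows that conversion and a minimal COCO-style annotation record. It is an illustration only, not the conversion script used for the dataset, and the function names are assumptions.

```python
def yolo_to_coco_bbox(x_center, y_center, width, height, image_width, image_height):
    """Convert a normalized YOLO box to COCO [x_min, y_min, width, height] in pixels."""
    box_w = width * image_width
    box_h = height * image_height
    x_min = x_center * image_width - box_w / 2
    y_min = y_center * image_height - box_h / 2
    return [x_min, y_min, box_w, box_h]


def make_coco_annotation(annotation_id, image_id, category_id, bbox):
    """Build a minimal COCO-style annotation record for one bounding box."""
    return {
        "id": annotation_id,
        "image_id": image_id,
        "category_id": category_id,
        "bbox": bbox,
        "area": bbox[2] * bbox[3],  # box area in square pixels
        "iscrowd": 0,
    }


if __name__ == "__main__":
    bbox = yolo_to_coco_bbox(0.5, 0.5, 0.5, 0.5, 640, 480)
    print(make_coco_annotation(1, 1, 0, bbox))
```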

In addition, the files coal_miner_DsLMF, large_coal_DsLMF, mine_safety_helmet_DsLMF, towine_DsLMF, miner_behavior_DsLMF and hydraulic_support_guard_plate_DsLMF are provided to help distinguish the images of mine personnel, large coal, towline, miners’ behavior, mine safety helmet and guard plate across scenarios in the DsLMF+ datasets; they give the image indices corresponding to each scene.

Technical Validation

To ensure the reliability of the DsLMF+ dataset, we conducted a comprehensive manual review of all images and their label annotations. Five members with rich working experience in the coal mining field checked the images and label information one by one for missing or wrong labels. To ensure the quality and usefulness of the dataset, the five members also jointly reviewed controversial cases, such as the size threshold for large coal, the angle of the guard plate in an image and its label, and the label accuracy for the underground towline, personnel behaviour and mine safety helmet. The review was completed through collective voting by the five members.

The DsLMF+ dataset is provided in both YOLO and COCO formats, which makes it convenient to use with currently popular top-ranked object detection networks. To verify the feasibility of the dataset, this work selected three top deep learning networks from the COCO object detection leaderboard, YOLOv7³⁴, DETA³⁵ and ViT-Adapter-L³⁶, and trained and validated them on the DsLMF+ dataset. The repositories of DETA, ViT-Adapter-L and YOLOv7 used for verification are https://github.com/jozhang97/deta, https://github.com/czczup/vit-adapter and https://github.com/wongkinyiu/yolov7, respectively. The models were trained on a machine with an Intel(R) Xeon(R) Gold 6330 CPU and an RTX A5000 GPU running Ubuntu 18.04. The hyper-parameters of the three algorithms follow the recommended default values, except that some values, such as input width and height, batch size, initial learning rate and number of epochs, were modified to suit the dataset in accordance with the recommendations of the original YOLOv7, DETA and ViT-Adapter-L research.

For dataset verification, models were trained and evaluated on the coal miners, large coal, towline, mine safety helmet, hydraulic support guard plate and miners’ behaviours datasets. Input images were resized to a height and width of 640 for network training. Table 8 presents the benchmark results of ViT-Adapter-L, DETA and YOLOv7 on the DsLMF+ datasets. Figure 4 shows the mAP curves of the three detection models during validation. The mAP values of the YOLOv7 model reach 0.986, 0.976, 0.978, 0.868, 0.913 and 0.997, respectively; those of the DETA model reach 0.976, 0.960, 0.958, 0.815, 0.914 and 0.989; and those of the ViT-Adapter-L model reach 0.966, 0.961, 0.963, 0.854, 0.928 and 0.989. These mAP values indicate that all three models perform well on the DsLMF+ dataset. The trained YOLOv7, DETA and ViT-Adapter-L models were also applied to randomly selected images from the 6 categories of coal miners, large coal, towline, mine safety helmet, hydraulic support guard plate and miners’ behaviours in the DsLMF+ dataset, and the detection results are shown in Fig. 5. The detection quality and accuracy demonstrate the reliability and practicability of the DsLMF+ datasets.
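The mAP metric rests on matching predicted boxes to ground-truth boxes by their intersection over union (IoU). A minimal sketch of that building block is given below, for boxes in COCO-style [x_min, y_min, width, height] form. The function name is an assumption, and the IoU threshold behind the reported mAP values is not stated in the text, so none is assumed here.

```python
def iou_xywh(box_a, box_b):
    """Intersection over union of two [x_min, y_min, width, height] boxes."""
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    # Width and height of the overlapping region (zero if the boxes do not overlap).
    inter_w = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    inter_h = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    intersection = inter_w * inter_h
    union = aw * ah + bw * bh - intersection
    return intersection / union if union > 0 else 0.0
```

A prediction is typically counted as correct when its IoU with a ground-truth box of the same class exceeds a chosen threshold; average precision is then computed per class from the ranked predictions and averaged over classes to obtain mAP.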

Table 8 Benchmark of ViT-Adapter-L, DETA and YOLOv7 on coal miners, large coal, towline, mine safety helmet, hydraulic support guard plate and miners’ behaviours in the DsLMF+ datasets.

We will further expand the DsLMF+ dataset to improve its applicability and generality for the fully mechanized coal mining face, and we encourage other researchers in the coal mining field to expand and improve it as well. The coal mine image dataset produced in this work is of great significance for applying deep learning object detection algorithms to the intelligent identification and classification of abnormal conditions in underground mining, and it aims to support further research and advancement of intelligence at the fully mechanized longwall mining face.

Table 9 The overview of the site-packages for ViT-Adapter-L, DETA and YOLOv7.

Code availability

The DsLMF+ datasets are publicly available at the figshare data repository³³, and the code for automatic filtering is published alongside the dataset, archived as “DsLMF.7z”. The annotation tool LabelImg can be downloaded from https://github.com/heartexlabs/labelImg; see its README file for usage. The code used for training and validation on the DsLMF+ datasets is the officially published open source code of DETA, ViT-Adapter-L and YOLOv7, available at https://github.com/jozhang97/deta, https://github.com/czczup/vit-adapter and https://github.com/wongkinyiu/yolov7, respectively. Table 9 lists the required site-packages and their versions for the three networks. The packages can be obtained by following the README files at the corresponding links and installed with the Python package installer (pip). Label conversion from YOLO format to COCO format can be done with the code at https://github.com/RapidAI/YOLO2COCO, which also provides a README file for reference.

Change history

06 August 2024
A Correction to this paper has been published: https://doi.org/10.1038/s41597-024-03713-2

References

1. Yang, L., Birhane, G. E., Zhu, J. & Geng, J. Mining employees safety and the application of information technology in coal mining: Review. J. Frontiers in Public Health. 9 (2021).
2. Gao, Y., Dai, Z. & Yuan, J. A multiobjective hybrid optimization algorithm for path planning of coal mine patrol robot. J. Computational Intelligence and Neuroscience. 6, 1–10 (2022).
3. Xu, Z., Li, J. & Zhang, M. A Surveillance Video Real-Time Analysis System Based on Edge-Cloud and FL-YOLO Cooperation in Coal Mine. J. IEEE Access. 9, 68482–68497 (2021).
4. Azam, B. et al. Aircraft detection in satellite imagery using deep learning-based object detectors. J. Microprocessors and Microsystems. 94, 104630 (2022).
5. Wang, D. L., Zeng, X. T., Wang, G. F. & Li, R. Stability of a face guard in a large mining height working face. J. International Journal of Simulation Modeling. 20, 547–558 (2021).
6. Pang, H., Zhang, Y., Cai, W., Li, B. & Song, R. A real-time object detection model for orchard pests based on improved YOLOv4 algorithm. J. Scientific Reports. 12, 13557 (2022).
7. Lin, L. et al. The SUSTech-SYSU dataset for automated exudate detection and diabetic retinopathy grading. J. Scientific Data. 7, 409 (2020).
8. Bauer, Z. et al. UASOL, a large-scale high-resolution outdoor stereo dataset. J. Scientific Data. 6, 162 (2019).
9. Nguyen, H. Q. et al. VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations. J. Scientific Data. 9, 429 (2022).
10. Lipkin, B. et al. Probabilistic atlas for the language network based on precision fMRI data from >800 individuals. J. Scientific Data. 9, 529 (2022).
11. Rostianingsih, S., Setiawan, A. & Halim, C. I. COCO (Creating Common Object in Context) Dataset for Chemistry Apparatus. J. Procedia Computer Science. 171, 2445–2452 (2020).
12. Srivastava, S. et al. Comparative analysis of deep learning image detection algorithms. J. Journal of Big Data. 8, 66 (2021).
13. Kiruthika, D. S. & Subalalitha, C. N. Intelligent deep learning empowered text detection model from natural scene images. J. International Journal on Advanced Science Engineering and Information Technology. 12, 1263–1268 (2022).
14. Francies, M. L., Ata, M. M. & Mohamed, M. A. A robust multiclass 3D object recognition based on modern YOLO deep learning algorithms. J. Concurrency and Computation: Practice and Experience. 34 (2021).
15. Varadarajan, V., Garg, D. & Kotecha, K. An Efficient Deep Convolutional Neural Network Approach for Object Detection and Recognition Using a Multi-Scale Anchor Box in Real-Time. J. Future Internet. 13, 307 (2021).
16. Shen, F., Wang, Z. & Lu, Z. Weakly supervised classification model for zero-shot semantic segmentation. J. Electronics Letters. 56, 1247–1250 (2020).
17. Wu, Q. F. et al. Improved Mask R-CNN for Aircraft Detection in Remote Sensing Images. J. Sensors. 21, 2618 (2021).
18. Qu, Z., Zhu, F. & Qi, C. Remote Sensing Image Target Detection: Improvement of the YOLOv3 Model with Auxiliary Networks. J. Remote Sensing. 13 (2021).
19. Xia, G. S. et al. DOTA: A Large-scale Dataset for Object Detection in Aerial Images. J. IEEE Conference on Computer Vision and Pattern Recognition. (2018).
20. Ruiz, I. & Serrat, J. Hierarchical Novelty Detection for Traffic Sign Recognition. J. Sensors (Basel, Switzerland). 22, 4389 (2022).
21. Gao, X. et al. Improved Traffic Sign Detection Algorithm Based on Faster R-CNN. J. Applied Sciences. 12, 8948 (2022).
22. Lu, Y., Lu, J., Zhang, S. & Hall, P. Traffic signal detection and classification in street views using an attention model. J. Computational Visual Media. 4, 253–266 (2018).
23. Luo, S., Li, X. & Zhang, X. Wide aspect ratio matching for robust face detection. J. Multimedia Tools and Applications. 1–18 (2022).
24. Lin, X. et al. Task-oriented feature-fused network with multivariate dataset for joint face analysis. J. IEEE Transactions on Cybernetics. 50, 1292–1305 (2020).
25. Ming, X. et al. Group Sampling for Scale Invariant Face Detection. J. IEEE Transactions on Pattern Analysis and Machine Intelligence. 44, 985–1001 (2020).
26. Dai, G., Hu, L., Fan, J., Yan, S. & Li, R. A Deep Learning-Based Object Detection Scheme by Improving YOLOv5 for Sprouted Potatoes Datasets. J. IEEE Access. 10, 85416–85428 (2022).
27. Zhang, Z. D. et al. FINet: An Insulator Dataset and Detection Benchmark Based on Synthetic Fog and Improved YOLOv5. J. IEEE Transactions on Instrumentation and Measurement. 71 (2022).
28. Kumar, A., Kalia, A., Verma, K., Sharma, A. & Kaushal, M. Scaling up face masks detection with YOLO on a novel dataset. J. Optik. 239 (2022).
29. Neelam Jaikishore, C. et al. Implementation of Deep Learning Algorithm on a Custom Dataset for Advanced Driver Assistance Systems Applications. J. Applied Sciences. 12, 8927 (2022).
30. Luo, Y. & Chen, J. Two-Dimensional Codes Recognition Algorithm Based on Yolov5. J. Academic Journal of Computing & Information Science. 5, 68–72 (2022).
31. Zeng, H. & Fang, Y. Implementation of Video Transcoding Client Based on FFMPEG. J. Advanced Materials Research. 1748–1752 (2013).
32. Guan, Z., Hou, C., Zhou, S. & Guo, Z. Research on Underwater Target Recognition Technology Based on Neural Network. J. Wireless Communications and Mobile Computing. 3, 1–12 (2022).
33. Zhang, X. et al. An open dataset for intelligent recognition and classification of abnormal condition in longwall mining. figshare https://doi.org/10.6084/m9.figshare.c.6307599.v1 (2023).
34. Wang, C. Y., Bochkovskiy, A. & Liao, H. M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. J. arXiv preprint arXiv:2207.02696. 1–15 (2022).
35. Ouyang-Zhang, J., Cho, J., Zhou, X. & Krahenbuhl, P. NMS Strikes Back. J. arXiv preprint arXiv:2212.06137. 1–10 (2022).
36. Chen, Z. et al. Vision Transformer Adapter for Dense Predictions. J. arXiv preprint arXiv:2205.08534. 1–20 (2022).

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 52104166) and the Shaanxi Coal Joint Funds (No. 2021JLM-03). The authors would like to thank the coal mines of Shaanxi Coal and Chemical Industry Group Sunjiacha Longhua Mining Co., LTD for providing access to the image database of the fully mechanized mining face and for agreeing to open the dataset. We are especially grateful to all those who participated in the dataset construction and label annotation process, including instructors, seniors, and other participants.

Author information

Authors and Affiliations

School of Mechanical Engineering, Xi’an University of Science and Technology, No.58, Mid-Yanta Road, Xi’an, 710054, China
Wenjuan Yang, Xuhui Zhang, Bing Ma, Yanqun Wang, Yujia Wu, Jianxing Yan, Chao Zhang, Jicheng Wan, Yue Wang, Mengyao Huang, Yuyang Li & Dian Zhao
Shaanxi Key Laboratory of Mine Electromechanical Equipment Intelligent Detection and Control, No.58, Yanta Road, Xi’an, 710054, China
Wenjuan Yang & Xuhui Zhang
MARCO automatic control system development Co.,LTD, No.20, Fenghui South Road, Xi’an, 710054, China
Yongwei Liu

Authors

Wenjuan Yang
Xuhui Zhang
Bing Ma
Yanqun Wang
Yujia Wu
Jianxing Yan
Yongwei Liu
Chao Zhang
Jicheng Wan
Yue Wang
Mengyao Huang
Yuyang Li
Dian Zhao

Contributions

Professor Xuhui Zhang was mainly responsible for the overall planning and organization of the dataset. Associate Professor Wenjuan Yang was responsible for writing the manuscript. Yongwei Liu was in charge of collecting the dataset images. Mengyao Huang was responsible for abnormal data filtering. Jianxing Yan, Bing Ma, Chao Zhang and Jicheng Wan were responsible for the dataset label annotation. Xuhui Zhang, Wenjuan Yang, Yuyang Li, Yue Wang and Dian Zhao were responsible for the dataset label review and put forward valuable opinions. Yujia Wu and Yanqun Wang were responsible for training on the dataset. The manuscript was reviewed by all authors.

Corresponding author

Correspondence to Xuhui Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Yang, W., Zhang, X., Ma, B. et al. An open dataset for intelligent recognition and classification of abnormal condition in longwall mining. Sci Data 10, 416 (2023). https://doi.org/10.1038/s41597-023-02322-9

Received: 28 November 2022
Accepted: 20 June 2023
Published: 27 June 2023
DOI: https://doi.org/10.1038/s41597-023-02322-9
