Abstract

Objective. To develop a deep learning-based model for monitoring knee joint martial arts injuries, and to train and evaluate its effectiveness. Methods. Knee MRI images of 1,546 patients with knee joint martial arts injuries from 2015 to 2020 were collected. Through manual annotation, the data set was divided into six categories: meniscus injury, tendon injury, ligament injury, bone and osteochondral injury, synovial capsule injury, and peripheral soft tissue injury. A human knee MRI image database was established, and the data set was split into training, validation, and test sets. A deep neural network was built and trained on the training and validation sets to locate knee joint injuries and classify the specific injury type. The model's validity was verified on the test set, and its sensitivity, specificity, and mean accuracy in detecting lesions were evaluated. Results. On the test set, the accuracies for meniscus injury, tendon injury, ligament injury, bone and osteochondral injury, synovial capsule injury, and peripheral soft tissue injury were 83.2%, 89.0%, 88.0%, 85.9%, 85.6%, and 83.5%, respectively, with an overall average accuracy of 86.0%. The sensitivity and specificity of the model were 91.3% and 87.3%, respectively. Conclusion. Applying deep learning to the classification and detection of knee joint martial arts injuries can significantly improve diagnostic performance, reduce diagnosis time and misdiagnosis rate, and provide decision support for surgery.

1. Introduction

In martial arts teaching and training, fast reaction and movement speed place high demands on the explosive power of athletes' lower limbs. Athletes must complete rapid knee flexion, knee extension, half-squat, and similar movements, which impose a large local load on the knee joint and easily lead to common knee injuries. According to statistics, the prevalence of knee osteoarthritis among Chinese wushu athletes is as high as 15.6% [1].

Medically, knee lesions are usually diagnosed by magnetic resonance imaging (MRI). MRI clearly shows articular cartilage and bone regions and is typically segmented layer by layer by an experienced physician [2]. However, because the anatomical structure of the knee joint is relatively complex and an injury often involves multiple tissues, diagnosis is challenging and missed diagnoses are common.

In recent years, with the development and application of convolutional neural networks in medical image analysis, deep learning-based diagnosis has become a feasible method in medical image segmentation [3].


In knee imaging applications, four types of structures are to be segmented: femur bone (FB), femur cartilage (FC), tibia bone (TB), and tibia cartilage (TC) [4]. Because these structures differ in shape and size, conventional image recognition algorithms have difficulty identifying multiple structures simultaneously. As a result, few diagnostic methods for knee joint lesions have been combined with deep learning.

Based on the U-net encoding-decoding architecture, this paper designs a multi-scale context feature extraction module and a multi-output fusion module for segmentation targets of different sizes in knee MRI, strengthening feature reuse [5]. A cascaded U-net is proposed for knee joint image segmentation, and a knee MRI image library is used to train the model and verify its effectiveness.

2. Methodology

The U-net used in this paper is an advanced feedforward neural network [6]. A feedforward neural network, also known as a multilayer perceptron, is a one-way multilayer artificial neural network: information is passed from each layer to the next, and later layers do not influence or feed back to earlier ones [7].

Due to the complex structure of the knee joint and the imbalance in the sizes of its tissues, this paper adopts a cascaded U-net framework with expanded functionality and flexibility for multi-mode segmentation and recognition of human knee MRI.

2.1. Basic U-Net Model

U-net is essentially an encoder-decoder model, as shown in Figure 1.

In the encoding stage, the network extracts image feature information through cascaded convolution modules. It reduces the resolution of the feature map through max pooling, which increases the receptive field of subsequent convolution operations and captures more global information [8].

In the decoding stage, the max pooling operation is correspondingly replaced by a deconvolution (transposed convolution) operation to restore the resolution of the feature map. At the same time, the number of channels in each small module is halved.

The core operation in the U-net network is the skip connection introduced between the encoding and decoding layers, which reduces the loss of low-level feature information caused by pooling in the encoding stage. High-level features help with pixel classification, while low-level features help generate accurate boundaries. The skip connection directly splices the low-level detail features and high-level semantic features of corresponding stages in the encoder and decoder, then compresses and fuses them through convolution, finally achieving high-quality segmentation.
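
A minimal PyTorch sketch of one such decoder stage may make this concrete; the module layout and channel counts here are illustrative assumptions, not the paper's exact architecture.

```python
# Sketch of a U-net decoder stage with a skip connection (assumed layout).
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    # Two 3 x 3 convolutions, as in the standard U-net stage.
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, kernel_size=3, padding=1),
        nn.ReLU(inplace=True),
    )

class UpBlock(nn.Module):
    """One decoder stage: upsample, splice the encoder features, fuse."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        # Deconvolution restores resolution and halves the channel count.
        self.up = nn.ConvTranspose2d(in_ch, out_ch, kernel_size=2, stride=2)
        # Concatenation doubles the channels; the conv block compresses them.
        self.fuse = conv_block(out_ch * 2, out_ch)

    def forward(self, x, skip):
        x = self.up(x)
        # Skip connection: splice low-level encoder detail features with
        # high-level decoder semantic features along the channel dimension.
        x = torch.cat([skip, x], dim=1)
        return self.fuse(x)
```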

2.2. Cascading U-NET Model

Although U-net has achieved excellent results in biomedical image recognition, there is still room for improvement. Building on the traditional U-net, this paper adds a segmentation network that takes the localized and cropped output of the positioning U-net as its input and obtains accurate segmentation results by updating the network's training parameters [9]. The following improvements are made to the structure of the positioning network.

2.2.1. Improved down Sampling

In the positioning network, max pooling is used to construct the pooling layers for down-sampling. This down-sampling mode is conducive to extracting salient features such as edges and strengthens the translation invariance of network features, giving high-level features a larger receptive field [10].

Although pooling removes some unimportant semantic features during feature extraction, it also deletes some significant ones. To solve this problem, this paper replaces the original pooling layers with a convolution layer with a 3 × 3 × 3 kernel and a stride of 2, followed by an LReLU activation layer. This convolution-based down-sampling reduces the resolution of the input feature map and the size of the input signal while enlarging the receptive field of features in subsequent network layers.

Besides fulfilling the primary function of the pooling layer, convolutional down-sampling preserves more details of the input image, so richer semantic features are extracted. In addition, replacing the pooling layer with a convolution layer for down-sampling can reduce the amount of convolution computation and the memory occupied by the network during training [11].
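
As a rough sketch, assuming 3D feature maps as implied by the 3 × 3 × 3 kernel (channel counts and the LReLU slope are assumptions), the replacement might look like this:

```python
# Down-sampling by strided convolution instead of max pooling (sketch).
import torch.nn as nn

def strided_downsample(in_ch, out_ch):
    return nn.Sequential(
        # A stride-2 convolution halves the spatial resolution, but unlike a
        # fixed max-pooling window its weights are learned, so fewer details
        # of the input are discarded.
        nn.Conv3d(in_ch, out_ch, kernel_size=3, stride=2, padding=1),
        nn.LeakyReLU(negative_slope=0.01, inplace=True),
    )
```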

2.2.2. Residual Module

To avoid the vanishing and exploding gradients caused by excessive network depth, and to further improve the training efficiency and generalization ability of the model, a residual mechanism is introduced to optimize the model.

The residual mechanism was originally proposed to solve the degradation problem caused by increasing the number of network layers [12, 13]. Introducing residual blocks can effectively control gradient vanishing without adding extra parameters. Because the residual is usually small, residual learning is easier, which further strengthens the network's feature representation ability and improves network performance.

Figure 2(a) shows the basic module in U-net, and Figure 2(b) shows the residual module combined with Batch Normalization (BN); the activation function used is ReLU. Because the dimensions change inside the residual module, that is, the number of input channels does not match the number of output channels and the two cannot be added directly, a 1 × 1 convolution is used to transform the channel count; at the same time, the 1 × 1 convolution does not introduce many parameters [14–16].
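
A minimal PyTorch sketch of such a residual module, assuming the convolution-BN-ReLU layout of Figure 2(b) (exact layer sizes are not specified in the paper):

```python
# Residual module with BN and a 1 x 1 shortcut convolution (sketch).
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1, bias=False),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, kernel_size=3, padding=1, bias=False),
            nn.BatchNorm2d(out_ch),
        )
        # The 1 x 1 convolution matches the channel dimensions so the shortcut
        # can be added; it introduces few extra parameters.
        self.shortcut = (nn.Conv2d(in_ch, out_ch, kernel_size=1)
                         if in_ch != out_ch else nn.Identity())
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        # Residual learning: the body only needs to fit the (usually small)
        # residual, which eases optimization in deep networks.
        return self.relu(self.body(x) + self.shortcut(x))
```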

2.2.3. Extended Convolution Module

In the U-net network, max pooling is used for down-sampling the feature map, which increases the receptive field of the convolution operation while keeping the kernel small, but loses spatial location information to a certain extent [17]. Therefore, an extended (dilated) convolution mechanism is introduced in this paper, as shown in Figure 3.

Figure 3(a) shows a two-dimensional convolution with a kernel size of 3, an expansion rate of 1, and a stride of 1, which is equivalent to conventional convolution: the receptive field is a 3 × 3 region and the number of parameters is 9. Figure 3(b) shows a two-dimensional convolution with a kernel size of 3, an expansion rate of 2, and a stride of 1; except at the positions of the blue dots, the weights are 0. Although the number of parameters is still 9, the receptive field grows to 7 × 7.
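
The contrast can be sketched in PyTorch as below. Note that PyTorch's convention gives a single dilation-2 layer a 5 × 5 span; the 7 × 7 receptive field quoted above presumably counts the effect of stacking the dilated layer on a preceding regular 3 × 3 layer.

```python
# Regular vs. dilated 3 x 3 convolution: same 9 weights, larger spatial reach.
import torch
import torch.nn as nn

x = torch.randn(1, 16, 64, 64)  # dummy input: (batch, channels, H, W)

conv_regular = nn.Conv2d(16, 16, kernel_size=3, dilation=1, padding=1)
conv_dilated = nn.Conv2d(16, 16, kernel_size=3, dilation=2, padding=2)

# Both layers preserve the 64 x 64 resolution and have the same number of
# parameters; per layer, the spatial span is dilation * (kernel - 1) + 1.
print(conv_regular(x).shape)  # torch.Size([1, 16, 64, 64])
print(conv_dilated(x).shape)  # torch.Size([1, 16, 64, 64])
```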

2.2.4. Deep Supervision Mechanism

In this paper, the segmentation network introduces a deep supervision mechanism into the hidden layers of the up-sampling (expansion) path: the last three layers, each of a different size, are supervised separately, and the feature information of these hidden layers is routed to the output layer [18].

The deep supervision mechanism used here does not require additional objective functions. Supervision is applied directly by convolving each supervised layer to obtain a feature map of category information; the category information of a deeper layer is up-sampled and overlaid on the adjacent shallower one, and by repeating this step by step, a single output feature map of category information is obtained and used to compute the objective function. By supervising the hidden layers directly, their semantic feature information is effectively retained; meanwhile, sufficient training of the shallow layers effectively controls gradient vanishing and improves network performance. The basic structure of the deep supervision mechanism is shown in Figure 4.
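
A hedged PyTorch sketch of this step-by-step overlay, assuming three decoder feature maps ordered coarse to fine, a 1 × 1 convolution per supervised layer, and bilinear upsampling (all assumptions, since the paper gives no layer details):

```python
# Deep supervision: per-layer category maps, upsampled and overlaid (sketch).
import torch.nn as nn
import torch.nn.functional as F

class DeepSupervisionHead(nn.Module):
    def __init__(self, chs, num_classes):
        # chs: channel counts of the three supervised decoder layers,
        # ordered coarse to fine.
        super().__init__()
        self.heads = nn.ModuleList(
            nn.Conv2d(c, num_classes, kernel_size=1) for c in chs)

    def forward(self, feats):
        # feats: the three decoder feature maps, coarse to fine.
        out = None
        for head, feat in zip(self.heads, feats):
            logits = head(feat)  # category information of this hidden layer
            if out is not None:
                # Upsample the accumulated deeper prediction and overlay it
                # on the adjacent shallower layer.
                out = F.interpolate(out, size=logits.shape[2:],
                                    mode="bilinear", align_corners=False)
                logits = logits + out
            out = logits
        return out  # single fused map used to compute the objective function
```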

2.3. Experimental Settings
2.3.1. MRI Classification and Labeling of Knee Joint

According to the clinical diagnostic criteria of knee MRI, knee injuries were classified by location into 6 categories and 21 items covering all common lesions of sports-related knee injury: (1) meniscus injury (grade I-II meniscus injury, grade III meniscus injury); (2) tendon injury (quadriceps tendon injury, medial femoral tendon injury, lateral femoral tendon injury, medial or lateral head of gastrocnemius tendon injury, popliteal tendon injury); (3) ligament injury (anterior cruciate ligament injury, posterior cruciate ligament injury, medial collateral ligament injury, lateral collateral ligament injury, patellar ligament injury, iliotibial band injury); (4) bone and osteochondral injury (chondromalacia, bone marrow edema, osteochondritis dissecans); (5) synovial capsule injury (joint effusion, synovitis); (6) peripheral soft tissue injury (subcutaneous fasciitis, fat pad edema, popliteal cyst) [19].

In this paper, MRI examinations of patients with martial arts injuries of the knee joint from 2015 to 2020 were screened. After comparative analysis, repetitive and poor-quality data were removed, and 1,546 MRI examinations were finally obtained. For each examination, the location and type of lesions in the image were marked by manual labeling. Among the 1,546 MRI images, there were 1,242 cases of meniscus injury, 29 cases of tendon injury, 254 cases of ligament injury, 557 cases of bone and osteochondral injury, 1,494 cases of synovial capsule injury, and 340 cases of soft tissue injury. The annotation results are shown in Table 1.

2.3.2. Data Preprocessing

Observation of the annotation results shows that the vast majority of knee lesions involved two or more combined injuries. Among the different injury types, synovial capsule injury accounted for the highest proportion (96.6%), while tendon injury accounted for the lowest (1.9%).

To further improve the model's generalization ability, the training and test sets were divided by cross-validation. The original data were first divided into ten equal folds, each preserving the same proportions of the different injury types as the full data set. In each training round, one fold was selected as the test set and the rest as the training set. Since the training and test sets do not intersect, over-fitting can be avoided to a certain extent.
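
A minimal sketch of this split using scikit-learn's StratifiedKFold (which stratifies on one label per case; the study's multi-injury examinations would need a multi-label stratification scheme):

```python
# Stratified 10-fold split: each fold preserves the class proportions.
from sklearn.model_selection import StratifiedKFold

def ten_fold_splits(images, labels):
    skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=42)
    for train_idx, test_idx in skf.split(images, labels):
        # Train and test indices never overlap within a round.
        yield train_idx, test_idx
```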

2.3.3. Judgment Criteria

The model's prediction consists of two parts: the location of the lesion and the lesion type.

Intersection-over-Union (IoU) was used as the criterion for lesion localization. IoU is a concept from object detection: the overlap rate between the generated candidate bound and the ground-truth bound, that is, the ratio of their intersection to their union, as shown in Equation (1):

IoU = area(C ∩ G) / area(C ∪ G).    (1)

In the formula, area(C) and area(G) represent the lesion regions predicted by the model and labeled manually, respectively. The larger the IoU value, the better the model's predicted lesion location fits the manually labeled one. In this paper, the model's lesion localization is considered successful when the IoU value is greater than 0.35.
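
A direct implementation of Equation (1), assuming lesion regions are given as axis-aligned boxes with (x1, y1, x2, y2) corners (the paper does not specify its region representation):

```python
# IoU of a predicted region C and a manually labeled region G (sketch).
def iou(box_c, box_g):
    # Corners of the intersection rectangle.
    x1, y1 = max(box_c[0], box_g[0]), max(box_c[1], box_g[1])
    x2, y2 = min(box_c[2], box_g[2]), min(box_c[3], box_g[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_c = (box_c[2] - box_c[0]) * (box_c[3] - box_c[1])
    area_g = (box_g[2] - box_g[0]) * (box_g[3] - box_g[1])
    return inter / (area_c + area_g - inter)

# Localization counts as successful when IoU > 0.35 (the paper's criterion).
print(iou((0, 0, 10, 10), (5, 5, 15, 15)))  # ~0.143, below the threshold
```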

The sensitivity, specificity, F1 score, and average accuracy commonly used for classification algorithms were used to evaluate the model's prediction of lesion types.
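
For reference, these metrics follow directly from a binary confusion matrix; the counts below are hypothetical inputs, not the study's data:

```python
# Sensitivity, specificity, F1, and accuracy from confusion-matrix counts.
def classification_metrics(tp, fp, tn, fn):
    sensitivity = tp / (tp + fn)   # recall on lesion-positive cases
    specificity = tn / (tn + fp)   # recall on lesion-negative cases
    precision = tp / (tp + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return sensitivity, specificity, f1, accuracy
```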

Since the model's prediction has two parts, a case is counted as a classification success only when both the lesion location and the lesion type are predicted successfully; the relevant statistics are calculated on this basis.

To compare model performance, this paper takes the traditional U-net model as the baseline and uses the same data set and training method to analyze and compare the predicted results.

3. Experimental Results

After training, the lesion recognition results of the cascaded U-net model and the traditional U-net model are shown in Figure 5.

In Figure 5, the red, green, and yellow boxes represent the cascaded U-net model, the traditional U-net model, and the manually identified lesion regions, respectively, and Case 1 to Case 6 represent the six lesion types described above.

The lesion area identified by the cascaded U-net model is essentially the same as the manually annotated one, and the model delineates the region size more precisely than manual annotation. The traditional U-net model can roughly identify the lesion location, but the identified area is larger and noticeably less accurate than that of the cascaded U-net model. The performance parameters of the two models are shown in Table 2.

In Table 2, the left side shows the recognition performance of the cascaded U-net model, and the right side that of the traditional U-net model.

Analysis of the data in the table shows that, for the recognition of different lesion types, the indicators of the cascaded U-net are slightly higher than those of the traditional U-net model. For ligament injury, the recognition performance of the two models was essentially equal, possibly because the ligament injury samples were few and over-sampling gave the models a higher recognition intensity for such samples. The experimental results show that the cascaded U-net model is highly feasible for knee joint medical image recognition and performs significantly better than the traditional model.

4. Discussion

The strenuous movements of wushu make it especially likely to cause sports injuries; according to surveys, knee injuries are particularly serious among wushu athletes. In other occupations with accumulated exercise load, such as other athletes and soldiers, the incidence of common knee injuries is also high.

At present, the diagnosis of knee injuries relies mainly on manual reading of MRI. Because the knee joint has a complex structure and the parts to be identified differ significantly in shape and size, manual identification takes a long time, has limited accuracy, and sometimes leads to missed diagnoses and misdiagnoses.

In recent years, deep learning has developed rapidly and has been widely used in the medical field, for example to segment human organs and reconstruct anatomical structures. In medical imaging, a deep learning model can be trained to replace manual recognition of anatomical features and provide a reference for manual diagnosis.

In this paper, a deep learning algorithm is combined with knee MRI, and the improved cascaded U-net model is used to identify the site and type of knee lesions, achieving good results on the test data set. Compared with the traditional deep learning model and the manual labeling method, it has certain clinical application value.

5. Conclusion

In this paper, deep learning algorithms are combined with knee MRI diagnosis. The traditional U-net model is improved to increase training efficiency and model performance, yielding a clear improvement over the traditional manual labeling method. On the test data set, the model reached an average recognition accuracy of 86.0% and could accurately locate lesions and classify their types, indicating that the model has high application value in medicine and merits further development and research.

Data Availability

The image data used to support the findings of this study have been deposited in the I Do Imaging (IDI) dataset (https://idoimaging.com/home).

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

Meng Xue and Yan Liu contributed equally to the work and should be considered as co-first authors.

Acknowledgments

The authors thank the Orthopedics Department of Jiangmen Hospital of Traditional Chinese Medicine and Jiangmen Vocational and Technical College of Jinan University for the support of patient data.