摘要
Objectives To build and validate deep learning and machine learning fusion models to classify benign, malignant, and intermediate bone tumors based on patient clinical characteristics and conventional radiographs of the lesion. Methods In this retrospective study, data were collected with pathologically confirmed diagnoses of bone tumors between 2012 and 2019. Deep learning and machine learning fusion models were built to classify tumors as benign, malignant, or intermediate using conventional radiographs of the lesion and potentially relevant clinical data. Five radiologists compared diagnostic performance with and without the model. Diagnostic performance was evaluated using the area under the curve (AUC). Results A total of 643 patients' (median age, 21 years; interquartile range, 12-38 years; 244 women) 982 radiographs were included. In the test set, the binary category classification task, the radiological model of classification for benign/not benign, malignant/nonmalignant, and intermediate/not intermediate had AUCs of 0.846, 0.827, and 0.820, respectively; the fusion models had an AUC of 0.898, 0.894, and 0.865, respectively. In the three-category classification task, the radiological model achieved a macro average AUC of 0.813, and the fusion model had a macro average AUC of 0.872. In the observation test, the mean macro average AUC of all radiologists was 0.819. With the three-category classification fusion model support, the macro AUC improved by 0.026. Conclusion We built, validated, and tested deep learning and machine learning models that classified bone tumors at a level comparable with that of senior radiologists. Model assistance may somewhat help radiologists' differential diagnoses of bone tumors.
- 
                                单位南方医科大学
