Chinese Journal of Dermatology ›› 2018, Vol. 51 ›› Issue (7): 486-489.doi: 10.3760/cma.j.issn.0412-4030.2018.07.002

Previous Articles     Next Articles

Comparison of diagnostic performance of dermatologists versus deep convolutional neural network for dermoscopic images of pigmented nevus and seborrheic keratosis

Wang-Shiqi 1,Jie Liu 3, 1, 3, 4, 3,   

  • Received:2017-11-14 Revised:2018-03-30 Online:2018-07-15 Published:2018-06-29
  • Contact: Jie Liu E-mail:Liujie04672@pumch.cn
  • Supported by:
    CAMS Innovation Fund for Medical Sciences

Abstract: Wang Shiqi, Liu Jie, Zhu Chenyu, Shu Chang, Zhou Hangning, Xie Fengying, Xu Tao, Jin Hongzhong Department of Dermatology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100730, China (Wang SQ, Liu J, Zhu CY, Shu C, Jin HZ); Beijing Key Laboratory of Precision Medicine for Diagnosis and Treatment of Allergic Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100730, China (Liu J); Image Processing Center, School of Astronautics, Beihang University, Beijing 100191, China (Zhou HN, Xie FY); Department of Epidemiology and Statistics, School of Basic Medicine, Peking Union Medical College, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, Beijing 100005, China (Xu T) Corresponding author: Liu Jie, Email: Liujie04672@pumch.cn 【Abstract】 Objective To compare the diagnostic accuracies of deep convolutional neural network (CNN) and dermatologists for pigmented nevus and seborrheic keratosis. Methods CNN network ResNet?50 was trained with 5 094 dermoscopic images of pigmented nevus and seborrheic keratosis using transfer learning, so as to establish a CNN two?classification model. Then, this model was applied to the automatic classification of 30 dermoscopic images of pigmented nevus and 30 dermoscopic images of seborrheic keratosis. Meanwhile, in combination with clinical photos of skin lesions, 95 experienced dermatologists who had received dermoscopy training gave their diagnosis for the above 60 dermoscopic images. The diagnostic accuracies were compared between the two methods, and misclassified images were further analyzed. Results The CNN automatic classification model had the diagnostic accuracies of 100% (30/30) and 76.67% (23/30) for pigmented nevus and seborrheic keratosis respectively, and the total accuracy was 88.33% (53/60). The average diagnostic accuracies of 95 dermatologists were 82.98%(25.8/30)and 85.96% (24.9/30) for pigmented nevus and seborrheic keratosis respectively, and the total accuracy was 84.47% (50.7/60). There were no significant differences in the diagnostic accuracies for pigmented nevus or seborrheic keratosis between the CNN automatic classification model and 95 dermatologists(χ2 = 0.38, P > 0.05). The dermoscopic images misclassified by CNN were divided into 3 categories: special?type lesions with high pigment content and marked keratosis, typical skin lesions with interference factors, and typical skin lesions without definite reasons for misclassification. Conclusions The performance of CNN automatic classification model is similar to that of experienced dermatologists in the two classification of pigmented nevus and seborrheic keratosis. The reasons for misclassification by CNN still need to be explored by dermatologists and professionals in artificial intelligence.

Key words: Nevus, pigmented, Keratosis, seborrheic, Dermoscopy, Neural networks (computer), Artificial intelligence, Convolutional neural network