Classification of imbalanced oral cancer image data from high-risk population
- Song, Bofan;
- Li, Shaobai;
- Sunny, Sumsum;
- Gurushanth, Keerthi;
- Mendonca, Pramila;
- Mukhia, Nirza;
- Patrick, Sanjana;
- Gurudath, Shubha;
- Raghavan, Subhashini;
- Tsusennaro, Imchen;
- Leivon, Shirley T;
- Kolur, Trupti;
- Shetty, Vivek;
- Bushan, Vidya;
- Ramesh, Rohan;
- Peterson, Tyler;
- Pillai, Vijay;
- Wilder-Smith, Petra;
- Sigamani, Alben;
- Suresh, Amritha;
- Kuriakose, Moni Abraham;
- Birur, Praveen;
- Liang, Rongguang
- et al.
Abstract
Significance
Early detection of oral cancer is vital for high-risk patients, and machine learning-based automatic classification is well suited to disease screening. However, current datasets collected from high-risk populations are imbalanced, which often degrades classification performance.
Aim
To reduce the class bias caused by data imbalance.
Approach
We collected 3851 polarized white light cheek mucosa images using our customized oral cancer screening device. We used weight balancing, data augmentation, undersampling, focal loss, and ensemble methods to improve neural network performance on oral cancer image classification with the imbalanced multi-class datasets captured from high-risk populations during oral cancer screening in low-resource settings.
Results
Applying both data-level and algorithm-level approaches to the deep learning training process improved the performance on the minority classes, which were initially difficult to distinguish. The accuracy of the "premalignancy" class also increased, which is ideal for screening applications.
Conclusions
Experimental results show that the class bias induced by imbalanced oral cancer image datasets can be reduced using both data- and algorithm-level methods. Our study may provide an important basis for understanding the influence of imbalanced datasets on oral cancer deep learning classifiers and how to mitigate it.
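As an illustration of two of the algorithm-level methods named in the abstract (weight balancing and focal loss), the following is a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not give their weighting formula, focal-loss parameters, or class counts. The inverse-frequency formula (the same heuristic scikit-learn uses for `class_weight="balanced"`), the value `gamma=2.0`, the function names, and the toy label list are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels):
    """Assumed weighting scheme: n_samples / (n_classes * class_count).

    Rare classes receive larger weights, so their errors count more
    in the loss.
    """
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def focal_loss(probs, target, gamma=2.0, weight=1.0):
    """Focal loss for one sample: -weight * (1 - p_t)**gamma * log(p_t).

    probs  : predicted class probabilities (sequence summing to 1)
    target : index of the true class
    gamma  : focusing parameter; gamma=0 reduces to weighted cross-entropy
    """
    p_t = min(max(probs[target], 1e-12), 1.0)  # clamp to avoid log(0)
    return -weight * (1.0 - p_t) ** gamma * math.log(p_t)


# Hypothetical toy labels, NOT the class distribution of the study.
toy_labels = ["majority"] * 6 + ["premalignancy"] * 3 + ["rare"] * 1
weights = inverse_frequency_weights(toy_labels)

# A confidently correct prediction contributes far less than an uncertain one.
easy = focal_loss([0.9, 0.1], target=0)
hard = focal_loss([0.3, 0.7], target=0)
```

With `gamma=0` and `weight=1.0`, `focal_loss` is ordinary cross-entropy. Increasing `gamma` shrinks the contribution of well-classified samples, so training focuses on the hard examples, which in an imbalanced dataset are often the minority-class ones.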