Abstract: Skin cancer is an uncommon but serious malignancy. Examination of dermoscopic images
and biopsy are required for cancer detection. Deep learning (DL) is extremely effective in learning
characteristics and predicting malignancies. However, DL requires a large number of images to
train. Image augmentation and transfer learning were employed to overcome the shortage of images.
In this study, we divided images into two categories: benign and malignant. To train and test
our models, we used the public ISIC 2020 database. Melanoma is classified as malignant in the ISIC
2020 dataset. Along with classification, the dataset was analyzed to characterize its variation. The
performance of three widely used pretrained models was then benchmarked in terms of training and
validation accuracy. Three optimizers were employed to minimize the loss: RMSProp, SGD, and ADAM.
Using ResNet, VGG16, and MobileNetV2, we obtained training accuracies of 98.73%, 99.12%, and 99.76%,
respectively. Using these three pretrained models, we attained a validation accuracy of 98.39%.
Keywords: pretrained model; transfer learning; skin cancer; deep learning; ISIC 2020
1. Introduction
One of the most frequent types of cancer is skin cancer. Melanoma is responsible for
75% of skin cancer fatalities, according to the American Cancer Society. As a result,
dermatologists examine each patient's moles for melanoma, the deadliest kind. UV
radiation damages melanocyte cells in the human body, leading to melanoma,
a kind of skin cancer. Skin cancer is the most frequent malignancy, accounting for
one-third of all malignancies globally, according to the World Health Organization
(WHO). Basal cell carcinoma (BCC), squamous cell carcinoma (SCC), and malignant
melanoma (MM) are the most common kinds. BCC and SCC are non-melanocytic cancers,
and the majority of skin cancers are non-melanocytic, although melanoma is
the deadliest because it spreads quickly if not diagnosed and treated early. Several
factors, such as genes, hair color, and an increased incidence
of benign melanocytic nevi and dysplastic nevi, are implicated in melanoma
diagnosis. Skin cancer is caused by abnormal melanocyte cell growth, which affects
surrounding tissues by multiplying and spreading through lymph nodes [1]. Skin cancer
is presently a major public health concern, with over 123,000 new cases diagnosed each
year worldwide. Melanoma accounts for 75% of skin cancer mortality. Furthermore,
according to the American Cancer Society, 100,000 new cases of melanoma
were expected to be detected by the end of 2020, with 7,000 people dying from the condition.
Melanoma is responsible for around 9000 deaths in the United States each year [2]. Moreover,
if this cancer is not detected early, the cost of treatment is significant, exceeding
$134,000 in its fourth stage [3]. Dermatologists first detect melanoma by examining images
of the cancer and moles, among other diagnostic options. They also check for "ugly
ducklings," or outlier lesions around the moles that might be melanoma. Even so,
the outcomes of visual examination are not always precise and accurate. As a result, artificial intelligence (AI) has the
potential to assist physicians in accurately identifying melanoma. AI-based detection
2.1 Background
In the early twentieth century, skin cancer was detected by visual inspection alone.
[11] identified a few criteria used in such examinations, including size, bleeding, and ulceration.
This procedure, however, depends mostly on physicians' visual examinations rather
than any sophisticated or advanced technology. Another detection approach, dermoscopy,
or epiluminescence microscopy, has been demonstrated to be 75% to 85% accurate
[12]. A biopsy was conducted when a practitioner was unable to determine whether a
mole was melanoma [11]. Several CNN-based classifiers have recently entered the
picture, aiding dermatologists in better identifying melanoma. In fact, [3] classified skin
lesion images as benign or malignant with 95.23 percent accuracy. Their CNN was built
with four convolutional layers, the ReLU activation function, and a softmax classifier.
They used the ADAM and SGD optimizers to minimize the network loss and improve accuracy.
The proposed classifier was trained and assessed using ISIC 2018 skin lesion
images. [13] proposed a new model
that classifies skin lesions as benign or malignant using a novel regularizer technique.
Their binary classifier could distinguish between benign and malignant lesions in images.
They observed that the AUCs for nevus versus melanoma lesion, seborrheic keratosis
versus basal cell carcinoma lesion, seborrheic keratosis versus melanoma lesion, and solar
lentigo versus melanoma lesion were 0.77, 0.93, 0.85, and 0.86, respectively. Their
model has an average accuracy of 97.49 percent. Their technique benefited doctors in
categorizing various skin lesions. [2] developed a CNN architecture for skin lesion
classification to attain high classification accuracy on dermoscopy images. They employed an
approach that merged the classification layers of four different deep neural network architectures.
In terms of accuracy, their results indicated that their technique beat the other CNNs. In
that study, they developed a new CNN model based on a novel regularizer. Furthermore,
[8] compared the performance of their CNN against 21 board-certified dermatologists using
biopsy-proven clinical images and two critical binary classification use cases. Their deep
learning CNN surpassed dermatologists in detecting skin cancer using dermoscopy and
digital imaging. In addition, [14] suggested a well-performing automated computerized
system, and their technique included learning. They utilized 2000 images from ISIC
2017 and attained an accuracy rate of 93.6 percent.
2.1 Methodology
Labeling the images is the first stage in data preparation. Each image in the ISIC
2020 dataset was labeled with a benign or malignant target. We divided the dataset into
two parts: training and validation. We employed an 80/20 data split, with 80% of images
used for training the pretrained models and 20% used for
validation. The final stage in data preparation was to rescale the pixel values of the
RGB images from the range 0 to 255 to the range 0 to 1. Rescaling
decreased training time and eliminated image pixel inconsistencies.
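The split and rescaling steps described above can be sketched as follows. This is a minimal illustration only: the helper names, the fixed seed, the image identifiers, and the use of a simple shuffled (non-stratified) split are assumptions, since the text does not state how the split was drawn.

```python
import random


def train_val_split(items, train_fraction=0.8, seed=42):
    """Shuffle a copy of `items` and split it into (train, validation)."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def rescale(pixels, max_value=255.0):
    """Map raw 8-bit pixel intensities from [0, 255] to [0, 1]."""
    return [p / max_value for p in pixels]


# Hypothetical image identifiers, used only to demonstrate the split sizes.
image_ids = [f"img_{i:04d}" for i in range(100)]
train_ids, val_ids = train_val_split(image_ids)
print(len(train_ids), len(val_ids))

# One RGB pixel before and after rescaling.
print(rescale([0, 128, 255]))
```

With heavily imbalanced classes, a stratified split (drawing 80% from each class separately) may be preferable; the sketch does not attempt this.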
Data augmentation is a strategy for correcting data imbalance. The most prevalent
approaches to class imbalance are oversampling and undersampling. To correct
the class imbalance, oversampling uses exact duplicates or modified copies of the
original data from the minority class. To apply data augmentation at random, we used
rotation_range=20, width_shift_range=0.2, height_shift_range=0.2, and
horizontal_flip=True. Data augmentation was applied exclusively to the training dataset to keep the
model unbiased, whereas the testing dataset was kept unmodified except for rescaling the
images between 0 and 1.
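As a rough illustration of the paragraph above, the sketch below draws random augmentation parameters from the stated ranges and uses them to oversample a minority class. The key names mirror the arguments of Keras's ImageDataGenerator, which is an assumption about the tooling; the helper functions, the toy sample list, and the choice to balance the classes exactly are hypothetical and not taken from the study.

```python
import random

# Ranges taken from the text. The key names follow Keras's ImageDataGenerator
# arguments, which is an assumption about the tooling that was used.
AUGMENTATION = {
    "rotation_range": 20,       # degrees, sampled in [-20, 20]
    "width_shift_range": 0.2,   # fraction of the image width
    "height_shift_range": 0.2,  # fraction of the image height
    "horizontal_flip": True,
}


def sample_transform(rng, cfg=AUGMENTATION):
    """Draw one random set of augmentation parameters."""
    return {
        "angle": rng.uniform(-cfg["rotation_range"], cfg["rotation_range"]),
        "dx": rng.uniform(-cfg["width_shift_range"], cfg["width_shift_range"]),
        "dy": rng.uniform(-cfg["height_shift_range"], cfg["height_shift_range"]),
        "flip": cfg["horizontal_flip"] and rng.random() < 0.5,
    }


def oversample_minority(samples, rng):
    """Return extra (image_id, label, transform) items that balance the classes.

    Each extra item pairs a re-drawn minority image with a random transform,
    so it would become a modified copy rather than an exact duplicate.
    """
    by_label = {}
    for image_id, label in samples:
        by_label.setdefault(label, []).append(image_id)
    target = max(len(ids) for ids in by_label.values())
    extra = []
    for label, ids in by_label.items():
        for _ in range(target - len(ids)):
            extra.append((rng.choice(ids), label, sample_transform(rng)))
    return extra


rng = random.Random(0)
# Hypothetical toy training set: 8 benign (0) and 2 malignant (1) images.
toy = [(f"b{i}", 0) for i in range(8)] + [("m0", 1), ("m1", 1)]
extra = oversample_minority(toy, rng)
print(len(extra))
```

Only the training samples would be passed through such a routine; the held-out images are left untouched apart from rescaling.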
The 1,000 image categories cover objects we encounter in our daily lives,
such as dog and cat breeds, household products, automobile types, and so on. As a result,
pretrained models are very good at extracting features with high accuracy
while requiring less training time.
2.1.5. ResNet
Earlier designs improved performance by increasing network depth. Deeper networks,
however, suffer from an issue known as the vanishing gradient, which decreases neural network
efficiency. To tackle the vanishing gradient problem, ResNet was proposed, with
the key idea being 'identity shortcut connections' that bypass one or more layers.
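The shortcut idea can be written as y = F(x) + x, where F stands for the bypassed layers. Because the input is added back unchanged, the derivative of y with respect to x contains a constant term of 1, which is what keeps gradients from vanishing as depth grows. The sketch below is a toy version on plain lists; the function names and the two stand-in transforms are illustrative assumptions, not the ResNet layers themselves.

```python
def residual_block(x, transform):
    """Return transform(x) + x element by element (identity shortcut)."""
    fx = transform(x)
    if len(fx) != len(x):
        raise ValueError("the identity shortcut requires matching shapes")
    return [f + xi for f, xi in zip(fx, x)]


def zero_layer(x):
    """Stand-in for bypassed layers that have learned nothing yet."""
    return [0.0 for _ in x]


def halve(x):
    """Stand-in for bypassed layers that apply some learned transform."""
    return [0.5 * v for v in x]


x = [1.0, -2.0, 3.0]
print(residual_block(x, zero_layer))  # the input passes through unchanged
print(residual_block(x, halve))       # the transform is added to the input
```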
2.1.6. MobileNetV2
Google introduced the MobileNetV2 architecture. Because it is lightweight and has
minimal complexity, this architecture is appropriate for mobile devices. Version 1 of
MobileNet featured depthwise separable convolution, whereas version 2 introduced an improved
module called the inverted residual.
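To make the "lightweight" claim concrete, the sketch below compares weight counts for a standard convolution, a depthwise separable convolution, and an inverted residual block (1x1 expansion, depthwise convolution, 1x1 projection). Biases and normalization parameters are ignored, and the channel sizes and the expansion factor of 6 are assumptions chosen for illustration rather than values reported in this study.

```python
def standard_conv_params(kernel, c_in, c_out):
    """Weights in a standard kernel x kernel convolution."""
    return kernel * kernel * c_in * c_out


def depthwise_separable_params(kernel, c_in, c_out):
    """Weights in a depthwise convolution followed by a 1x1 pointwise one."""
    depthwise = kernel * kernel * c_in  # one spatial filter per input channel
    pointwise = c_in * c_out            # 1x1 convolution that mixes channels
    return depthwise + pointwise


def inverted_residual_params(kernel, c_in, c_out, expansion):
    """Weights in a 1x1 expand, depthwise, 1x1 project sequence."""
    hidden = c_in * expansion
    expand = c_in * hidden
    depthwise = kernel * kernel * hidden
    project = hidden * c_out
    return expand + depthwise + project


# Assumed example sizes: 3x3 kernel, 32 input channels, 64 output channels.
print(standard_conv_params(3, 32, 64))        # 18432
print(depthwise_separable_params(3, 32, 64))  # 2336
print(inverted_residual_params(3, 32, 32, expansion=6))  # 14016
```

For these sizes the depthwise separable layer needs roughly one eighth of the weights of the standard convolution.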
3. Results
3.1. Dataset
Dermatologists classified benign and malignant skin tumors into nine categories. Figure 2 depicts
eight of these categories; the unknown class was not included.
86.5% of the individuals with moles had nevi, which were classed as benign. Melanoma
cases were classed as malignant. Table 1 shows how the nine diagnosis types are
divided between the benign and malignant classes. Figure 3 shows sample images of the benign
and malignant classes.
Diagnosis       Benign   Malignant
solar lentigo   7        0
unknown         27124    0
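The benign/malignant assignment described above can be sketched as a simple mapping from diagnosis to a binary target. The sketch assumes that melanoma is the only diagnosis mapped to the malignant class, as the text suggests; the function name and the toy records are hypothetical and do not reproduce the dataset counts.

```python
from collections import Counter


def to_binary_target(diagnosis):
    """Return 1 (malignant) for melanoma and 0 (benign) otherwise.

    Assumption: melanoma is the only diagnosis treated as malignant.
    """
    return 1 if diagnosis == "melanoma" else 0


# Hypothetical toy records, not the real dataset.
records = ["nevus", "melanoma", "unknown", "solar lentigo", "nevus"]
counts = Counter((d, to_binary_target(d)) for d in records)

# Print a small diagnosis-by-class table.
for diagnosis in sorted(set(records)):
    benign = counts[(diagnosis, 0)]
    malignant = counts[(diagnosis, 1)]
    print(f"{diagnosis:15s} {benign:6d} {malignant:6d}")
```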
Figure 4 depicts the whole experimental setup. We performed data preparation first, followed by
data augmentation in the second stage. Three pretrained models were then chosen for training. We
reused all layers of each pretrained model except the output layer, and added a flatten layer,
a dense layer, a dropout layer, and a final output layer. After training concluded, we used
accuracy to evaluate the performance of our trained models.
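The added layers and the accuracy metric can be sketched in plain Python as follows. This is a toy forward pass under stated assumptions, not the trained network: the layer sizes, the ReLU activation, the dropout rate of 0.5, the single sigmoid output unit, the 0.5 decision threshold, and the random weights are all hypothetical, since the text does not report them.

```python
import math
import random


def flatten(feature_map):
    """Flatten a nested H x W x C list into one feature vector."""
    return [v for row in feature_map for pixel in row for v in pixel]


def dense(x, weights, bias):
    """Fully connected layer: one weight row and one bias per output unit."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(x):
    return [max(0.0, v) for v in x]


def dropout(x, rate, rng, training):
    """Inverted dropout: active only during training."""
    if not training:
        return list(x)
    keep = 1.0 - rate
    return [v / keep if rng.random() < keep else 0.0 for v in x]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def head_forward(feature_map, params, rng, training=False, rate=0.5):
    """Flatten -> dense + ReLU -> dropout -> single sigmoid output."""
    x = flatten(feature_map)
    x = relu(dense(x, params["w1"], params["b1"]))
    x = dropout(x, rate, rng, training)
    logit = dense(x, params["w2"], params["b2"])[0]
    return sigmoid(logit)


def accuracy(probabilities, labels, threshold=0.5):
    """Fraction of samples whose thresholded prediction matches the label."""
    predictions = [1 if p >= threshold else 0 for p in probabilities]
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


rng = random.Random(0)
# Hypothetical 2 x 2 x 2 feature map standing in for the pretrained output.
feature_map = [[[rng.random() for _ in range(2)] for _ in range(2)]
               for _ in range(2)]
n_in, n_hidden = 8, 4
params = {
    "w1": [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
           for _ in range(n_hidden)],
    "b1": [0.0] * n_hidden,
    "w2": [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]],
    "b2": [0.0],
}
probability = head_forward(feature_map, params, rng)
print(0.0 < probability < 1.0)
print(accuracy([0.9, 0.2, 0.6, 0.4], [1, 0, 0, 0]))  # 0.75
```

In practice the pretrained layers would supply the feature map, and only the weights of the added layers would need to be learned from scratch.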
[Figure 4: experimental setup, with stages Data Preprocessing, Data Augmentation, and Pretrained Models]
Model         Training accuracy
VGG16         0.9912
MobileNetV2   0.9976
4. Discussion
Using a large dataset of 33,126 images, we achieved a validation accuracy of
98.39%. Using the pretrained model AlexNet, the authors of [15] and [16] attained accuracies
of 0.7853 and 0.9086, respectively. Similarly, in [15], ResNet, VGGNet, and Xception were used to
obtain accuracies of 0.9208, 0.8870, and 0.9030, respectively. Table 3 displays testing accuracy
from the literature. The greatest accuracy across all of these pretrained models was
0.9420 in [17]. Each of these architectures used a single pretrained model combined with
stages such as data augmentation and data standardization. [16] employed a
hybrid of AlexNet, a deep convolutional neural network (DCNN), and a support vector machine
(SVM). However, none of these designs outperformed our attained accuracy. Furthermore,
we tackled the challenges of overfitting and data preparation while minimizing the
need to develop models from scratch.
5. Conclusions
Melanoma is one of the most serious skin malignancies, shortening people's lives
dramatically. However, early identification can prevent many of its complications. AI can help
detect this sort of cancer at an early stage. In this work, we employed pretrained models and
compared their performance using accuracy as the evaluation metric. We used
RMSProp, SGD, and ADAM to optimize the models. Pretrained models were employed
to get the maximum accuracy while spending the least amount of time creating models
from scratch. Furthermore, we addressed the issue of overfitting and offered alternative
data processing techniques with dataset insights. We achieved a validation accuracy of
98.39%, outperforming the performance of prior pretrained models despite the need for a
complex model. The findings of this study can be applied in medical science to help physicians
diagnose skin cancer early and save lives.
