Breast Cancer Mass Detection in Mammograms Using Gray Difference Weight and MSER Detector

Page created by Roy Moreno
 
CONTINUE READING
Breast Cancer Mass Detection in Mammograms Using Gray Difference Weight and MSER Detector
SN Computer Science (2021) 2:63
https://doi.org/10.1007/s42979-021-00452-8

    ORIGINAL RESEARCH

Breast Cancer Mass Detection in Mammograms Using Gray Difference
Weight and MSER Detector
B. V. Divyashree1 · G. Hemantha Kumar1

Received: 29 August 2020 / Accepted: 5 January 2021 / Published online: 30 January 2021
© The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. part of Springer Nature 2021

Abstract
Breast cancer is a deadly and one of the most prevalent cancers in women across the globe. Mammography is widely used
imaging modality for diagnosis and screening of breast cancer. Segmentation of breast region and mass detection are crucial
steps in automatic breast cancer detection. Due to the non-uniform distribution of various tissues, it is a challenging task to
analyze mammographic images with high accuracy. In this paper, background suppression and pectoral muscle removal are
performed using gradient weight map followed by gray difference weight and fast marching method. Enhancement of breast
region is performed using contrast limited adaptive histogram equalization (CLAHE) and de-correlation stretch. Detec-
tion of breast masses is accomplished by gray difference weight and maximally stable external regions (MSER) detector.
Experimentation on Mammographic Image Analysis Society (MIAS) and curated breast imaging subset of digital database
for screening mammography (CBIS-DDSM) show that the method proposed performs breast boundary segmentation and
mass detection with best accuracies. Mass detection achieved high accuracies of about 97.64% and 94.66% for MIAS and
CBIS-DDSM dataset, respectively. The method is simple, robust, less affected to noise, density, shape and size which could
provide reasonable support for mammographic analysis.

Keywords Breast cancer · Mammography · Gradient weight map · Gray difference weight · Fast marching · De-correlation
stretch · MSER detector

Introduction                                                                  computer-based diagnosis of mammography, it is impor-
                                                                              tant to highlight breast features as the radiologist decision
Breast cancer is the most common cancer among women                           depends on the clinical evidences on the mammograms.
after 40 years and demands early detection which reduces                      Image segmentation is a crucial part in automatic detec-
the mortality rate [1, 2]. Among various imaging techniques                   tion of breast cancer which involves background suppres-
used for breast cancer screening and early diagnosis, mam-                    sion (skin–air boundary), identification of pectoral muscle
mography, a low dose X-ray imaging technique, reveals the                     boundary, breast region extraction and detection of abnormal
possibility of presence of breast cancer from the surrounding                 regions. Masses are the most commonly found as one of the
infiltrated breast tissues [3–8]. Although having limitation,                 abnormalities in mammograms. Masses are uncontrolled
mammography has 85–90% sensitivity for the detection of                       growth of tissues in the breast having some abnormal shape
breast cancer and remains as an effective screening tool for                  and contour.
the detection of abnormalities [9–11].                                           Existing studies on image segmentation focusing on
   The accurate detection of breast cancer in mammograms                      breast region and pectoral muscle segmentation, adopted
is difficult for the radiologist because of errors in imag-                   various techniques like thresholding methods, contour-
ing condition, overlapped cancer and normal tissues. In                       based methods, region growing methods and so on. Then,
                                                                              the detection of masses is a complex task as the computer-
                                                                              aided diagnosis (CAD) must deal with the broad range of
* B. V. Divyashree
  divyashreenivas@gmail.com                                                   possibility like its density (fatty, glandular, dense type),
                                                                              shape, sizes and margins of masses. The vast literature report
1
     Department of Studies in Computer Science, University                    is presented in Table 1.
     of Mysore, Manasagangotri, Mysore, Karnataka 570006,
     India

                                                                                                                   SN Computer Science
                                                                                                                         Vol.:(0123456789)
Breast Cancer Mass Detection in Mammograms Using Gray Difference Weight and MSER Detector
Table 1  Existing literature and their advantages and limitations
                      Literature details                                          Method used                                                   Advantages and limitations

                      Chen et al. [12]                                            Region growing                                                Obtained good accuracy in identifying breast boundary. However,
                                                                                                                                                                                                                       63 Page 2 of 13

                      Segmentation of breast region                                                                                              the boundary obtained for pectoral suppression is lower than the
                                                                                                                                                 actual boundary
                      Rampun et al. [13]                                          Region-based active contour model                             Computational complexity is simple. But, the boundary obtained
                      Estimation of breast boundary                                                                                              was lower than the actual boundary and slower processing time

SN Computer Science
                      Shi et al. [14]                                             Unsupervised pixel-wise labeling and texture filters          The method motivated by achieving good accuracies. But, the
                      Breast boundary segmentation and calcification detection                                                                   segmentation errors could be further avoided
                      Makandar et al. [15]                                        Adaptive thresholding method, watershed transformation and    The method promised high accuracy which could be further
                      Segmentation of masses in breast region                      contour-based method                                          improved
                      Rahmati et al. [16]                                         Maximum likelihood active contour method                      Showed robustness for choosing the seed point and possess high
                      Segmentation of masses                                                                                                     segmentation accuracy. The main problem of this method is that,
                                                                                                                                                 it needs expert intervention and is time consuming
                      Neto et al. [17]                                            Swarm optimization, area filters and texture descriptors      Reduced false positives. However, the method could not detect
                      Segmentation of masses                                                                                                     small masses in non-dense breast density type and failed to
                                                                                                                                                 detect mass in case of dense breast density types
                      Soulami et al. [18]                                         Meta-heuristic algorithm electromagnetism-like optimization   Unable to remove the muscle completely in some cases and hence
                      Segmentation of masses                                       (EMC)                                                         achieved less accuracy in segmentation of mass
                      Anitha et al. [19]                                          Dual stage adaptive thresholding (DuSAT) method               Achieved good average sensitivity, But, lowered its performance in
                      Segmentation of abnormality                                                                                                speculated type of lesions with less sensitivity
                      Wang et al. [20]                                            Gestalt psychology and morphological characteristics          More efficient and innovative data analysis method. But, the
                      Detection of mass                                                                                                          method faced difficulty in generalizing the parameters for differ-
                                                                                                                                                 ent dataset
                      Geraldo et al. [21]                                         Diversity analysis, geostatistical and concave geometry       Though the method was robust to different datasets, by altering
                      Detection of masses                                                                                                        filter parameters for suspicious regions reduced the false-positive
                                                                                                                                                 rate, but noticed misclassification results
                      Agarwal et al. [22]                                         Deep learning                                                 Efficient, shown improved performance with high sensitivity and
                      Detection of masses                                                                                                        low false-positive rates. But, less sensitive to small masses and
                                                                                                                                                 needs large datasets
                      Suresha et al. [23]                                         Modified fuzzy C-means clustering and deep neural network     Classification accuracy is improved. But, considered single dataset
                      Segmentation, detection and classification of masses
                      Zeiser et al. [24]                                          Deep learning                                                 Able to detect masses in high-density images. But, detected mass
                      Segmentation of masses in mammograms                                                                                        is smaller than the actual mass. Also, could not detect the masses
                                                                                                                                                  in images that have density similar to other breast tissue
                      Vikhe and Thool [25]                                        Wavelet processing and adaptive threshold technique           Simple, computation time used is more
                      Mass detection in mammographic image
                      Divyashree et al. [26]                                      Texture feature extraction and KNN classifiers                The method considers whole breast region for classification
                      Segmentation and classification of mammograms                                                                               instead of particular abnormal region of interest and obtained
                                                                                                                                                  satisfactory outputs
                      Divyashree et al. [27]                                      Thresholding method and quad tree decomposition               Simple method, very well taken care of information loss during
                      Locating the region of interest in mammograms                                                                               segmentation of breast region and suitable for subtle masses.
                                                                                                                                                  But, failed in locating large masses
                                                                                                                                                                                                                       SN Computer Science (2021) 2:63
Breast Cancer Mass Detection in Mammograms Using Gray Difference Weight and MSER Detector
SN Computer Science (2021) 2:63                                                                                  Page 3 of 13 63

   Many of the studies in the state-of-the-art developed com-   masses can be extracted from the breast region that helps
puter-aided design system for mammographic segmentation,        radiologist’s decision for further clinical references.
detection and classification. Presently, there is a renowned
need to reduce the segmentation errors noticed in dense         Dataset
and extremely dense breast density types. Also, developing
robust model for all kinds of masses including circumscribed    For experimentation of the proposed work, Mammogram
and speculated type of lesions is still challenging. Hence,     Image Analysis Society (MIAS) database and curated breast
in this paper, an automated method is proposed to first seg-    imaging subset digital database for screening mammogra-
ment the breast region using gradient weight, gray difference   phy (CBIS-DDSM) databases are considered [28, 29]. These
weight and fast marching method. Then, enhancement of           datasets are publicly available benchmark datasets that
breast region is accomplished using contrast limited adaptive   contains 322 images and 2620 images, respectively. CBIS-
histogram equalization (CLAHE) and de-correlation stretch       DDSM dataset is standardized and updated version of digi-
followed by detection of mass using MSER detector. The          tal database for screening mammography (DDSM). From
main contribution of the paper includes estimation of breast    CBIS-DDSM dataset out of 2620 images, randomly selected
region precisely and detection of mass accurately using         200 images are considered. The images in the MIAS dataset
MSER detector that is suitable for varied size and shape.       have size of 1024 × 1024 in “Portable Gray Map” (PGM)
   The remaining part of the paper is organized as; In          format; whereas, the images in the CBIS-DDSM dataset are
“Materials and Methods”, datasets are explained followed        in DICOM format with varied size.
by detailed description of methods used to accomplish breast
region segmentation and detection of mass. In “Results”,        Background Suppression
to show the performances of the proposed method, experi-
mental results are presented. Lastly, discussions and future    Gradient Weight‑Based Segmentation
improvements of the proposed paper are presented.
                                                                A certain number of digital images in dataset contain differ-
                                                                ent artifacts similar to dense regions in the breast. In our pre-
                                                                vious work, to obtain the precise region of interest (skin–air
Materials and Methods                                           boundary), gradient weight map method is adopted [30].
                                                                The weights are computed for each pixel based on gradient
This paper proposes a more efficient and robust model for       magnitude using 3 × 3 windows. Then, the image is com-
image segmentation. The model performs background sup-          plemented to highlight the skin–air boundary followed by
pression, segmentation of pectoral muscle, segmentation         binarization. Although binarization reduces the processing
of breast region and detection of mass in the breast region     time, it does not lose any useful information in the breast
of mammograms. The workflow of the proposed model               region as binarization is adopted in the proposed work only
is shown in Fig. 1. From the integrated methodology, the        to segment the region of interest (skin–air boundary). The

Fig. 1  Workflow of the proposed model

                                                                                                       SN Computer Science
Breast Cancer Mass Detection in Mammograms Using Gray Difference Weight and MSER Detector
63 Page 4 of 13                                                                                           SN Computer Science (2021) 2:63

invalid connected components present in the background               difference between representative value and the intensity
region were eliminated using area opening operation with             of the pixel. If the difference between pixels is small, then
predefined threshold. Further, the largest connected com-            the gray difference weight assigns larger weights; and if
ponent is extracted which is a skin–air boundary (breast             the difference is large, the pixel weight assigned is small.
region and pectoral region). Final refinement is performed              Also, threshold is fixed on the intensity difference val-
with morphological opening operation, where the square and           ues to suppress the output weights more than the cutoff
rectangular regions fewer than the threshold are removed.            (cutoff = 25) and assigns least output weight values to
The skin–air boundary detected image is shown in Fig. 2.             those pixels. This cutoff value restricts the pectoral region
The segmented breast region plus pectoral region will then           from being added with some parts of breast region. The
be interpreted by the next level of segmentation.                    gray-scale intensity difference array obtained was not hav-
                                                                     ing significant information about the actual transitions
Breast Region Segmentation                                           between the boundaries. To solve this issue, a numerical
                                                                     technique, Fast marching method is adopted that tracks the
Pectoral Muscle Segmentation Using Gray Difference                   topological changes of boundaries and displays segmented
Weight and Fast Marching Method                                      binary image. The resultant image provided the pectoral
                                                                     muscle and some isolated areas from the breast region
Since the tissues present in the pectoral muscle and breast          (Fig. 3b, e). These areas are ignored using morphological
region are alike, to find abnormal tissues in the breast             erosion operation and provided segmented pectoral muscle
region, the pectoral muscles need to be removed. Trian-              (Fig. 3c, d).
gular pectoral muscles appear in the rightmost corner or
in the leftmost corner of the medio-lateral oblique (MLO)
views of the mammogram. The pectoral muscles present                 Segmentation of Breast Region Using Exclusive‑OR
in the right side are flipped to the left. To segment the            Operation
pectoral muscle, gray difference weight method adapted
computes weights for each pixel in the background image              To segment only breast region, the pectoral muscle seg-
wherein the reference gray-scale intensity value (repre-             mented image (P) and the skin–air boundary detected
sentative value) is the average of the pixel intensity values        image (S) undergo bitwise XOR operation. The resultant
of pixel locations that is specified by vectors of seed point.       image is set to maximum value if value P or S occurred,
This seed point is chosen empirically. The pixel weights             but not both. The segmented breast region is as shown in
are computed for every pixel that is the absolute value of           Fig. 4.

Fig. 2  Background suppressed images of MIAS (first row) and CBIS-   Fig. 3  Pectoral muscle segmented images of MIAS (first row) and
DDSM (second row), a, d are original images, b, e are background     CBIS-DDSM (second row), a, d are original background suppressed
suppressed image after gradient operation and c, f are background    images, b, e are gray difference weight images and c, f are segmented
suppressed image in original form                                    pectoral muscle images

SN Computer Science
SN Computer Science (2021) 2:63                                                                               Page 5 of 13 63

Fig. 4  Breast region segmented
image of MIAS (first row) and
CBIS-DDSM (second row), a,
e are background suppressed
image, b, f are pectoral muscle
segmented images, c, g are
breast region segmented images
after bitwise XOR operation
and d, h are breast region seg-
mented outputs in original form

Segmentation of Masses in Breast Region                         are assigned for pixels having large difference. The output
                                                                of gray difference image is as shown in Fig. 6b. Later,
Image Enhancement                                               MSER detector is adopted to detect the masses in the
                                                                mammograms. MSER is a blob detection method that
Since the invisible abnormal tissues reside in the dense        extracts various co-variant regions in an image. MSER
normal tissues, finding the masses in the breast region is a    regions of an image are connected components wherein
challenging task. Hence, to improve the visibility, combina-    local intensity remains stable for a large set of thresh-
tion of CLAHE and de-correlation stretching methods were        old values. The MSER detector first sorts pixels based
used. The CLAHE is a modified adaptive histogram equali-        on the intensity level, finds candidates for each intensity
zation (AHE) technique and both implements equalization         level and determines the representative for each level.
except CLAHE uses contrast amplification limiting method        Since the malignant masses are hidden inside the dense
that reduces the noise in the image. De-correlation stretch     breast tissues, maximum intensity values are considered
enhances the regions in an image having intensity variations    as suspicious regions. By the adoption of MSER detector,
to a wide range using band-band correlation. The pixels of      only suspicious regions are detected. Finally, the sizes are
those regions are stretched to equalize the band variances.     tracked and the growth rate is monitored for local mini-
The combination of both CLAHE and de-correlation stretch        mums and displays the pixels belonging to MSER detector
methods yielded better visual interpretation with better fea-   as output (Fig. 6c, d).
ture discrimination. The enhanced mammographic image is             The aim of the proposed paper is to detect the masses
shown in Fig. 5.                                                in the mammogram, but MSER detector adopted detects
                                                                suspicious region rather than mass. Hence, the algorithm
Suspicious Region and Mass Detection                            needs two iterations of image enhancement and MSER
                                                                detection. The suspicious region image is again passed
In the presented paper, to detect the masses present in         through the enhancement using CLAHE, gray differ-
the breast region, gray difference weights are employed         ence weight and MSER detector algorithms. This is per-
[11], wherein weights are computed for each pixel based         formed to eliminate some more unwanted regions that
on intensity difference. Since the masses have the highest      are present in the suspicious regions. The resultant mass
intensity value at its center and decreases outwards (i.e.,     detected image is as shown in Fig. 6f. The features are
towards boundary of mass), highest intensity value is cho-      first extracted in breast region then in suspicious region to
sen as representative value. Larger weights are assigned        maximize the accuracy and reliability.
for the pixels having small difference and smaller weights

                                                                                                     SN Computer Science
63 Page 6 of 13                                                                                          SN Computer Science (2021) 2:63

Fig. 5  Enhanced images of
breast region of MIAS (first
row) and CBIS-DDSM (second
row), a, d are original breast
region segmented images, b, e
are CLAHE images and c, f are
de-correlation stretch images

                                                                         images in MIAS datasets and 200 images in CBIS-DDSM
                                                                         dataset. The CBIS-DDSM dataset provides both CC view
                                                                         and MLO view images, including calcification and mass
                                                                         images. There are 292 MLO view mass images in the data-
                                                                         set. Hence we randomly selected 200 images that contain
                                                                         mass for the validation of the proposed method.

                                                                         A. Breast boundary segmentation in both MIAS and CBIS-
                                                                            DDSM images
                                                                         	  For the detection of masses in the mammogram, back-
                                                                            ground suppression, pectoral muscle removal, segmenta-
                                                                            tion of breast region and suspicious regions segmenta-
                                                                            tion are to be accomplished in sequence. The algorithm
                                                                            is then validated for its consistency and robustness on
                                                                            two different datasets (MIAS and CBIS-DDSM). In
                                                                            the first stage, proposed method is evaluated for breast
Fig. 6  Detection of masses in breast region, a is original image with      region segmentation. Breast region is the area accom-
mass in breast region, b is image with ground truth on mass, c MSER
(I iteration), d MSER (II iteration), e detected mass and f detected        plished after suppressing background region (skin–air
mass inside the ground truth ring                                           boundary) and pectoral muscle. For performance evalu-
                                                                            ation of breast region segmentation, breast region, back-
                                                                            ground region and pectoral muscle regions are termed
Results                                                                     as Br, Bg and Pm, respectively. Both the datasets con-
                                                                            tain ground truths with respect to mass detection but,
To evaluate the proposed work, MIAS datasets and CBIS-                      ground truths for the segmentation of breast region are
DDSM datasets are considered. For breast region segmen-                     not available in the datasets. So, ground truths for seg-
tation, the experimentation is conducted on all the 322                     mentation of breast regions are manually marked under
                                                                            the guidance of expert radiologist and compared with

SN Computer Science
SN Computer Science (2021) 2:63                                                                                       Page 7 of 13 63

  the obtained segmentation results. As per the radiolo-                  ground truths. Meanwhile, pectoral muscle segmenta-
  gist suggestion, the results obtained are divided into                  tion and removal algorithms performed well for good-
  well-segmented, under-segmented, over-segmented and                     quality images, i.e., the segmented boundary was along
  mis-estimated images. Segmentation result that matches                  the ground truth. However, there are some problems in
  the real boundary almost exactly are well-segmented                     few of images which had no significant contrast between
  images, segmented result that are slightly lower than the               the pectoral muscle and the breast region, more glandu-
  real boundary considered as under-segmented images,                     lar images with shadowing effect, multilayered pectoral
  segmented result that adds extra outside regions nearby                 muscle. The over- and under-segmented problems iden-
  apart from the real boundary are over segmented images                  tified reflect minor segmentation errors. The proposed
  and segmented result that mismatches to large extent are                method could detect boundaries in all types of densities
  considered as the mis-estimated images.                                 for varied shape and size. In addition to this, selection
	 Figure 7 shows the original images marked with                          of seed point for varied sizes is a challenging issue. The
  ground truths and the obtained segmented images. Fig-                   algorithm was experimented repeatedly on all the sizes
  ure 7a–c shows examples of the well-segmented images,                   of densities and empirically chosen the seed point for
  under-segmented images and over-segmented images.                       pectoral muscle segmentation. For simplicity, flipped
	  To describe about the images, the skin–air boundaries                  the right MLO view to the left MLO view and fixed
  detected in all the test samples were almost similar to the             the seed point for single MLO view. There were three
                                                                          images in the dataset which do not have pectoral muscle
                                                                          at all, hence those images are ignored from this level
                                                                          of segmentation. The absolute gray difference threshold
                                                                          maintained for the segmentation of pectoral muscle is 25
                                                                          which could able to suppress all other intensity values
                                                                          except pectoral muscle intensity values and the threshold
                                                                          value remains same for both the datasets.
                                                                        	  Besides MIAS dataset, the proposed segmentation
                                                                          algorithm could be applied on CBIS-DDSM dataset
                                                                          also. Both MIAS images and CBIS-DDSM images are
                                                                          gray-scale images having same bit depth and resolution
                                                                          with the only difference being their size. The segmenta-
                                                                          tion methodology is same for both datasets, only change
                                                                          is that the CBIS-DDSM images needs single iteration of
                                                                          MSER detector for mass detection.
                                                                        	  To evaluate the accuracy of segmentation, three met-
                                                                          rics sensitivity (TPR), specificity (TNR), false-positive
                                                                          rate (FPR) and correctness (PPV) were considered. The
                                                                          metrics are defined as follows,
                                                                                             TP
                                                                           Sensitivity =           ,                             (1)
                                                                                           TP + FN

                                                                                             TN
                                                                           Specificity =           ,                             (2)
                                                                                           TN + FP

                                                                                                      FP
                                                                           False positive rate =           ,                     (3)
                                                                                                   FP + TN

                                                                                              TP
                                                                           Correctness =            ,
                                                                                            TP + FP

                                                                                                TP + TN
Fig. 7  Examples of breast region segmentation outputs in which first      Accuracy =                        .                   (5)
                                                                                           TP + FP + TN + FN
two rows are a samples of well-segmented images, third row is b
sample of under-segmented images and last row is c sample of over-
segmented images

                                                                                                               SN Computer Science
63 Page 8 of 13                                                                                        SN Computer Science (2021) 2:63

	  For further evaluation, Jaccard index [31] and Dice                side regions of the segmented images, respectively. The
  similarity coefficients [32] were considered. For cal-              over- and under-segmented errors caused by gray dif-
  culating the similarity index, metrics are defined for              ference weight and fast marching method resulted in
  simplicity as follow, Ss for segmented output of the pro-           imprecise detection of breast boundary. Besides, the
  posed method and Sg for the ground truth image:                     dataset contains images which are extremely dense
                                                                      breast density types and had poor visualization due to
                  |Ss ∩ Sg|
    Jaccard =               ,                                (6)      uneven intensity distributions at the borders of pecto-
                  |Ss ∪ Sg|                                           ral muscle. These images are considered as more noisy
                                                                      images and the algorithm mis-estimated those breast
                |Ss ∪ Sg|                                             regions with less accuracy. Though the accuracy is
    Dice = 2               .                                 (7)
               |Ss| + |Sg|                                            maximum (97.68%) in case of well-segmented images,
                                                                      the decrease in accuracies of under-, over-segmented and
	  The experimental results are listed in Table 2 com-                mis-estimated images dropped the overall accuracy to
  paring segmented output obtained from the proposed                  94.12%.
  method with the ground truth of MIAS dataset.                     	  Besides, the method produced Jaccard similarity ratio
	  In terms of segmentation of breast region, sensitivity             of 94.15% and dice = 96.99% which is slightly smaller
  represents the ratio of number of true-positive pixels to           than the accuracy metric considered. The reason is
  the number of pixels in true breast regions; specificity            that those two-similarity metrics are sensitive to shape
  represents the ratio of number of true-negative pixels              changes between ground truths and segmented output
  to the number of pixels marked as non-breast region in              than calculating overlapping area ratios. Jaccard indexes
  the true background; false-positive rate represents the             are low in all the cases than dice similarity as the union
  ratio of number of false-positive pixels to the number              of ground truth and the segmentation area were stable
  of pixels marked as non-breast region in the true back-             for different cases. Hence, proposed method is robust
  ground; and correctness is the average portion of cor-              to different quality of images with respect to dice coef-
  rectly detected breast region pixels.                               ficient.
	  In Table 2, the average values of sensitivity, specific-         	  Similarly, Table 3 depicts that the proposed method
  ity, false-positive rates, correctness and accuracies are           also resulted with accuracy of 90.38% on 200 images
  displayed. In case of over- and under-segmented images,             from CBIS-DDSM dataset, indicating the accuracy drop
  the ground truth edges are located at the inside and out-

Table 2  Results for MIAS                                 Well-segmented   Under-segmented   Over-segmented   Misestimated    Overall
dataset
                                Number of images          225              43                45               10              322
                                Sensitivity (%)           98.53            99.47             99.14            65.61           90.68
                                Specificity (%)           92.15            91.22             93.33            34.92           77.70
                                False-positive rate (%)   2.28             9.35              7.42             3.49            5.63
                                Correctness (%)           97.37            96.44             93.92            80.11           91.96
                                Accuracy (%)              97.68            94.11             94.64            90.28           94.12
                                Jaccard (%)               94.15            90.96             89.73            85.57           90.10
                                Dice (%)                  96.99            93.02             92.31            88.57           92.72

Table 3  Results of CBIS-                                 Well-segmented   Under-segmented   Over-segmented   Mis-estimated   Overall
DDSM dataset
                                Number of images          145              18                17               20              200
                                Sensitivity (%)           99.42            88.09             83.08            67.32           84.47
                                Specificity (%)           94.01            86.26             79.83            42.74           75.71
                                False-positive rate (%)   7.40             6.26              7.98             4.27            6.47
                                Correctness (%)           94.01            86.26             79.83            42.74           75.71
                                Accuracy (%)              94.88            90.92             89.95            85.80           90.38
                                Jaccard (%)               89.68            88.26             87.09            81.45           86.62
                                Dice (%)                  92.86            83.63             82.68            87.94           86.77

SN Computer Science
SN Computer Science (2021) 2:63                                                                             Page 9 of 13 63

   compared to the other dataset. The reason is inconsist-        compared with ground truths for location and boundary
   ency in size and varied noises in CBIS-DDSM images.            of mass.
B. Mass detection for abnormality quantification                	  Segmentation results depicted in Fig. 8 shows the
	  The other major focus in the present paper is on the           mass detection outputs in MIAS dataset wherein, each
   detection of masses based on gray difference weight            row contains from left to right original image, ground
   and MSER detector. For the validation of detection of          truth circled image, MSER detector output at first itera-
   masses in MIAS dataset, 85 images were used and in             tion, MSER detector output at second iteration, mass
   CBIS-DDSM dataset, 200 images were used. Ground                detected image and mass located inside the circled
   truths are provided in the dataset in the form of x, y co-     image. The segmented output is inside the ring and
   ordinates and radius in MIAS dataset. From the ground          hence, it is the success of the proposed method. Figure 8
   truth information, a ring is plotted over the image.           includes segmented circumscribed, speculated and other
   The algorithm is considered as successful only if, the         ill-defined mass outputs obtained from the proposed
   segmented image obtained from the proposed method              method.
   appeared inside the ground truth ring, In CBIS-DDSM          	  The images shown in third column are obtained after
   dataset, ground truth images are provided in the form of       enhancement, gray difference weight and MSER detec-
   binary images. For this dataset, the segmented image is        tion (I iteration). The proposed method is tested on all

Fig. 8  Examples of mass detection in MIAS dataset

                                                                                                   SN Computer Science
63 Page 10 of 13                                                                                SN Computer Science (2021) 2:63

  types of densities that includes fatty, glandular and dense     racy after experimentation. Hence, to improve the effi-
  type. The contrast enhancement limits are empirically           cacy, we resized the CBIS-DDSM images to 256 × 256.
  chosen as 0.05 which remains constant for all types of          Whereas, the MIAS dataset images bear uniform size
  mammographic images. In addition to contrast enhance-           and yielded best accuracy without undergoing resiz-
  ment, the tolerance value chosen for de-correlation             ing. The proposed method is directly applied to CBIS-
  stretch is 0.01. Since masses have maximum intensity,           DDSM images and is able to obtain good results. Only
  the representative value is also maximum and the abso-          difference is that for CBIS-DDSM dataset, results in the
  lute gray-scale intensity difference threshold chosen for       single iteration of MSER detector have been obtained.
  gray difference weight is 14 and it may vary from 10            Some of the images exhibited few unexpected extra
  to 20 for more noisy images (extremely dense breast             regions. Those regions are eliminated by binarizing the
  density type images with poor visualization). The first         image to specified level (0.7) and the binarization level
  iteration of MSER detector retains extra regions other          varies from 0.7 to 0.85 for more noisy images. This
  than masses. Forth column represents the MSER detec-            small change is due to the different imaging conditions
  tion of second iteration wherein it eliminates all other        adopted while taking mammography. Contrast limits,
  regions except masses. Masses obtained from the pro-            tolerance value and absolute gray difference threshold
  posed method are displayed in fifth column and the sixth        maintained are 0.5, 0.01 and 25, respectively.
  column shows masses encircled with the ground truth           	  The MIAS dataset contains the ground truth in the
  information provided in the dataset.                            form of radius and coordinate points. Hence, circle is
	 Figure 9 shows the results obtained for CBIS-DDSM               plotted using ground truth on the original images and
  dataset wherein first and second images of each figure          the validation of the algorithm is performed based on
  are original image and ground truth marked on origi-            whether the output mass obtained from the proposed
  nal image, respectively. Third image is obtained after          method exists inside the circle or not. Table 3 is tabu-
  applying enhancement and gray difference weight. Forth          lated making use of Eq. (8). CBIS-DDSM dataset con-
  image is the MSER detected image and the last image is          tains ground truths that are provided in the form of
  the mass detected image.                                        binary mass region and square shaped original patch
	  Since CBIS-DDSM dataset has images in various                  region. Hence, the proposed method evaluated this
  sizes, the images are normalized. First, the images are         dataset similar to MIAS dataset and also measured per-
  resized to 1024 × 1024 and noticed the drop in accu-

Fig. 9  Sample images of mass detection in CBIS-DDSM dataset

SN Computer Science
SN Computer Science (2021) 2:63                                                                                     Page 11 of 13 63

Table 4  Experimental results on detection of masses                   	  Though the difference between centroids of the
Dataset      Number Number Number          Detected      Accuracy        ground truth mass and the obtained mass is less, focus
used         of images of masses of masses masses        %               is needed to identify some missing areas at the bound-
                                 unlisted  listed                        ary of the detected masses. Hence along with the overall
                                           (true                         detection accuracy, we have experimented to validate
                                           positive)
                                                                         the boundaries of the detected masses in multiple ways
MIAS          85          85         2           83      97.64           to provide evidence of the consistency of the proposed
CBIS-        200         200         8          192      94.66           method. The segmentation results were evaluated for
 DDSM                                                                    CBIS-DDSM dataset for detected mass considering sen-
                                                                         sitivity, specificity and accuracy as parameters using the
                                                                         Eqs. (1), (2) and (5). From the table, we can notice that
                                                                         accuracy of the segmentation of mass is slightly lower
Table 5  Experimental results on segmentation of masses in CBIS-
                                                                         than the overall detection accuracy of mass and they are
DDSM dataset
                                                                         independent of each other.
Number of       Sensitivity    Specificity    Accuracy   Dice          	  In Table 4, segmentation results for masses are dis-
images          (%)            (%)            (%)        coefficient
                                                         (%)
                                                                         played for CBIS-DDSM dataset. Nevertheless, for MIAS
                                                                         dataset, detection accuracy is computed and segmenta-
200             91.21          90.27          94.32      91.46           tion accuracy for mass detection is not computed. The
                                                                         reason is, the dataset contains radius and coordinate
                                                                         points of the mass which is a ground truth for detection
  formance using sensitivity, specificity, precision and                 of mass, and it does not contain ground truth in terms of
  accuracy parameters (Tables 4, 5).                                     segmentation of mass. The average sensitivity, specific-
	  The position of every mass detected undergo a com-                    ity, accuracy and dice similarity coefficient obtained for
  parison with its corresponding ground truth. The loca-                 CBIS-DDSM dataset from the proposed method were
  tion of the interested point (x1, y1) of the detected image            about 91.21, 90.27, 94.32 and 91.46%, respectively.
  is compared with the centroid of its respective ground                 Thus, the results listed depict that the proposed method
  truth (x2, y2) by taking the difference among them (Dx                 yields best sensitivity and specificity outputs achieving
  and Dy). The differences are calculated using equation                 accuracy of about 94.32%. The proposed method could
  shown below:                                                           help the radiologist in identifying cancer masses more
                                                                         accurately.
      Dx = ||x1 − x2 ||, Dy = ||y1 − y2 ||.                      (8)

	  If the difference value is ≤ factor of 10, then the points
  are very much closer to each other. Based on this we
  have calculated the accuracy using the Eq. (9):                      Discussion
                      Number of correctly detected images
      Accuracy =                                          . (9)        Comparing with the state-of-the-art methods, the proposed
                          Total number of images
                                                                       work executes multiple tasks to obtain good performance
	  For both the datasets, the detection accuracy obtained              in every step that highlights the different morphological
  is determined considering the total number of true-                  characteristics of hidden objects for detection of masses. A
  positives cases to the total number of cases involved.               straight comparison with existing work is difficult as some
  As shown in Table 3, in MIAS dataset, 85 images were                 of them considered partial dataset cases, eliminating certain
  used for experimentation. Among them 83 masses were                  images. However, we have provided few of existing ones of
  detected and shown exact locations as specified in the               the same kind.
  ground truth. Meanwhile, two masses were not detected                   Comparison on segmentation of breast region: compar-
  that are marked in the dataset. In CBIS-DDSM dataset,                isons with few of the existing studies comprises of most
  200 images were considered and the proposed method                   common methods on the detection of masses are shown in
  was able to detect 192 images with exact locations as                Table 6.
  given in dataset but failed to detect eight mass images.                In comparison with the state-of-the-art methods, the
  The accuracy obtained for CBIS-DDSM dataset is quite                 accuracies of detection of breast masses from both MIAS
  lower than that of MIAS dataset. The reason is because               and DDSM dataset are listed in Table 6. Summarizing per-
  of the varied quality of examination, and noise added                formances of representative methods from the literature, the
  due to different imaging conditions.                                 method adopted in the proposed work achieved maximum

                                                                                                            SN Computer Science
63 Page 12 of 13                                                                                           SN Computer Science (2021) 2:63

Table 6  Comparative study on    Detection approach                               Author                        Dataset        Accuracy %
detection of masses for MIAS
dataset                          Gray difference weight and MSER detector         Proposed method               MIAS           97.64
                                                                                                                DDSM           94.66
                                 Threshold-based segmentation                     Makandar et al. [15]          MIAS           94.5
                                 Meta-heuristic algorithm electromagnetism-like   Soulami et al. [18]           MIAS           78.57
                                  optimization                                                                  DDSM           91.07
                                 Dual stage adaptive thresholding (DuSAT)         Anitha et al. [19]            MIAS           93.5
                                                                                                                DDSM           92.5
                                 Gestalt psychology                               Wang et al. [20]              MIAS           92
                                                                                                                DDSM           93.84
                                 Quad tree decomposition                          Divyashree et al. [27]        DDSM           90

accuracy in case of MIAS dataset and quite less accuracy             the robustness of the algorithm could be tested with other
in case of DDSM dataset. Slightly lower accuracies are               standard datasets available. The work can also be extended
obtained in case of DDSM dataset due to more noisy images            to detect other abnormalities in mammography.
compared to MIAS dataset. These results on detection of
masses provide solid foundation to focus on the masses
listed for further analysis.
                                                                     Conclusion

                                                                     In the analysis of mammographic image, segmentation of
Future Scope                                                         breast region and detection of masses are important steps
                                                                     to be accomplished for proper diagnosis of breast cancer.
Breast region segmentation is the crucial step in the mam-
                                                                     In this paper, background suppression, pectoral muscle
mographic study, which defines the boundaries of the breast
                                                                     removal, breast region extraction and enhancement fol-
for further quantifications. The segmentation of breast
                                                                     lowed by detection of mass are performed to analyze mam-
region is performed using gray difference weight and fast
                                                                     mograms. Gradient weight map is utilized for background
marching method. But, segmentation errors in segmenting
                                                                     suppression, pectoral muscle boundary is identified and
breast region could be further reduced. Because, accurate
                                                                     removed to obtain breast region alone using gray difference
segmentation of breast region avoids missing of abnormal
                                                                     weight and fast marching method. Later, the breast region is
tissues particularly localized at the boundary regions. The
                                                                     enhanced using CLAHE and de-correlation stretch. Gray dif-
visualization of the breast region in mammographic image
                                                                     ference weight and MSER detectors are employed to detect
can be performed still better to distinguish tissues in more
                                                                     the mass. The experimental results proved that the method
noisy images.
                                                                     proposed produced comparable results as per the ground
   The breast imaging reporting and data system (BI-RADS)
                                                                     truth provided in the dataset for segmentation of breast
[33] defines indicators such as calcifications, glandular den-
                                                                     region and mass detection and achieved higher accuracies.
sity and particular masses that have been proved symptom
                                                                     The algorithm is tested on MIAS and CBIS-DDSM dataset
for breast cancer [34, 35]. The features characterizing indi-
                                                                     for its efficiency.
cators, as derived from segmentation and mass detection
results, may conduct primary screening for radiologists to           Acknowledgements The authors would like to thank Dr. Deepashree
save manpower.                                                       Basavalingu, Consultant Radiologist, Blackpool Teaching Hospitals,
   The detection of mass is achieved using gray difference           NHS Foundation Trust, United Kingdom for her certification of ground
                                                                     truths, valuable help and comments in carrying out this work. The first
weight and MSER detector in which the method achieved
                                                                     author would like to thank the Ministry of Tribal Affairs, Govern-
high accuracy. In future, the improvements could focus on            ment of India for awarding the National Fellowship (201718-NFST-
enriching the detection rate of mass as the proposed method          KAR-00159) to carry out this research work.
failed to detect masses in some cases. Especially in case
of CBIS-DDSM dataset, the detection rate could be further            Funding The first author would like to thank the Ministry of Tribal
                                                                     Affairs, Government of India for awarding the National Fellowship
improved. The detected masses in some of the images were
                                                                     (201718-NFST-KAR-00159) to carry out this research work.
under-segmented and correct masses were not identified in
some extremely dense breasts regions. To deal such images
                                                                     Compliance with Ethical Standards
and to improve the performances, in future, along with the
CLAHE technique, technique that further amplifies the noisy          Conflict of Interest Corresponding author declares that no conflict of
data can be inculcated as a post processing method. Later,           interest.

SN Computer Science
SN Computer Science (2021) 2:63                                                                                              Page 13 of 13 63

Ethical Approval This article does not contain any studies with human   19. Anitha J, Dinesh Peter J, Immanuel APS. A dual stage adaptive
participants or animals performed by any of the authors.                    thresholding (DuSAT) for automatic mass detection in mammo-
                                                                            grams. Comput Methods Programs Biomed. 2017;138:93–104.
                                                                        20. Wang H, Feng J, Qirong B, et al. Breast mass detection in digital
                                                                            mammogram based on gestalt psychology. J Healthc Eng. 2018.
References                                                                  https​://doi.org/10.1155/2018/40156​13.
                                                                        21. Geraldo J, Rocha S, Almeida J, Paiva A, Silva A, Gattass M.
                                                                            Breast cancer detection in mammography using spatial diversity,
 1. DeSantis C, Ma J, Bryan L, et al. Breast cancer statistics, 2013.
                                                                            geostatistics, and concave geometry. Multimedia Tools Appl.
    CA Cancer J Clin. 2014;64(1):52–62.
                                                                            2019;78:13005–031. https​://doi.org/10.1007/s1104​2-018-6259-z.
 2. Oeffinger KC, Fontham ETH, Etzioni R, et al. Breast cancer
                                                                        22. Agarwal R, Díaz O, Yap MH, Lladó X, Martí R. Deep learning
    screening for women at average risk: 2015 guideline update from
                                                                            for mass detection in full field digital mammograms. Comput Biol
    the American Cancer Society. JAMA. 2015;314(15):1599–614.
                                                                            Med. 2020. https​://doi.org/10.1016/j.compb​iomed​.2020.10377​4.
 3. Jalalian A, Mashohor SBT, Mahmud HR, et al. Computer-aided
                                                                        23. Suresh R, Nagaraja Rao A, Eswara Reddy B. Detection and clas-
    detection/diagnosis of breast cancer in mammography and ultra-
                                                                            sification of normal and abnormal patterns in mammograms using
    sound: a review. Clin Imaging. 2013;37(3):420–6.
                                                                            deep neural network. Concurr Comput Pract Exp. 2019. https​://
 4. Chen B, Ning R. Cone-beam volume CT breast imaging: feasibil-
                                                                            doi.org/10.1002/cpe.5293.
    ity study. Med Phys. 2002;29(5):755–70.
                                                                        24. Zeiser FA, da Costa CA, Zonta T, Marques NMC, Roehe AV,
 5. Mann RM, Kuhl CK, Kinkel K, et al. Breast MRI: guidelines
                                                                            Moreno M, da Rosa Righi R. Segmentation of masses on mammo-
    from the European Society of Breast Imaging. Eur Radiol.
                                                                            grams using data augmentation and deep learning. J Dig Imaging.
    2008;18(7):1307–18.
                                                                            2020;33(4):858–68. https:​ //doi.org/10.1007/s10278​ -020-00330-​ 4.
 6. Olsen O, Gøtzsche PC. Cochrane review on screening for breast
                                                                        25. Vikhe PS, Thool VR. Mass detection in mammographic images
    cancer with mammography. Lancet. 2001;358(9290):1340–2.
                                                                            using wavelet processing and adaptive threshold technique. J Med
 7. Johns PC, Yaffe MJ. X-ray characterisation of normal and neo-
                                                                            Syst. 2016;40(4):82.
    plastic breast tissues. Phys Med Biol. 1987;32(6):675–95.
                                                                        26. Divyashree BV, Soujanya MR, Keerthana M, Naveen M, Heman-
 8. Mustra M, Grgic M, Rangayyan RM. Review of recent advances
                                                                            tha Kumar G. Segmentation and classification of mammogram
    in segmentation of the breast boundary and the pectoral muscle
                                                                            into normal and abnormal using texture features. Int J Recent
    in mammograms. Med Biol Eng Comput. 2016;54(7):1003–24.
                                                                            Technol Eng. 2020. https​://doi.org/10.35940​/ijrte​.A1964​.05912​0.
 9. Dromain C, Boyer B, Ferre R, et al. Computed-aided diag-
                                                                        27. Divyashree BV, Amarnath R, Naveen M, Hemantha Kumar G.
    nosis (CAD) in the detection of breast cancer. Eur J Radiol.
                                                                            Novel approach to locate region of interest in mammograms for
    2013;82(3):417–23.
                                                                            Breast cancer. J Intell Syst Appl Eng. 2018;6(3):185–90. https​://
10. Oliver A, Freixenet J, Marti J, et al. A review of automatic mass
                                                                            doi.org/10.18201​/ijisa​e.20186​44775​.
    detection and segmentation in mammograic images. Med Image
                                                                        28. Suckling J, Parker J, Dance DR, Astley S, Hutt I, Boggis CRM,
    Anal. 2010;14(2):87–110.
                                                                            Ricketts I, Stmatakis E, Cernaez N, Kok SL, Taylor P, Betal D,
11. Gonzalez RC, Woods RE, Eddins SL. Digital image processing
                                                                            Savage J. The mammographic image analysis society digital mam-
    using MATLAB: and mathworks, MATLAB Sim SV 07. Gates-
                                                                            mogram database. In: Proceedings of the 2nd international work-
    mark Publishing; 2009.
                                                                            shop on digital mammography; 1994. pp. 375–378.
12. Chen Z, Zwiggelaar R. A combined method for automatic iden-
                                                                        29. Lee RS, Gimenez F, Hoogi A, Rubin D. Curated Breast Imaging
    tification of the breast boundary in mammograms. In: 5th Inter-
                                                                            Subset of DDSM [Dataset]. The Cancer Imaging Archive. 2016.
    national conference on biomedical engineering and informatics
                                                                            https​://doi.org/10.7937/K9/TCIA.2016.7O02S​9CY.
    (BMEI), IEEE; 2012. pp. 121–125.
                                                                        30. Divyashree BV, Hemantha Kumar G. Segmentation and enhance-
13. Rampun A, Morrow PJ, Scotney BW, et al. Fully automated breast
                                                                            ment of mammograms for the detection of cancer using Gradient
    boundary and pectoral muscle segmentation in mammograms.
                                                                            weight map and de correlation stretch. NCVPRIPG. Accepted for
    Artif Intell Med. 2017;79:28–41.
                                                                            publication 2020. (In Press).
14. Shi P, Zhong J, Rampun A, Wang H. A hierarchical pipeline
                                                                        31. Tan PN, Steinbach M, Kumar V. Introduction to data mining. 1st
    for breast boundary segmentation and calcification detection in
                                                                            ed. Boston: Addison-Wesley Longman Publishing Co., Inc.; 2005.
    mammograms. Comput Biol Med. 2018. https:​ //doi.org/10.1016/j.
                                                                        32. Dice LR. Measures of the amount of ecologic association between
    compb​iomed​.2018.03.011.
                                                                            species. Ecology. 1945;26(3):297–302.
15. Makandar A, Halalli B. Threshold based segmentation tech-
                                                                        33. Magny SJ, Shikhman R, Keppke AL. Breast imaging reporting
    nique for mass detection in mammography. J Comput.
                                                                            and data system. StatPearls publishing; 2020. https​://www.ncbi.
    2016;11(6):472–8.
                                                                            nlm.nih.gov/books​/NBK45​9169/.
16. Rahmati P, Adler A, Hamarneh G. Mammography segmenta-
                                                                        34. Boyd NF, Guo H, Martin LJ, et al. Mammographic density
    tion with maximum likelihood active contours. Med Image Anal.
                                                                            and the risk and detection of breast cancer. N Engl J Med.
    2012;16:1167–86.
                                                                            2007;356(3):227–36.
17. Neto OPS, Silva AC, Paiva AC, Gattass M. Automatic mass
                                                                        35. Vachon CM, Van Gils CH, Sellers TA, et al. Mammographic den-
    detection in mammography images using particle swarm opti-
                                                                            sity, breast cancer risk and risk prediction. Breast Cancer Res.
    mization and functional diversity indexes. Multimed Tools Appl.
                                                                            2007;9(6):217–26.
    2017;76(18):19263–89.
18. Soulami KB, Saidi MN, Honnit B, Anibou C, Tamtaoui A.
                                                                        Publisher’s Note Springer Nature remains neutral with regard to
    Detection of breast abnormalities in digital mammograms
                                                                        jurisdictional claims in published maps and institutional affiliations.
    using the electromagnetism like algorithm. Multimedia Tools
    Appl. 2019;78(10):12835–63. https​: //doi.org/10.1007/s1104​
    2-018-5934-4.

                                                                                                                   SN Computer Science
You can also read