Performance Evaluation of Iris Based Recognition System Implementing PCA and ICA Encoding Techniques

Page created by Harvey Wolf
 
CONTINUE READING
Performance Evaluation of Iris Based Recognition System
             Implementing PCA and ICA Encoding Techniques
                     Vivekanand Dorairaj1, Natalia A. Schmid2, and Gamal Fahmy3
                    Lane Department of Computer Science and Electrical Engineering
                    West Virginia University, P.O. Box 6109, Morgantown, WV 26506

                                                       ABSTRACT

In this paper, we describe and analyze the performance of two iris-encoding techniques. The first technique is based on
Principle Component Analysis (PCA) encoding method while the second technique is a combination of Principal
Component Analysis with Independent Component Analysis (ICA) following it. Both techniques are applied globally.
PCA and ICA are two well known methods used to process a variety of data. Though PCA has been used as a
preprocessing step that reduces dimensions for obtaining ICA components for iris, it has never been analyzed in depth
as an individual encoding method. In practice PCA and ICA are known as methods that extract global and fine features,
respectively. It is shown here that when PCA and ICA methods are used to encode iris images, one of the critical steps
required to achieve a good performance is compensation for rotation effect.
     We further study the effect of varying the image resolution level on the performance of the two encoding methods.
The major motivation for this study is the cases in practice where images of the same or different irises taken at
different distances have to be compared.
     The performance of encoding techniques is analyzed using the CASIA dataset. The original images are non-ideal
and thus require a sequence of preprocessing steps prior to application of encoding methods. We plot a series of
Receiver Operating Characteristics (ROCs) to demonstrate various effects on the performance of the iris-based
recognition system implementing PCA and ICA encoding techniques.

Keywords: iris recognition, principal component analysis, independent component analysis, image encoding,
performance evaluation, biometrics

                                                 1. INTRODUCTION

Iris provides outstanding recognition performance when used as a biometric. Iris patterns are believed to be unique due
to the complexity of two underlying processes (1) environmental and (2) genetic that influence the generation of iris
pattern. These result in textural patterns that are unique to each eye of an individual and even distinct between twins.
Iris as a biometric has been extensively studied over the last decade. The first complete iris based recognition system
was designed and patented by J. Daugman [1]. It was followed by a number of other works. The most referred in the
literature are [2,3,4]. The systems described in these works capture and further process a single or multiple copies of
infra-red or visible light frontal view images of high resolution with no strong blur or occlusions (we call this setting
“ideal”) in order to guarantee good performance.
     The literature contains a large number of publications that describe different parts of J. Daugman’s system [1-4].
The following general steps are involved:
1. Localization of region of interest – During this step, the pupil, sclera, and eyelids are segmented.
2. Normalization – Transforms a localized iris region from Cartesian coordinates to doubly dimensionless polar
   coordinates.
3. Encoding – Uses 2D Gabor wavelets to encode image content that is then quantized to two levels based on phase
   information of the output. The result of encoding step is presented as a binary template called “Iris Code”.
4. Matching – Matching is performed using the Hamming distance.
In practice J. Daugman’s system performs exclusively well. However, since the system is patented, there are no details
provided on how to select various parameters for its implementation. The uncertainty of specifications in the literature

1
  E-mail: vivekand@csee.wvu.edu
2
  E-mail: natalias@csee.wvu.edu, Telephone: (304) 293-0405 ext. 2557
3
  E-mail: fahmy@csee.wvu.edu, Telephone: (304) 293-0405 ext. 3588
IRIS LOCALIZATION
                                            POLAR REPRESENTATION                 ENHANCEMENT AND MASKING

                              ICA Block                                                PCA Block
                                                                        Eigen iris 1      Eigen iris 2   Eigen iris 3

                            S= A ˆ X, where
                                     −1
                                                                        Eigen iris 4     Eigen iris 5    Eigen iris 6
                    X = PCA components
                    S = ICA components
                    ˆ −1 = Estimated unmixing
                    A
                           matrix                                      Eigen iris 94    Eigen iris 95    Eigen iris 96

                    Fig. 1: Block-diagram of the system implementing PCA/ICA encoding techniques for iris.

and unavailability of “ideal” iris images for testing the system led us to reproducing step 1 and partially reproducing and
redesigning steps 2, 3, and 4 of J. Daugman’s system in our initial study of the iris [12]. Our experience has shown that
application of 2D Gabor filters produces good results. However, the performance of the entire iris recognition system is
sensitive to variations of Gabor filter parameters. The parameters of 2D Gabor filters are database dependent, meaning
that they have to be finely tuned to obtain an optimal performance each time a new database is used. The results for the
modified J. Daugman’s system constitute a baseline for performance comparison.
     Most of current research works focus on redesigning the second and the third subsystems and on dealing with non-
ideal nature of iris image acquisition in practice. A large number of filter-based iris encoding algorithms [2,4,15,16]
have been designed over the past few years. These methods often demonstrate outstanding performance, especially on
datasets of good image quality. Filter parameters used for image encoding in these algorithms are typically tuned to
extract local information/features while paying less attention to the global information. In this work, we analyze the
performance of two encoding strategies (1) PCA and (2) ICA that are capable of capturing both global and local features
when applied to an iris image. The level of extracted features depends on the mode in which algorithms operate. Iris is
not a novel application for use of ICA. We are aware of a few previously published works that use ICA method for iris
image encoding [7,8,17]. However, in all these works the ICA was used in a mode of operation that extracts only local
features, as proposed by Hyvarinen [9]. The purpose of this research is not to design a perfect encoding algorithm, but
rather to explore an opportunity of using global image encoding / feature extraction algorithms to process the iris and
study the effect of combining global and local features on the performance of iris recognition system. PCA is often
used as a preprocessing step to ICA with the goal to uncorrelate data and thus simplify extraction of ICA components
[9]. To the best of our knowledge, PCA has never been analyzed as an individual iris encoding technique.
     We further study the effect of varying the image resolution level on the performance of these two methods. The
major motivation for this study is the cases in practice where images of the same or different irises taken at different
distances have to be compared.
     The following distinctive features characterize our approach:
1. Global iris encoding.
2. Use of Euclidean and Hamming distances to measure the performance.
3. Compensation for image rotation - takes into account the effect of head tilt.
     The remaining parts of the paper are organized as follows. Sec. 2 provides a description of the two proposed
encoding approaches and explains the distinctive features of our work. Sec. 3 presents the results.

                                  2. PROPOSED ENCODING TECHNIQUES

The general block-diagram of our system is shown in Fig.1. Below is a brief description of the two encoding blocks.
2.1 Encoding Using PCA
In this section, we briefly characterize the PCA method adopted to perform iris encoding (for a details on PCA see
[6,14]). A typical PCA algorithm operates in two modes: training and testing. During the training mode, the principal
components are extracted using labeled training data. During the testing mode, the performance of the iris identification
system is evaluated.
     Let M be the number of iris classes. Suppose that a training set X 1 , X 2 , K , X M , a sequence of normalized and
preprocessed iris images indexed in accordance with iris class, is available. These data are used to form the scatter
matrix
                                                      M
                                                 1
                                           Σ=
                                               M − 1 m =1 ∑
                                                          ( X m − X )( X m − X ) T ,

where X is the empirical mean. Here we assume that images are reshaped into vector columns. Since Σ is positive
definite and symmetric, it can be decomposed using the eigenvalue decomposition method known also as the Karhunen-
Loeve expansion. Thus
                                                          Σ = QΛQ T ,
where Λ is the diagonal matrix of the eigenvalues of Σ arranged in decreasing order and Q is the orthogonal matrix
whose columns form the eigenvectors of Σ . Geometrically, the eigenvectors are the basis vectors of the transformed
                                                                                                   ~
coordinate system. In practice, the smallest eigenvalues of Σ are disregarded, and new matrix Q with vector columns
corresponding to the essential eigenvalues is formed. Thus the new transformed space has smaller dimension than the
original space (data compression concept). For iris images, as will be demonstrated in the Sec. 3, the compression is
poor.
      To test the algorithm, we use additional dataset called testing dataset composed of iris images independent of the
training set. Each vector in the testing set is further projected onto the axes of the new transformed space, and the
coefficients of projections are collected in the vector of features, W . To measure the distance between two projected
iris images, we involve two distances: (i) Euclidean and (ii) Hamming. To involve the second measure, we quantize
the values of individual coefficients in the vector W to “1” or “0” if the feature value is greater than zero and less than
or equal to zero, respectively. In order to take rotation into consideration, one of two segmented and enhanced iris
images is rotated systematically on either direction (up to a few degrees) and templates are extracted for each rotated
                                  ~                  ~
version of the image. Let W1 = Q T Y1 and W2 = Q T Y2 be two PCA coefficient vectors corresponding to two distinct
normalized and preprocessed iris images Y1 and Y2 from the testing set. Then to compensate for rotation the
following minimization step is performed
                                                             ~       ~
                                                     min d (Q T Y1 ; Q T Y2 (θ )),                             (1)
                                                θ ∈[ −θ ,θ ]
                                                  max   max

                                                                                                     ~
where d (⋅) denotes the Euclidean or Hamming distance between two projected iris images and Q is the matrix
composed of eigenvectors corresponding to only essential eigenvalues. Note that for normalized images the rotation
operation reduces to the cyclic shift operation of the second image with respect to the first image.

2.2 Following up with ICA
     ICA is a subspace analysis technique. It aims to find a set of independent sources that capture the underlying
randomness of the observed signals. ICA has been applied to iris analysis in [7,8,17]. In those works, randomly
selected patches of a small size from iris images form a training set. This training set is then processed using PCA to
reduce its dimensionality and decorrelate components, before applying ICA locally. In theory, whitening before ICA is
not a necessary step. Though, whitening aids estimation of independent components. The training templates available
after applying PCA form the input to the ICA block. An unmixing matrix, which represents the ICA basis vectors, is
estimated from these PCA input templates by minimizing the mutual information (a measure of dependence) between
transformed components in ICA space. Previous ICA algorithms [7,8,17] do not take rotation (image alignment) into
account. While this step seems unnecessary in [7,8,17], compensation for a rotation uncertainty is a critical step for our
non-ideal iris application. In this work, we use ICA as a follow up encoding method in anticipation that it will pick
individual fine features present in iris images and thus will improve the performance of PCA method. Unlike previous
ICA algorithms used for iris recognition, we do not divide iris images into patches during the training step but rather use
the entire iris image to estimate the unmixing matrix and further to extract ICA components.
64 x 360

                                                              32 x 180

                                                              16 x 90

                                                               8 x 45

                                      Fig. 2: Downsampling by decimation (example).

                  ~ ~          ~
     Denote by X 1 , X 2 , K , X M a sequence of preprocessed, normalized, and whitened iris images indexed by their
class. During the training mode we assume that each class is represented by a single iris image. It can be easily
generalized to a multi-image case.
     ICA is a blind source-separation method. It assumes that observed data can be represented as a linear combination
of a number of independent signals. The unknowns are the mixing coefficients and the independent input signals. Let
 ~                                               ~ ~          ~
 X be a matrix with vector columns given by X 1 , X 2 , K , X M and S be a matrix composed of unknown independent
input signals arranged in columns. Then ICA assumes the following forward model
                                                            ~
                                                            X = AS,
where A is the unknown mixing matrix. As argued in [9], the results of linear mixing of non-Gaussian signals are
more Gaussian than the input signals. Then to estimate the mixing matrix A and one of the components of S , one has
to define a measure of non-gaussianity. One of the theoretically sound criteria is the maximization of the negentropy
given by
                                                 J = H Gaus ( A −1 X) − H ( A −1 X),                               (2)
where H Gaus is the entropy of the data under the assumption that data are Gaussian distributed and under the constraint
of the same covariance matrix for the distributions in H Gaus and in H (see [9] for more detailed explanation). Once the
mixing matrix and one of the input signals are estimated, the remaining input signals can be obtained by invoking the
Gram-Schmidt orthogonalization procedure. To deal with empirical case, (2) is approximated by expressions
involving empirical moments.
     To test the performance of ICA method, we use testing data that are different from the training set. Similar to the
case with PCA encoding, we apply two distance measures to perform matching (i) Euclidean and (ii) Hamming
distances. This is done for the purpose of performance comparison. To overcome the effect of rotation during testing,
we project each rotated version of the two images into PCA space and then into ICA space and obtain the minimum
                                                                      ~     ~           ~       ~
score between the templates of the different rotated versions. Let W1 = SQ T Y1 and W2 = SQ T Y2 be two vectors of
ICA coefficients corresponding to two distinct normalized, preprocessed iris images Y1 and Y2 from the testing set.
The following minimization procedure is applied compensates for rotation
                                                            ~         ~ ~
                                                    min d (SQ T Y1 ; SQ T Y2 (θ )),                             (3)
                                               θ ∈[ −θ ,θ ]
                                                  max   max

                                                                                               ~
where d (⋅) denotes the Euclidean or Hamming distance between two projected iris images and Q is the matrix of
essential eigenvectors introduced in Sec. 2.2.

2.3 Effect of Varying the Resolution
In order to measure the effectiveness of the proposed algorithms based on PCA and ICA techniques, with respect to
variations in resolution, we interpolate our iris images to a lower resolution grid. Two strategies of reducing image
resolution were considered (i) down sampling and (ii) averaging. The first strategy is performed by selecting every
second pixel in both the row and column directions. The second strategy is performed by averaging the original higher
resolution image over non-overlapping blocks of size 2-by-2. Hence, in both the cases we have one point in the lower
resolution grid for every four points in the next higher resolution grid. Sec. 3 presents synthesized results. Empirical
study of the effect of varying the resolution on the performance of the iris based identification system is an ongoing
work at WVU. An example of downsampling a normalized iris image is presented in Fig. 2

                                                      3. RESULTS

All experiments were performed on the CASIA dataset provided by the Chinese Academy of Sciences [5]. The CASIA
dataset contains “non-ideal” iris images of 108 irises with 7 images per iris. The images in this dataset are strongly
occluded, blurred, and defocused. Sample images from CASIA datasets are shown in Fig. 3.
     As a baseline curve, we use the receiver-operating characteristic (ROC) for modified J. Daugman’s system [12].
The results of the modified J. Daugman’s system implementation are shown in Fig. 6 (black solid line).
     In our experiments, global PCA method when applied to iris images, extracted 99 essential eigenvalues. The value
of 99-th eigenvalue is approximately 15 times smaller than the value of the first eigenvalue. This resulted in a poor
compression. This also emphasizes the fact that individual iris is rich in texture.

                           USER 1                     USER 19                       USER 62

                                        Fig. 3: Sample images from CASIA dataset.

      Compensation for rotation: We first demonstrate the effect of rotation on the performance of PCA and global ICA
encoding methods. Fig. 4 and 5 display two sets of histogram distributions of genuine and imposter Euclidean distance
scores obtained using the data from the CASIA dataset. The left and the right panels in Fig. 4 show the results for iris
verification system implementing PCA encoding technique without and with compensation for rotation, respectively.
Since iris images during processing are transformed to have a pseudo-polar representation, the angle values used to
optimize the performance are measured in the number of pixels. Each image is normalized such that one pixel in the
normalized image corresponds to one degree in the original image. The range of angles used during these experiments
was set to [-10, 10]. Fig. 5 presents similar results for encoding with ICA technique. The ROC curves demonstrating
the effect of rotation are shown on the left and right panels in Fig. 6 for PCA and ICA encoding techniques,
respectively. The results are displayed both for Euclidean (ED) and Hamming (HD) distances. One can conclude that
the compensation for head tilt (rotation) leads to a substantial improvement of performance. Only for results involving
both Hamming distance and ICA, rotation operation is performed during the PCA step. The lowest scoring PCA
template is projected on ICA space.
      Two scenarios: The performance of the system in Fig. 1 is further evaluated using two different scenarios. In the
first scenario, we formed two sets of iris images, training and testing, from the CASIA database. Each set consisted of
one image from 100 different irises. The training set was formed from the third image of first 100 irises, while the
testing set was formed from the second image of the same irises. In the second scenario, we used the training only to
extract eigenvectors and discarded all training set images. During testing, PCA components (for the first technique) or
ICA components (for the second technique) of each image from the test data set were obtained and compared against
PCA/ICA components of the other images in the testing set. Thus, the second scenario can be viewed as a “blind”
testing. The ROC curves for both scenarios are shown in Fig. 7. The left panel in Fig. 7 demonstrates the results when
Hamming distance is used as the matching score distance. The right panel demonstrates similar results for the casewhen
Euclidean distance is applied to calculate the matching scores. As expected, the results of testing under the “blind”
scenario are slightly degraded (both for PCA and ICA) compared to the results obtained under the first scenario.
Fig. 4: The left panel shows the histogram distributions of the genuine and imposter Euclidean distances when PCA encoding
     technique with no compensation for rotation is applied. The right panel presents similar results for the case when the
     compensation for rotation is performed.

Fig. 5: The left and the right panels show the histogram distributions of the genuine and imposter Euclidean distances when ICA
     encoding technique without and with compensation for rotation is applied, respectively.

Fig. 6: The left panel displays 4 ROC curves for PCA encoding techniques with and without compensation for rotation. The ROC
     curves marked with “+” are for the case when Hamming distance is used as a matching score. The ROCs marked with “o”
     describe the case when Euclidean distance is used as a matching score. The solid line marked in black corresponds to the ROC
     obtained with the modified Daugman’s algorithm. The right panel shows similar results for ICA encoding.

The advantage of the “blind” scenario is in its flexibility. The coefficients forming PCA and ICA vectors need not be
stored in a user database.
     Effect of varying the resolution: Fig. 8 and 9 demonstrate the effect of varying the resolution on the performance of
two proposed encoding methods employing Euclidean distance. It can be seen that the averaging strategy outperforms
the down sampling strategy. This is an anticipated result, for the averaging operation retains a larger amount of
information in the averaged image than the operation of down sampling. Note also a significant drop in performance for
some low resolutions.
     Overall, our results demonstrate a potential of applying global PCA and ICA techniques for iris encoding.
Fig. 7: The figure shows the ROC curves describing the performance of iris based verification system under two distinct scenarios:
     (i) typical and (ii) “blind.” The results shown on the left panel are for the case when Hamming distance is used as a matching
     score. The results shown on the right panel are when Euclidean distance is used.

Fig. 8: The left panel shows the effect of downsampling by averaging 2*2 neighborhood blocks, on ICA. The right panel shows the
affect of downsampling by leaving out every other pixel, on ICA.

Fig. 9: The left panel shows the effect of downsampling by averaging 2*2 neighborhood blocks, on PCA. The right panel shows the
     affect of downsampling by leaving out every other pixel, on PCA.

                                                       REFERENCES

1.    Daugman J. (1993) “High confidence visual recognition of persons by a test of statistical independence.” IEEE
     Transactions on Pattern Analysis and Machine Intelligence, vol. 15(11), pp. 1148-1161.
2.    Li Ma , Tan T., Wang Y. and Zhang D., “Personal Identification Based on Iris Texture Analysis”, IEEE
     Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no.12 ,pp. 1519-1533, 2003.
3.    Wildes R.P., “Iris Recognition: An Emerging Biometric Technology”, Proceedings of the IEEE. vol.85, No.9,
     September 1997.
4. L. Masek, Recognition of Human Iris Patterns for Biometric Identification, M.S. Dissertation, The University if
     Western Australia, 2003, http://www.csse.uwa.edu.au/~pk/studentprojects/libor/
5. CASIA Iris Image Database (ver. 1.0), http://www.sinobiometrics.com/casiairis.htm
6. Turk M.A., and Pentland A.P., “Face Recognition using eigenfaces” , Proceedings of the IEEE Conference on
     Computer Vision and Pattern Recognition, pages 58690, Hawaii, June 1992.
7. Kwanghyuk Bae, Seung-In Noh, Jaihie Kim, "Iris Feature Extraction Using Independent Component Analysis", 4th
     International Conference Audio-and Video-Based Biometric Person Authentication, Guildford, UK, June, 2003,
     pp.838-844.
8. Seung-In Noh, Kwanghuk Pae, Chulhan Lee, and Jaihie Kim, "Multiresolution Independent Component Analysis for
     Iris Identification", The 2002 International Technical Conference on Circuits/Systems, Computers and
     Communications, 2002, Phuket, Tailand, July 2.
9. A. Hyvärinen, J. Karhunen, E. Oja, “Independent Component Analysis”, John Wiley and Sons, 2001.
10. FastICA MATLAB Package . Available online at http://www.cis.hut.fi/projects/ica/fastica
11. Bruce A. Draper, Kyungim Baek, Marian Stewart Bartlett, J. Ross Beveridge, “Recognizing Faces with PCA and
     ICA”, CVIU,91(1-2):115-137, July 2003.
12. Vivekanand Dorairaj, Yash N. Shah, and Natalia A. Schmid, “Exploring and Developing Processing Techniques for
     Iris Recognition System”, Biometric Consortium Conference, Crystal city, VA, USA, September 20, 2004.
13. Juha Koikkalainen, Jyrki Lötjönen: "Image Segmentation with the Combination of the PCA- and ICA-Based Modes
     of Shape Variation", ISBI 2004: 149-152.
14. M. Turk and A. Pentland. “Eigenfaces for recognition”. Journal of Cognitive Neuroscience, 1(1):71–86, 1991.
15. Li Ma, Yunhong Wang, and Tieniu Tan. “Iris recognition using circular symmetric filters”. Proc. International
    Conference on Pattern recognition, volume 2, pages 414–417, 2002.
16. Li Ma, Tieniu Tan, Yunhong Wang, Dexin Zhang, “Efficient Iris Recognition by Characterizing Key Local
    Variations”, IEEE Trans. on Image Processing, Vol. 13, No.6, pp. 739- 750, 2004.
17. Ya-Ping Huang, Si-Wei Luo, En-Yi Chen, “An Efficient Iris Recognition System”, Proc. of the First
    International Conference on Machine Learning and Cybernetics, Beijing, 4-5 Nov 2002.
You can also read