IEICE TRANS. INF. & SYST., VOL.E92–D, NO.12 DECEMBER 2009

2527

LETTER

Face Recognition Based on Nonlinear DCT Discriminant Feature Extraction Using Improved Kernel DCV Sheng LI† , Student Member, Yong-fang YAO† , Xiao-yuan JING†a) , Heng CHANG† , Shi-qiang GAO† , David ZHANG†† , and Jing-yu YANG††† , Nonmembers

SUMMARY This letter proposes a nonlinear DCT discriminant feature extraction approach for face recognition. The proposed approach first selects appropriate DCT frequency bands according to their levels of nonlinear discrimination. Then, this approach extracts nonlinear discriminant features from the selected DCT bands by presenting a new kernel discriminant method, i.e. the improved kernel discriminative common vector (KDCV) method. Experiments on the public FERET database show that this new approach is more effective than several related methods. key words: DCT frequency bands selection, the improved KDCV, nonlinear DCT feature extraction, face recognition

1.

Introduction

Discrete cosine transform (DCT) is a widely used image processing technique [1], and discriminant analysis is an effective image feature extraction and recognition technique [2]. To date, many linear discriminant methods have been put forward, such as linear discriminant analysis (LDA) [3], direct LDA (DLDA) [4], and discriminative common vector (DCV) [5]. In our previous work, we combined the DCT and linear discriminant techniques and presented a DCT-LDA face recognition method which outperforms some conventional linear discriminant methods [6]. As an extension of linear discriminant technique, the kernel-based nonlinear discriminant analysis technique has now been widely applied to the field of pattern recognition. Baudat et al. developed a commonly used generalized discriminant analysis (GDA) method for nonlinear discrimination [7]. Jing et al. put forward a kernel DCV (KDCV) method [8]. Shen et al. combined Gabor wavelets and GDA for face identification and verification [9]. In this paper, we develop DCT-LDA and propose a nonlinear DCT discriminant feature extraction approach for face recognition. First, we provide the representation of DCT frequency bands and select appropriate bands. Second, we extract the nonlinear discriminant features from the selected bands by presenting a new kernel discriminant method, that is, the improved KDCV method which takes advantages of both Manuscript received May 8, 2009. Manuscript revised July 24, 2009. † The authors are with the Nanjing University of Posts and Telecommunications, Nanjing, 210003 China. †† The author is with the Dept. of Computing, Hong Kong Polytechnic University, Hong Kong. ††† The author is with the Nanjing University of Science and Technology, Nanjing, China. a) E-mail: [email protected] DOI: 10.1587/transinf.E92.D.2527

kernel discriminative common vectors and different vectors. The nearest neighbor classifier is adopted in classifying the extracted features. We employ a large public face database, the FERET database, as the test data. Experiments demonstrate the effectiveness of the proposed new approach. 2.

Recognition Approach

Suppose that each gray image sample in the sample set X is sized C × D where C ≥ D. We perform the two-dimensional DCT on each image and obtain its transformed image [1]. Figure 1 shows the face demonstration images of DCT transform: (a) an original image; (b) its DCT transformed image and frequency bands representation. According to Fig. 1 (b), most information or energy of face image is concentrated in the left-up corner, that is, in the low frequency bands. We use a half square ring to express the kth frequency band [6], where 1 ≤ k ≤ C. If we select the kth band, then we keep the corresponding frequency band values of DCT transformed image, otherwise we set the band values to change to zero. The nonlinear discriminant analysis technique generally uses the kernel-based method to realize a conceptual nonlinear transformation from an input feature space into a high-dimensional space. With respect to a given nonlinear mapping function Φ, the input data space Rn can be mapped into the kernel space F: Φ : Rn → F, x → Φ(x). Suppose that there are c known pattern classes in the sample set X, li is the training sample number of the ith class, and there will c  Φ be a total of M = li training samples. Let S ΦB and S W repi=1

resent the between-class scatter matrix and the within-class scatter matrix in F, respectively. They are defined as: S ΦB =

Fig. 1

c   1   Φ Φ T li μi − μΦ μΦ , i −μ M i=1

(1)

Face demonstration images of DCT transform and filtering.

c 2009 The Institute of Electronics, Information and Communication Engineers Copyright 

IEICE TRANS. INF. & SYST., VOL.E92–D, NO.12 DECEMBER 2009

2528

Φ SW

li  c  T 1  = , Φ(xim ) − μΦ Φ(xim ) − μΦ i i M i=1 m=1

(2)

where Φ(xim ) is the mth training sample of the ith class, μΦ i is the mean value of the ith class samples, and μΦ is the mean value of all training samples. Define the Fisher discriminant criterion in F as: J(ϕ) =

ϕT S ΦB ϕ Φ ϕT S W ϕ

.

(3)

The optimal discriminant vector ϕ can be expressed by a linear combination of the observations in F, we have ϕ = M  a j Φ(x j ) = Hα, where H = [Φ(x1 ), Φ(x2 ), . . . , Φ(x M )] j=1

and α = (a1 , a2 , . . . a M )T . We thus obtain [7]: J Φ (α) =

αT

αT (KUK) α , (K (IN − U) K) α

(4)

where matrix, K =   K is an M × M kernel symmetric  , Ki j = Φ (xi )T Φ xj ; IN is an M × M idenKi j i, j=1,2,···,M tity matrix; and U is an M × M block diagonal matrix. U = diag (U1 , . . . , Uc ) where Ui (i = 1, 2, · · · , c) is a li × li matrix, the elements of which are equal to 1 / li . Let 

S B = KUK

and



S W = K (IN − U) K,

(5)

the nonlinear discriminant capability of the band can be calculated by: 

Fk =

tr(S B ) 

tr(S W )

,

     Φ xim = Φ xicom + Φ xim

 di f

,

(8)

      where Φ xicom ∈ V ⊥ , Φ xim di f ∈ V, Φ xicom and   Φ xim di f separately represent the common vector and   different vector parts of Φ xim . Φ indicates a given nonlinear mapping function. In this paper, Φ is expressed kernel function k (c1 , c2 ) =  by the Gaussian   exp − c1 − c2 2 2δ2 . Note that for all the samples of the ith class, their common vector parts are same [10]. Step2. Compute the optimal projection transforms Wcom and Wdi f : According to KDCV [8], we use the comΦ , and mon vectors to construct the scatter matrix S com then get the projection transform Wcom composed of the Φ with nonzero eigenvalues. We obeigenvectors of S com tain the following kernel discriminative common vectors:   T Φ xicom , i = 1, 2, · · · , c, (9) yicom = Wcom where yicom is identical for the ith class and the feature dimension of yicom is c − 1. In this paper, we use the different vectors to calculate Wdi f . Take Φ xim di f of each training sample to construct the corresponding within-class scatter matrix Φ Φ SW di f and between-class scatter matrix S B di f . Wdi f is designed to satisfy the Fisher discriminant criterion:   W T S Φ Wdi f  di f B di f . J(Wdi f ) =  (10) W T S Φ Wdi f  di f W di f

(6) 



where tr () indicates the trace of matrix, S B and S W separately represent the corresponding between-class scatter matrix and the within-class scatter matrix in the kernel space [8]. In the experiments, we calculate the nonlinear discriminant capabilities of all frequency bands, and then we rank them in a descending order and select a part of bands with the largest discriminant capabilities. Next, we present an improved KDCV method to extract nonlinear discriminant features from the selected DCT bands as follows: Step1. Compute the common vectors and different vectors: The principle of KDCV is to acquire the nonlinear pro jection transform in the null space of S W :    J (W) = arg max W T S B W  |W T S W W |=0   , (7) Φ W  = arg max W T S com  |W T S W W |=0 

Φ where S com is the common vector scatter matrix, S W is an M × M symmetric matrix, and M is the total number  of training samples. Let V be the non-null space of S W,  and V ⊥ be the null space of S W . For any sample Φ xim of sample set X in the kernel space, we have [8]:

Wdi f is composed of the eigenvectors corresponding to  −1 Φ S ΦB di f . We obthe nonzero eigenvalues of S W di f tain the following kernel discriminative different vectors:   (11) yim di f = WdiT f Φ xim di f , i = 1, 2, · · · , c, where the feature dimension of yim di f is c − 1. Step3. Construct the synthesized optimal projection transform W for the improved KDCV: W is constructed by serially combining Wcom and Wdi f :  T W = Wcom WdiT f . (12) The whole recognition procedure of proposed approach is implemented as follows: (i) Perform DCT on each image sample of X. Select appropriate DCT frequency bands for the transformed images and express the selected bands in the form of feature vector. We thus obtain a one-dimensional sample  set X . (ii) Use the improved KDCV to compute the optimal projection transform W in the kernel space. We acquire the kernel discriminative common vector ycom and kernel discriminative different vector ydi f :

LETTER

2529

 W · Φ (x) = Wcom Wdi f · Φ (x)  . = ycom ydi f Normalize ycom and ydi f as ⎡ ⎤ ⎢⎢ ycom ydi f ⎥⎥



⎥⎥⎥⎦ , y = ⎢⎢⎢⎣

ycom ydi f

(13)

(14)

where represents the 2-norm of vector. We obtain a  new sample set Y corresponding to X ; (iii) Take the nearest neighbor classifier with the cosine distance measure to classify Y. The distance d () between two arbitrary samples, y1 and y2 , is defined by d(y1 , y2 ) = − 3.

yT1 y2 .

y1 y2

Fig. 2

Demonstration face images of five subjects.

(15)

Experimental Results

We use a public large face image database, the FERET database [11], as the test data. In the experiment it contains 600 frontal facial images corresponding to 200 individuals with each person contributing 3 images. The images in this database were captured under various illuminations and facial expressions. Each image is 384 × 256 with 256 gray levels. Since many images in this database include the background and the body chest region, we adopt the FERET protocol preprocessing software to automatically crop every image sample. That is, the facial region of each image is cropped off by using the centers of eyes and mouth, and the cropped images are resized to 128 × 128. Figure 2 shows the demonstration face images of five subjects. In the test, two images of each subject are randomly chosen for training, and the remaining one is used for testing. We perform the two-dimensional DCT on the images. Figure 3 shows the nonlinear discriminability values of all DCT bands by using Formula (6). According to the nonlinear discriminability values shown in Fig. 3, we rank all frequency bands in a descending order and in turn choose a part of them with the largest nonlinear discriminability values. Figure 4 shows the recognition rates of the selected DCT frequency bands. While using the selected bands from No. 1 to No. 23, we get the highest recognition rate of 99%. We compare the proposed approach with five representative discriminant methods which include the DCTLDA method [6], the generalized discriminant analysis (GDA) method [7], the DCT-GDA method, the DCT-KDCV method, and the Gabor-GDA method [9]. DCT-GDA and DCT-KDCV take the same method of selecting frequency bands as the proposed approach. In the test, the numbers of selected frequency bands are 22, 25, and 21 for DCTLDA, DCT-GDA, and DCT-KDCV, respectively. All compared methods use the same classifier, i.e. the nearest neighbor classifier. Table 1 shows the recognition rates of all compared methods. The proposed approach performs better than other methods. From Table 1, the proposed approach improves the recognition rates at least by 1.5% (=

Fig. 3

Nonlinear discriminability values of all DCT bands.

Fig. 4

Recognition rates of selected DCT frequency bands.

Table 1

Recognition rates of compared methods.

Methods Proposed Approach DCT-LDA GDA DCT-GDA DCT-KDCV Gabor-GDA

Recognition rates (%) 99% 93% 90% 96% 97.5% 97.5%

99% − 97.5%). Besides, compared with the commonly used nonlinear discriminant method GDA, the proposed approach saves 75.64% (= (65.76 − 16.02)/65.76 × 100%) computing time (second), where the computing time indicates the time of achieving discriminant features.

IEICE TRANS. INF. & SYST., VOL.E92–D, NO.12 DECEMBER 2009

2530

4.

Conclusions

The proposed nonlinear DCT discriminant approach is obviously superior to DCT-LDA, and it can extract more effective nonlinear discriminant features than the two representative methods of GDA and KDCV. And it outperforms the Gabor-GDA method. By selecting the DCT frequency bands, our approach also improves the computing speed of nonlinear discriminant methods. Acknowledgements The work described in this paper was fully supported by the National Natural Science Foundation of China (NSFC) under Project No. 60772059, the Natural Science Research Foundation of Jiangsu Province Universities under Project No. 07KJB520081, and the Research Foundation of Nanjing University of Posts and Telecommunications under Project No. NY207027, Project No. NY208051. References [1] Z.M. Hafed and M.D. Levine, “Face recognition using the discrete cosine transform,” Int. J. Comput. Vis., vol.43, no.3, pp.167–188, 2001. [2] D. Zhang, X.Y. Jing, and J. Yang, “Biometric image discrimination

(BID) technologies,” IGP/INFOSCI/IRM Press, 2006. [3] A.K. Jain, R.P.W. Duin, and J. Mao, “Statistical pattern recognition: A review,” IEEE Trans. Pattern Anal. Mach. Intell., vol.22, no.1, pp.4–37, 2000. [4] H. Yu and J. Yang, “A direct LDA algorithm for high-dimensional data with application to face recognition,” Pattern Recognit., vol.34, no.12, pp.2067–2070, 2001. [5] H. Cevikalp, M. Neamtu, M. Wilkes, and A. Barkana, “Discriminative common vectors for face recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol.27, no.1, pp.4–13, 2005. [6] X.Y. Jing and D. Zhang, “A face and palmprint recognition approach based on discriminant DCT feature extraction,” IEEE Trans. Syst., Man Cybern. B, vol.34, no.6, pp.2405–2415, 2004. [7] G. Baudat and F. Anouar, “Generalized discriminant analysis using a kernel approach,” Neural Comput., vol.12, no.10, pp.2385–2404, 2000. [8] X.Y. Jing, D. Zhang, J.Y. Yang, Y.F. Yao, and M. Li, “Face and palmprint pixel level fusion and kernel DCV-RBF classifier for small sample biometrics recognition,” Pattern Recognit., vol.40, no.11, pp.3209–3224, 2007. [9] L.L. Shen, L. Bai, and M. Fairhurst, “Gabor wavelets and general discriminant analysis for face identification and verification,” Image Vis. Comput., vol.25, no.5, pp.553–563, 2007. [10] M.B. Gulmezoglu, V. Dzhafarov, and A. Barkana, “The common vector approach and its relation to principal component analysis,” IEEE Trans. Speech Audio Process., vol.9, no.6, pp.655–662, 2001. [11] P.J. Phillips, H. Moon, S.A. Rizvi, and P.J. Rauss, “The FERET evaluation methodology for face-recognition algorithms,” IEEE Trans. Pattern Anal. Mach. Intell., vol.22, no.10, pp.1090–1104, 2000.

Face Recognition Based on Nonlinear DCT ...

Dec 12, 2009 - the kernel-based nonlinear discriminant analysis technique has now been widely ... alized discriminant analysis (GDA) method for nonlinear.

238KB Sizes 5 Downloads 234 Views

Recommend Documents

Face Recognition Based on SVM ace Recognition ...
features are given to the SVM classifier for training and testing purpose. ... recognition has emerged as an active research area in computer vision with .... they map pattern vectors to a high-dimensional feature space where a 'best' separating.

Face Recognition Based on Local Uncorrelated and ...
1State Key Laboratory for Software Engineering, Wuhan University, Wuhan, ... of Automation, Nanjing University of Posts and Telecommunications, 210046, ...

Authorization of Face Recognition Technique Based On Eigen ... - IJRIT
IJRIT International Journal of Research in Information Technology, Volume 2, ..... computationally expensive but require a high degree of correlation between the ...

Three dimensional face recognition based on geodesic ...
dimensional face recognition systems, on the other hand, have been reported to be less .... to methods based on PCA applied to the 3D point clouds, fitting implicit ... surfaces, and PCA applied to range images.21 Its performance was equivalent to an

Face Recognition Using Composite Features Based on ...
Digital Object Identifier 10.1109/ACCESS.2017.DOI. Face Recognition Using Composite. Features Based on Discriminant. Analysis. SANG-IL CHOI1 ... ing expressions, and an uncontrolled environment involving camera pose or varying illumination, the recog

Authorization of Face Recognition Technique Based On Eigen ... - IJRIT
IJRIT International Journal of Research in Information Technology, Volume 2, ..... computationally expensive but require a high degree of correlation between the ...

Appearance-Based Automated Face Recognition ...
http://sites.google.com/site/jcseuk/. Appearance-Based Automated Face. Recognition System: Multi-Input Databases. M.A. Mohamed, M.E. Abou-Elsoud, and M.M. Eid. Abstract—There has been significant progress in improving the performance of computer-ba

Support vector machine based multi-view face detection and recognition
theless, a new problem is normally introduced in these view- ...... Face Recognition, World Scientific Publishing and Imperial College. Press, 2000. [9] S. Gong ...

GA-Fisher: A New LDA-Based Face Recognition Algorithm With ...
GA-Fisher: A New LDA-Based Face Recognition. Algorithm With Selection of Principal Components. Wei-Shi Zheng, Jian-Huang Lai, and Pong C. Yuen. Abstract—This paper addresses the dimension reduction problem in Fisherface for face recognition. When t

Multithread Face Recognition in Cloud
Oct 12, 2016 - biometrics systems have proven to be an essential security tools, ... showcase cloud computing-enabled face recognition, which utilizes ...

Multithread Face Recognition in Cloud
Oct 12, 2016 - this responsibility to third-party vendors who can maintain cloud ... showcase cloud computing-enabled face recognition, which utilizes ...

A new approach for Face Recognition Based on PCA ...
Now we determine a second projection space (Fisher space). To do that, we apply the algorithm of linear discriminate analysis (LDA) [9] on the set of vectors Yi.

Language Recognition Based on Score ... - Semantic Scholar
1School of Electrical and Computer Engineering. Georgia Institute of ... over all competing classes, and have been demonstrated to be effective in isolated word ...

Language Recognition Based on Score ... - Semantic Scholar
1School of Electrical and Computer Engineering. Georgia Institute ... NIST (National Institute of Standards and Technology) has ..... the best procedure to follow.

pdf-0738\face-detection-and-recognition-on-mobile-devices-by ...
pdf-0738\face-detection-and-recognition-on-mobile-devices-by-haowei-liu.pdf. pdf-0738\face-detection-and-recognition-on-mobile-devices-by-haowei-liu.pdf.

RFID Based Face Attendance System RFID Based Face ... - IJRIT
ability to uniquely identify each person based on their RFID tag type of ID card make .... Fortunately, Intel developed an open source library devoted to easing the.

Survey on Face Recognition Using Laplacian faces
1Student, Pune University, Computer Department, K J College Of Engineering and Management Research. Pune .... Orlando, Florida, 2002, pp.3644-3647.

Survey on Face Recognition Using Laplacian faces - International ...
Abstract. The face recognition is quite interesting subject if we see in terms of security. A system can recognize and catch criminals and terrorists in a crowd. The proponents of large-scale face recognition feel that it is a necessary evil to make

RFID Based Face Attendance System RFID Based Face ... - IJRIT
This paper describes the development of a employee attendance system .... One can increase both the false alarm rate and positive hit rate by decreasing the.

SURF-Face: Face Recognition Under Viewpoint ...
A grid-based and dense extraction of local features in combination with a block-based matching ... Usually, a main drawback of an interest point based fea- .... navente at the Computer Vision Center of the University of Barcelona [13].

SURF-Face: Face Recognition Under Viewpoint ...
Human Language Technology and ..... In CVPR, Miami, FL, USA, June. 2009. ... In International Conference on Automatic Face and Gesture Recognition,. 2002.

Rapid Face Recognition Using Hashing
cal analysis on the recognition rate of the proposed hashing approach. Experiments ... of the images as well as the large number of training data. Typically, face ...