An Elliptical Boundary Model for Skin Color Detection Jae Y. Lee and Suk I. Yoo School of Computer Science and Engineering, Seoul National University Shilim-Dong, Gwanak-Gu, Seoul 151-742, Korea

Abstract Automatic skin detection has been intensively studied for human-related recognition systems. A Gaussian model for skin detection is a well-known approach with its simplicity and generality. In this paper, we discuss limitations of the Gaussian model and suggest a new statistical color model for skin detection, called an elliptical boundary model. The suggested model overcomes the limitations of the Gaussian model with better performance. From the experiment performed on six chrominance spaces, it gives the much higher correct detection ratio than a single Gaussian model. As compared to the mixture of Gaussian model suggested to improve the correct detection ratio of a single Gaussian at the expense of slower speed, the elliptical boundary model still gives the better correct detection ratio with faster speed.

Keywords: skin detection, skin color model, elliptical boundary model, Gaussian model G

1. Introduction Many research results on automatic skin detection have been published [1, 2 ,3, 4, 5, 6, 7, 8, 9]. The well-known statistical color models to estimate skin density in chrominance space may be the single Gaussian model [4, 5, 6], the mixture of Gaussian model [8, 9], and histograms [2, 3]. The histogram model based on local approximation is simple and fast but becomes effective only when training data is sufficiently large to be dense. Moreover, it requires additional memory to keep the histograms. The Gaussian model based on the global feature of distribution has the advantage of generality. Specially a single Gaussian model is simple and fast. It

however does not adequately represent the variance of the skin distribution occurred in the situation where illumination condition varies. To overcome this drawback, the mixture of the Gaussian model has been suggested. It is however hard to be trained and slow. In this paper, we suggest a new statistical color model for skin detection, called an elliptical boundary model. This elliptical boundary model can be easily constructed from training data in a fast speed and its performance is better than both the single and the mixture of Gaussian model.

2. Chrominance Histograms G To devise the appropriate model for skin detection, we investigate the characteristics of skin and non-skin distributions in terms of chrominance histograms. The normalized 2D chrominance space is derived from the original 3D color space by eliminating intensity [4]. The skin and non-skin chrominance histograms are then constructed from 2,000 skin images and 4,000 non-skin images in Compaq database which contains 4,675 skin images and 8,965 non-skin images randomly collected from the World Wide Web [2]. Histograms are computed with 400 Ý400 bin resolution in six different chrominance spaces, r-g, CIE-u*v*, CIEa*b*, CIE-xy, I-Q, and Cb-Cr. From these histograms, the following three characteristics have been observed. For the lack of space, we present histograms in two, r-g and CIE-u*v*, of the six chrominance spaces, in Figure 1 and Figure 2. First, most non-skin chrominances

(a)

(b)

Figure 1. Non-skin chrominance histograms. (a) r-g space. (b) CIE-u*v* space.

like a normal distribution but with a skew peak toward the gray point. It is because the dark skin color is unstable and diffuses toward red over the wide surface area [7]. We can observe this asymmetry more clearly with a side view of histogram in Figure 3. Any color model for skin detection should thus reflect these characteristics to give the better performance. In the next section we briefly review the Gaussian models and discuss its limitations.

3. Gaussian Models G

3.1 A Single Gaussian Model (a)

(b)

Figure 2. Skin chrominance histograms. Each cross mark denotes gray point. (a) r-g space. (b) CIE-u*v* space.

Assuming that the skin density is modeled by a single Gaussian, the skin likelihood of an input chrominance vector X is given by

P( X ) = where λ defined as

2π Λ

1/ 2

 1  exp − λ2   2 

T

(b)

Figure 3. (a) Skin histogram in r-g chrominance space. (b) A side view of (a). N denotes non-skin density peak and S denotes skin density peak.

concentrate on one point, so called a gray point (r=g=1/3 in the r-g space, and u*=v*=0 in the CIE-u*v* space). It implies that the performance of an object detection system based on color information strongly depends on the degree of overlapping between object color area and the gray point area in chrominance space. Secondly, skin cluster is simple-shaped despite of the unconstrained nature of web images. This is due to both color normalization and a large amount of data. Finally, skin density looks

(1)

is the Mahalanobis distance

λ2 = ( X − µ ) Λ−1 ( X − µ )

N S/

(a)

1

(2)

The mean vector µ and covariance matrix Λ of the density are estimated from a training set. Given threshold λT2 , X is classified as skin chrominance if λ2 < λT2 and as non-skin chrominance otherwise. The inequality λ2 < λT2 defines an elliptical area whose center is given by µ and whose principal axes are by Λ . This area however does not meet tightly the skin chrominance distribution since µ is skewed toward the gray point as mentioned in section 2. Moreover it contains the gray point, which results in large false detections. Figure 4 illustrates this situation.

3.2 A Mixture of Gaussian Model In a mixture of Gaussian suggested to overcome the limitation of a single Gaussian,

(a)

model is trained from a set of training data in two steps, preprocessing and parameter estimation. In preprocessing step we remove outliers so that the trained model reflects the main density of the underlying data set. In parameter estimation step we estimate model parameters from the preprocessed data set.

(b)

Figure 4. Class boundaries formed from a single Gaussian model with 95% skin detection rate. (a) r-g chrominance space. (b) CIE-u*v* chrominance space.

the skin likelihood of an input chrominance vector X is given by t

P ( X ) = ∑ wi g ( X ; µ i , Λ i )

(3)

i =1

where g ( X ; µ , Λ ) is a 2-dimensional Gaussian density with mean vector µ and covariance matrix Λ , and the wi are the mixing parameters of the Gaussians meeting Σwi = 1 . These model parameters can be estimated by means of the ExpectationMaximization (EM) algorithm [10]. Although the mixture of Gaussian can model arbitrarily complex densities, it is computationally expensive in training and evaluation: The mixture model is hard to be trained since the EM algorithm needs very high time complexity and the skin detection takes long time since each of the Gaussians used for the mixture model should be evaluated to generate the skin likelihood of the chrominance [2]. In the next section we suggest a new statistical color model which may overcome these limitations.

4. An Elliptical Boundary Model The skin density is simple-shaped as discussed in section 2 and the skin area in each chrominance space fits well an ellipse. This can be observed visually from Figure 2. With this observation, we suggest a new statistical color model for skin detection, called an elliptical boundary model. This

1) Training Data Set: Initially training data set consists of skin chrominance samples. 2) Preprocessing: Outliers are removed by eliminating k % sample data from the training set which have low frequency. The value of k , where 0≤ k ≤5, is determined by the amount of noise and negligible data in the training set. 3) Parameter Estimation: Let X 1 ,..., X n be all the distinctive chrominance vectors of the preprocessed training data set and f ( X i ) = f i ( i =1,…, n ) be the number of samples with chrominance value of X i . An

elliptical boundary model Φ = ( X ;ψ , Λ ) is then defined as

Φ ( X ) = [X − ψ ] Λ−1 [X − ψ ] T

(4)

where the two model parameters, ψ and Λ are given by

1 n ∑ Xi n i =1 1 n T Λ = ∑ f i ( X i − µ )( X i − µ ) N i =1

ψ =

and where N =

(5) (6)

n

∑f i =1

is the total number of

i

samples in the preprocessed training data set and the vector µ =

1 N

n

∑f X i =1

i

i

is the mean

of chrominance vectors.

θ and input Given threshold chrominance X of a pixel, X is classified as skin chrominance if Φ ( X ) < θ and as non-skin chrominance otherwise. Note that the value of threshold θ trades off

(b)

Skin Detection Rate

Skin Detection Rate

(a)

Elliptical Boundary Single Gaussian Mixture of Gaussian

False Detection Rate

5. Experimental Results In this section, we compare the performance of the elliptical boundary model to those of the single and the mixture of Gaussian model. Each model has been trained from 2,000 skin images and tested by 2,000 skin and 4,000 non-skin images from Compaq database [2]. The test skin images are distinct from the training ones. In training the elliptical model, the value of k for removing outliers is set to 1. The mixture model constructed from six Gaussians has been trained using the EM algorithm [10]. Figure 6 shows the ratios of correct detection and false detection for each of the three models in six different chrominance

Skin Detection Rate

Elliptical Boundary Single Gaussian Mixture of Gaussian False Detection Rate

Elliptical Boundary Single Gaussian Mixture of Gaussian False Detection Rate

Elliptical Boundary Single Gaussian Mixture of Gaussian False Detection Rate

Skin Detection Rate

Skin Detection Rate

correct detections by false detections. As the value of threshold increases, the number of correct detections increases but the number of false detections also does. The equality Φ ( X ) = θ defines an elliptical boundary between skin chrominance and non-skin chrominance whose center is given by ψ and whose principal axes are by Λ . This boundary meets well the skin chrominance distribution as shown from Figure 5. It includes most of skin chrominance area and the least non-skin chrominance area especially near the gray point. Note that this model is not affected by the degree of skew of skin distribution since ψ is the center of distribution area computed from distinctive data not counting their frequencies.

Skin Detection Rate

False Detection Rate

Figure 5. Class boundaries formed from an elliptical boundary model with 95% skin detection rate. (a) r-g chrominance space. (b) CIE-u*v* chrominance space.

Elliptical Boundary Single Gaussian Mixture of Gaussian

Elliptical Boundary Single Gaussian Mixture of Gaussian False Detection Rate

Figure 6. ROC curves in six different chrominance spaces for elliptical boundary model, single Gaussian model, and mixture of Gaussian model. From top to bottom and left to right: r-g, CIE-a*b*, CIE-xy, CIEu*v*, Cb-Cr, and I-Q spaces.

spaces, r-g, CIE-u*v*, CIE-a*b*, CIE-xy, I-Q, and Cb-Cr. We can see that the elliptical boundary model gives the best performance in every chrominance space. As expected, the single Gaussian gives the worst performance. The mixture of Gaussian gives better performance than that of the single Gaussian but worse than that of the elliptical model, in average, as shown in Table 1. In terms of time complexity, the suggested elliptical model is as fast as the single Gaussian but much faster than the mixture of Gaussian: In order to classify one pixel, the elliptical model needs evaluate Equation (4) only and the single Gaussian model needs Equation (2). The mixture model however needs evaluate Equation (3)

Table 1. Selected ratios of correct detection and false detection. SD is skin detection rate and FD is false detection rate. Elliptical Boundary Model

FD

SD Average r-g CIE-a*b* CIE-xy CIE-u*v* Cb-Cr I-Q

90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0%

23.3% 35.7% 21.3% 32.4% 25.3% 37.0% 20.9% 31.2% 22.3% 34.0% 25.0% 39.7% 25.0% 39.7%

Single Gaussian Model

FD

SD 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0%

47.0% 67.8% 54.4% 68.5% 52.0% 71.1% 58.5% 72.2% 50.4% 69.7% 33.3% 62.7% 33.3% 62.7%

which is much more complicated and timeconsuming than Equation (4).

[2]

6. Conclusions

[3]

As exhibited in the experiments, the elliptical boundary model outperforms both the single Gaussian model and the mixture of Gaussian model in six chrominance spaces. Moreover, it is simple and fast. One drawback of the suggested model is that its usage is limited to binary classification, where the continuous information given by probability density function is not preserved.

Acknowledgement This work was supported partially by the BK21 project and by the project, RIACT 04212000-0008.

[4]

[5]

[6]

[7]

References [1]

R. Feraud, O.J. Bernier, J.-E. Viallet, and M. Collobert, “A Fast and Accurate Face Detector Based on Neural Networks,” IEEE Trans. Pattern Analysis and Machine Intelligence,

[8]

Gaussian Mixture Model

SD 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0% 90.0% 95.0%

FD 38.4% 47.8% 34.3% 39.8% 41.4% 52.1% 42.4% 52.0% 45.2% 52.6% 37.1% 52.0% 30.0% 38.0%

2001. M.J. Jones and J.M. Rehg, “Statistical Color Models with Application to Skin Detection,” Proc. CVPR, 1999. H. Wu, Q. Chen, and M. Yachida, “Face Detection From Color Images Using a Fuzzy Pattern Matching Method,” IEEE Trans. Pattern Analysis and Machine Intelligence, 1999. J. Yang, W. Lu, and A. Waibel, “SkinColor Modeling and Adaptation,” Proc. ACCV, 1998. B. Menser and F. Muller, “Face Detection in Color Images Using Principal Component Analysis,” Proc. Image Processing and its Applications, 1999. L. Fan and K.K. Sung, “Face Detection and Pose Alignment Using Colour, Shape and Texture Information,” Proc. Visual Surveillance, 2000. J.-C. Terrillon, M.N. Shirazi, H. Fukamachi, and S. Akamatsu, “Comparative Performance of Different Skin Chrominance Models and Chrominance Spaces for the Automatic detection of Human Faces in Color Images,” Proc. Automatic Face and Gesture Recognition, 2000. Y. Raja, S.J. McKenna, and S. Gong,

“Tracking and Segmenting People in Varying Lighting Conditions Using Colour,” Proc. Automatic Face and Gesture Recognition, 1998. [9] T.S. Jebara and A. Pentland, “Parameterized Structure from Motion for 3D Adaptive Feedback Tracking of Faces,” Proc. CVPR, 1997. [10] R.A. Redner and H.F. Walker, “Mixture Densities, Maximum Likelihood, and the EM Algorithm,” SIAM Review, 1984.

An Elliptical Boundary Model for Skin Color Detection - CiteSeerX

... Y. Lee and Suk I. Yoo. School of Computer Science and Engineering, Seoul National University ... depends on the degree of overlapping between object color ...

151KB Sizes 3 Downloads 257 Views

Recommend Documents

A Formal Study of Shot Boundary Detection - CiteSeerX
Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. ... not obtained by thresholding schemes but by machine learning ...... working toward the M.S. degree in the Department of Computer ...

A Formal Study of Shot Boundary Detection - CiteSeerX
Based on the comparison of the existing approaches, optimal criteria for ... in computer perfor- mance and the growth of the Internet, have led to the widespread.

An Active Contour Model for Spectrogram Track Detection
Oct 26, 2009 - Department of Computer Science, University of York, Heslington, .... for Q = UT Vx(t)y(t) − µ where µ and Σ the mean and standard deviation of.

An Optimization Model for Outlier Detection in ...
Department of Computer Science and Engineering, ... small subset of target dataset such that the degree of disorder of the resultant dataset after the removal ... Previous researches on outlier detection broadly fall into the following categories.

An Active Contour Model for Spectrogram Track Detection
Oct 26, 2009 - feature detection and an analysis of the algorithm's complexity. 2.1. The Active ..... Vol. 57 of. Advances in Intelligent and Soft Computing.

Semantic-Shift for Unsupervised Object Detection - CiteSeerX
notated images for constructing a supervised image under- standing system. .... the same way as in learning, but now keeping the factors. P(wj|zk) ... sponds to the foreground object as zFG and call it the fore- ..... In European Conference on.

Profile Injection Attack Detection for Securing ... - CiteSeerX
to Bamshad Mobasher for inspiring and encouraging me to pursue an academic career in computer science. His thoroughness and promptness in reviewing my ...

Profile Injection Attack Detection for Securing ... - CiteSeerX
6.4.2 Robustness Comparison Against Nuke Attacks . . . . . . . . . . . . . . 34. 7 Defense ..... As a result, we have focused on profile analytics data and attribute ...... Data Mining: Practical machine learning tools and techniques, 2nd. Edition. M

Concept Boundary Detection for Speeding up SVMs
Abstract. Support Vector Machines (SVMs) suffer from an O(n2) training cost, where n denotes the number of training instances. In this paper, we propose an algorithm to select boundary instances as training data to substantially re- duce n. Our propo

Face Detection Algorithm based on Skin Detection ...
systems that analyze the information contained in face images ... characteristics such as skin color, whose analysis turns out to ..... software version 7.11.0.584.

Face Detection Using Skin Likelihood for Digital Video ...
This project is based on self-organizing mixture network (SOMN) & skin color ... develop accurate and robust models for image data, then use the Gaussian ...

A Model for Cross Boundary Collaboration
problems and practices need an open development software environment allowing ... for the collaborative design of iPhone applications as a proof of concept for ...

Internal boundary layer model for the evolution of ...
Feb 5, 2012 - and ranging (lidar) data we computed z02 =10−2 m (Supplementary ... portantly, the analytical expression was not fit to sand flux data; all ...

A Model for Cross Boundary Collaboration
ABSTRACT. As designing software systems becomes more complex, the involvement .... work, I will focus on visualizing the life cycle of communication activities ...

A Wall Boundary Computation Model by Polygons for ...
Figure 2: Comparison of the height and the leading coordinate of the water. They show the comparison of the height of the water and the comparison of the ...

A Model for Cross Boundary Collaboration
proof of concept for this conceptual model. The reflective meta-wiki will explore the use of boundary objects to facilitate participation and communication in the.

Pattern recognition techniques for automatic detection of ... - CiteSeerX
Computer-aided diagnosis;. Machine learning. Summary We have employed two pattern recognition methods used commonly for face recognition in order to analyse digital mammograms. ..... should have values near 1.0 on the main diagonal,. i.e., for true .

Variational Restoration and Edge Detection for Color ... - CS Technion
computer vision literature. Keywords: color ... [0, 1]3) and add to it Gaussian noise with zero mean and standard .... years, both theoretically and practically; in particular, ...... and the D.Sc. degree in 1995 from the Technion—Israel Institute.

Variational Restoration and Edge Detection for Color ...
subdivided into three classes: low-, intermediate- and high-level ... and of course the edge detector performance is very poor (d). ..... (See Fig. 4 for an illustration.

Variational Restoration and Edge Detection for Color ... - CS Technion
Mumford-Shah functional (that will be the main theme of this work) can be ... [0, 1]3) and add to it Gaussian noise with zero mean and standard deviation 0.8 (Fig.

Mobile Phone Anomalous Behaviour Detection for Real ... - CiteSeerX
Keywords: Mobile device forensics, Android, information ..... 10. CYBERLAWS 2011 : The Second International Conference on Technical and Legal Aspects of ...

Intelligent Mobile Agent for Intrusion Detection System - CiteSeerX
a finished intelligent tool prototype for intrusion detection. Intrusion ..... They receive alerts from correlator agents, analyse ... Monitoring and Surveillance.

Texture Detection for Segmentation of Iris Images - CiteSeerX
Asheer Kasar Bachoo, School of Computer Science, University of Kwa-Zulu Natal, ..... than 1 (called the fuzzification factor). uij is the degree of membership of xi.