Automatic Wood Defect Classification Lulu He [email protected] 1.

INTRODUCTION

Lumber boards often suffer from defects like knots. So before they can be turned into durables, different kinds of wood knots have to be identified and treated accordingly. Identifying wood defects by visual inspection is an open question and even human graders fail to achieve high accuracy. So an interesting question is whether this can be automated using computers. This problem seems to fall in the broad category of image similarity matching or image classification. Considering the nature of the images (see Fig 1), it seems more appropriate to use low level features like color and texture. The work in [2] attempts to detect wood defects in large lumber boards. It uses color features along with SOM clustering for the task. In our work we attempt the more difficult problem of differentiating between defects of different kinds. Specifically we would like to classify a given wood knot image as dry, encased or sound. Section 2 describes our approach and section 3 presents our experimental evaluation.

2.

APPROACH

Our task is to classify a wood defect image into one of the several possible defect classes. We use a bag of words approach framework for the classification task. The following sections describe our approach in detail.

2.1

Descriptors

In this paper we try out three different descriptors: color, shape and texture. Such descriptors have been shown to be useful in tasks like image similarity matching and content based image retrieval.

Neelabh Gupta [email protected] Each image can now be represented by a histogram of words. Given a global vocabulary of size m, the ith entry of a histogram is the proportion of all descriptors in the image having label i. To compare two histograms

H 1=u1, u 2. .. u m  , H 2= w1, w2. .. w m  we use the



2

distance defined as:

2

2

  H 1 , H 2 =0.5  i u i−wi  /ui w i

2.3

Kernel-Based Classification

For classification, we use Support Vector Machines (SVM). To incorporate 2 distance into the SVM framework, we use extended Gaussian kernels[3]: 2

K  H i , H j =exp −  H i , H j / A  The resulting kernel is the

2

kernel.

We use the one versus all approach to build a 3 class SVM.

3.

EXPERIMENTAL EVALUATION

We use an online dataset of wood knot images (http://www.ee.oulu.fi/research/imag/knots/KNOTS/). Our dataset has a total of 90 images of three different classes of defects. The classes are dry_knot, encased_knot and sound_knot. See Fig. 1 for some examples.

The HSV values of each pixel is used as the color feature. Edge orientation histograms are used as the shape descriptor. A given image is divided into overlapping rectangular blocks. After evaluating the edges for each rectangular patch, we build a histogram of edge orientation directions. For instance we quantize the edge orientations into 8 bins, and build a histogram where each bin count is weighted by its edge magnitude. For describing texture we convolve the image with a bank of filters, in our case the LM filter banks[1].

2.2 Comparing distribution of local descriptors After evaluating the descriptors as described in the previous section we need to represent their distributions in the training and testing images. To do this we obtain a global visual vocabulary by clustering descriptors from the training set. We use k-means to form our clusters. The cluster centers now represent the words in our vocabulary.

Fig. 1

Different kinds of wood knots

The dataset is divided into two sets for training and testing using three-fold cross validation. Twenty images from each class for training and ten from each for testing are used. Since it is a three class problem, we use the average class accuracy as our performance measure.

All the three descriptors are evaluated separately. A vocabulary is built for each and then tested separately. Fig. 2 and Fig. 3 show how the results are affected by the vocabulary size. Fig. 4 shows that the non-linear 2 kernel outperforms the linear kernel. We obtain the best accuracy of 67.78% for the texture features. Feature

Class Accuracy

HSV

66.00%

Edge orientations

52.00%

Texture

67.78%

[4] Matlab codes for Leung-Malik filter bank. http://www.robots.ox.ac.uk/~vgg/research/texclass/filters.htm l [5] Matlab Support Vector Machine Toolbox. http://www.isis.ecs.soton.ac.uk/resources/svminfo/

0.53

Table 1 Best Average Class Accuracy obtained for different features vs. a chance rate of 33.33%

4.

[3] Zhang, J., Marszalek, M., Lazebnik, S. and Schmid,C. Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision, 73(2): 213-238, 2007. http://www.springerlink.com/content/c62x2vr4v1262751/full text.pdf

0.52

0.52 0.51

CONCLUSION

The results are encouraging and in future evaluation with a more extensive dataset can be carried out. Evaluation of the incorrect classification showed that it was more difficult to differentiate between dry and encased knots. This could be resolved by combining several features like texture and shape together. Another issue would be to take the scale changes also into account.

0.5

0.5

0.49

0.49

K = 30 K = 40 K = 50

0.48 0.47 Average Class Accuracy

0.66

Fig. 3

0.66

Effect of vocabulary size(K) on performance

0.66 0.65 0.65

for shape features 0.64

0.64

0.64

0.63

0.64

K=25 K=50 K=100 K=200

0.63

0.6

0.52

0.5

0.63

0.4

0.4

0.62 Average Class Accuracy

Fig. 2

Effect of vocabulary size(K) on performance for HSV features

5.

REFERENCES

[1] Malik, J. , Belongie, S. , Shi, J. and Leung, T .Textons, contours and regions: cue combination in image segmentation. Proceedings of the International Conference on Computer Vision, 1999. http://ieeexplore.ieee.org/stamp/stamp.jsp? arnumber=00790346 [2] Silven, O., Niskanen, M., Kauppinen, H. 2000. Wood Inspection with Non-supervised Clustering. COST action E10 Workshop – Wood properties for industrial use, Espoo, Finland, June 18-22. DOI= http://www.ee.oulu.fi/~olli/Projects/COSTE10.20.6.2000.pdf

0.3

X² Kernel Linear Kernel

0.2 0.1 0 Average Class Accuracy

Fig. 4



2

kernel outperforms the linear kernel.

The results shown are for shape features.

Proceedings Template - WORD

our vocabulary. Each image can now be represented by a histogram of words. ... The results are encouraging and in future evaluation with a more extensive ...

209KB Sizes 0 Downloads 224 Views

Recommend Documents

Proceedings Template - WORD
This paper presents a System for Early Analysis of SoCs (SEAS) .... converted to a SystemC program which has constructor calls for ... cores contain more critical connections, such as high-speed IOs, ... At this early stage, the typical way to.

Proceedings Template - WORD - PDFKUL.COM
multimedia authoring system dedicated to end-users aims at facilitating multimedia documents creation. ... LimSee3 [7] is a generic tool (or platform) for editing multimedia documents and as such it provides several .... produced with an XSLT transfo

Proceedings Template - WORD
Through the use of crowdsourcing services like. Amazon's Mechanical ...... improving data quality and data mining using multiple, noisy labelers. In KDD 2008.

Proceedings Template - WORD
software such as Adobe Flash Creative Suite 3, SwiSH, ... after a course, to create a fully synchronized multimedia ... of on-line viewable course presentations.

Proceedings Template - WORD
We propose to address the problem of encouraging ... Topic: A friend of yours insists that you must only buy and .... Information Seeking Behavior on the Web.

Proceedings Template - WORD
10, 11]. Dialogic instruction involves fewer teacher questions and ... achievment [1, 3, 10]. ..... system) 2.0: A Windows laptop computer system for the in-.

Proceedings Template - WORD
Universal Hash Function has over other classes of Hash function. ..... O PG. O nPG. O MG. M. +. +. +. = +. 4. CONCLUSIONS. As stated by the results in the ... 1023–1030,. [4] Mitchell, M. An Introduction to Genetic Algorithms. MIT. Press, 2005.

Proceedings Template - WORD
As any heuristic implicitly sequences the input when it reads data, the presentation captures ... Pushing this idea further, a heuristic h is a mapping from one.

Proceedings Template - WORD
Experimental results on the datasets of TREC web track, OSHUMED, and a commercial web search ..... TREC data, since OHSUMED is a text document collection without hyperlink. ..... Knowledge Discovery and Data Mining (KDD), ACM.

Proceedings Template - WORD
685 Education Sciences. Madison WI, 53706-1475 [email protected] ... student engagement [11] and improve student achievement [24]. However, the quality of implementation of dialogic ..... for Knowledge Analysis (WEKA) [9] an open source data min

Proceedings Template - WORD
presented an image of a historical document and are asked to transcribe selected fields thereof. FSI has over 100,000 volunteer annotators and a large associated infrastructure of personnel and hardware for managing the crowd sourcing. FSI annotators

Proceedings Template - WORD
has existed for over a century and is routinely used in business and academia .... Administration ..... specifics of the data sources are outline in Appendix A. This.

Proceedings Template - WORD
the technical system, the users, their tasks and organizational con- ..... HTML editor employee. HTML file. Figure 2: Simple example of the SeeMe notation. 352 ...

Proceedings Template - WORD
Dept. of Computer Science. University of Vermont. Burlington, VT 05405. 802-656-9116 [email protected]. Margaret J. Eppstein. Dept. of Computer Science. University of Vermont. Burlington, VT 05405. 802-656-1918. [email protected]. ABSTRACT. T

Proceedings Template - WORD
Mar 25, 2011 - RFID. 10 IDOC with cryptic names & XSDs with long names. CRM. 8. IDOC & XSDs with long ... partners to the Joint Automotive Industry standard. The correct .... Informationsintegration in Service-Architekturen. [16] Rahm, E.

Proceedings Template - WORD
Jun 18, 2012 - such as social networks, micro-blogs, protein-protein interactions, and the .... the level-synchronized BFS are explained in [2][3]. Algorithm I: ...

Proceedings Template - WORD
information beyond their own contacts such as business services. We propose tagging contacts and sharing the tags with one's social network as a solution to ...

Proceedings Template - WORD
accounting for the gap. There was no ... source computer vision software library, was used to isolate the red balloon from the ..... D'Mello, S. et al. 2016. Attending to Attention: Detecting and Combating Mind Wandering during Computerized.

Proceedings Template - WORD
fitness function based on the ReliefF data mining algorithm. Preliminary results from ... the approach to larger data sets and to lower heritabilities. Categories and ...

Proceedings Template - WORD
non-Linux user with Opera non-Linux user with FireFox. Linux user ... The click chain model is introduced by F. Guo et al.[15]. It differs from the original cascade ...

Proceedings Template - WORD
temporal resolution between satellite sensor data, the need to establish ... Algorithms, Design. Keywords ..... cyclone events to analyze and visualize. On the ...

Proceedings Template - WORD
Many software projects use dezvelopment support systems such as bug tracking ... hosting service such as sourceforge.net that can be used at no fee. In case of ...

Proceedings Template - WORD
access speed(for the time being), small screen, and personal holding. ... that implement the WAP specification, like mobile phones. It is simpler and more widely ...

Proceedings Template - WORD
effectiveness of the VSE compare to Google is evaluated. The VSE ... provider. Hence, the VSE is a visualized layer built on top of Google as a search interface with which the user interacts .... Lexical Operators to Improve Internet Searches.