JE – 832

*JE832*

VII Semester B.E. (CSE/ISE) Degree Examination, June/July 2013 (2K6 Scheme) CI – 7.2 : DATA MINING AND ALGORITHMS Time : 3 Hours

Max. Marks : 100

Instruction : Answer any five questions, selecting atleast 2 questions from each Part. PART – A 1. a) Explain the various steps involved in Knowledge Discovery in Databases with a neat block diagram.

8

b) Explain the following : i) Multimedia database ii) Web database iii) Spatial database.

6

c) List out some of the major challenges in data mining.

6

2. a) What is data preprocessing ? Explain its importance.

6

b) Why is data cleaning done ? List out the methods for cleaning the data.

6

c) What are the different data transformation techniques ? Give any three methods for data normalization.

8

3. a) What is a data warehouse ? Explain the 3-tier architecture of a data warehouse. b) Define Association Rule Mining. Explain the various types of Association Rules.

10 10

4. a) Using Apriori algorithm, findout the Large itemsets for the database shown below, taking the minimum support as 2 TID Items bought 1

M, O, N, K, E, Y

2

D, O, N, K, E, Y

3

M, A, K, E

4

M, U, C, K, Y

5

C, O, O, K, I, E.

10 P.T.O.

JE – 832

*JE832*

-2-

b) Generate the frequent itemsets for the database shown below using Frequent Pattern tree. Take the minimum support as 2. TID

Items

1

a, b

2

b, c, d

3

a, c, d, e

4

a, d, e

5

a, b, c

6

a, b, c, d

10 PART – B

5. a) With a neat diagram, explain the process of classification.

6

b) Define a Decision tree and discuss some of the issues in constructing a decision tree.

6

c) For the training set shown below, construct a Decision Tree. Training set for classifying mammals/non-mammals Name

Body-Temperature Gives Birth

Four legged

Hibernates Class Label

Salamander Cold-Blooded

no

yes

yes

no

Guppy

Cold-Blooded

yes

no

no

no

Eagle

Warm-Blooded

no

no

no

no

Poorwill

Warm-Blooded

no

no

yes

no

Platypus

Warm-Blooded

no

yes

yes

yes

6. a) With a neat diagram, explain how Neural Networks can be used for classification.

8

b) With an example, explain Bayesian Belief Networks.

8

c) Compare Lazy learners with Eager Learners.

4

*JE832*

-3-

JE – 832

7. a) Explain the important features of a Good Clustering Algorithm. b) Discuss the various types of data on which clustering can be done.

6 6

c) Using k-means algorithm, cluster the following eight points (with (x,y)) into three clusters. A1(2,10), A2 (2, 5), A3 (8,4) B1(5,8), B2(7,5), B3(6,4) C1(1, 2), C 2(4,9)

8

8. a) Classify the various clustering algorithms.

8

b) Describe each of the following clustering algorithms in terms of the following criteria i) Shapes of clusters that can be determined ii) Input parameters that must be specified iii) Limitations. i) K-means ii) k-medoids iii) DBScan iv) Sting v) BIRCH

12 ________

DATA MINING AND ALGORITHMS.pdf

iv) Sting. v) BIRCH 12. ______. Whoops! There was a problem loading this page. DATA MINING AND ALGORITHMS.pdf. DATA MINING AND ALGORITHMS.pdf.

443KB Sizes 3 Downloads 151 Views

Recommend Documents

Data Warehouse and Data Mining Technology Data ...
IJRIT International Journal of Research in Information Technology, Vol. 1, Issue 2, February ... impact, relevance and need in Enterpr relevance and ... The data that is used in current business domains is not accurate, complete and precise.

Data Mining Approach, Data Mining Cycle
The University of Finance and Administration, Prague, Czech Republic. Reviewers: Mgr. Veronika ... Curriculum Studies Research Group and who has been in the course of many years the intellectual co-promoter of ..... Implemented Curriculum-1 of Radiol

Data Mining: Current and Future Applications - IJRIT
Artificial neural networks: Non-linear predictive models that learn through training ..... Semi-supervised learning and social network analysis are other methods ...

data mining and warehousing pdf
data mining and warehousing pdf. data mining and warehousing pdf. Open. Extract. Open with. Sign In. Main menu. Displaying data mining and warehousing ...

Data Mining: Current and Future Applications - IJRIT
(KDD), often called data mining, aims at the discovery of useful information from ..... Advanced analysis of data for extracting useful knowledge is the next natural ...

R and Data Mining
This book introduces into using R for data mining. It presents many examples of various data mining functionalities in R and three case studies of real world applications. The supposed audience of this book are postgraduate students, researchers and

Data Mining: Current and Future Applications - IJRIT
Language. (SQL). Oracle, Sybase,. Informix, IBM,. Microsoft. Retrospective, dynamic data delivery at record level. Data Warehousing. &. Decision Support. (1990s). "What were unit sales in. New England last. March? Drill down to. Boston. On-line analy

Data mining and education -
Overview. Data mining and education. Kenneth R. Koedinger,1∗ Sidney D'Mello,2 .... In both cases, a desirable final step ... Overview wires.wiley.com/cogsci. TABLE 1 Simplified Sample of Data Used in the KDD ..... developed such as LFA,56 Rule Spac

Mining Software Engineering Data
Apr 9, 1993 - To Change. Consult. Guru for. Advice. New Req., Bug Fix. “How does a change in one source code entity propagate to other entities?” No More.

Data mining and education -
An emerging field of educational data mining (EDM) is building on and ... ing system, and (4) how machine learning techniques applied to discussion data.

what is data mining and data warehousing pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. what is data ...

data mining and data warehousing pdf
data mining and data warehousing pdf. data mining and data warehousing pdf. Open. Extract. Open with. Sign In. Main menu. Displaying data mining and data ...

Review on Data Warehouse, Data Mining and OLAP Technology - IJRIT
An OLAP is market-oriented which is used for data analysis by knowledge ..... The data warehouse environment supports the entire decision. Database. Source.

Review on Data Warehouse, Data Mining and OLAP Technology - IJRIT
used for transactions and query processing by clerks, clients. An OLAP is market-oriented which is used for data analysis by knowledge employees, including ...

data warehousing and data mining pdf free download
data warehousing and data mining pdf free download. data warehousing and data mining pdf free download. Open. Extract. Open with. Sign In. Main menu.