Data Mining Questions and Answers – What Kind of Data can be Mined

This set of Data Mining Multiple Choice Questions & Answers (MCQs) focuses on “What Kind of Data can be Mined”.

1. Which type of data is not well suited to mining?
a) Database data
b) Data warehouse data
c) File System data
d) Transactional data

Answer: c
Explanation: File system data is not well suited to mining because files cannot be easily retrieved or accessed unless their exact location is known. Database data, data warehouse data, and transactional data, on the other hand, are easily accessed and appropriate for mining.

2. Which form of data is not appropriate for mining?
a) World Wide Web data (WWW)
b) Multimedia data
c) Binary data
d) Graph data

Answer: c
Explanation: Raw binary data is not appropriate for mining because it is just a sequence of 0s and 1s that is difficult to interpret, which increases the complexity of mining. WWW, multimedia, and graph data can all be mined, for example with web mining, multimedia mining, and graph mining techniques.

3. A database management system consists of software programs. Below are functions these programs ought to perform. Pick the wrong one.
a) Manage Redundancy
b) Maintain Consistency
c) Maintain Security
d) Manage Concurrency

Answer: a
Explanation: In this question, managing redundancy is treated as a task for the database administrator, who controls it through schema design (for example, by choosing primary keys appropriately). Maintaining consistency, security, and concurrency are among the core functions of a DBMS.

4. Which of the following attributes is irrelevant when mining for the credit risk of a customer?
a) Age
b) Address
c) Previous Credit Information
d) Income

Answer: b
Explanation: Address is the irrelevant attribute here, as it does not help in determining a customer's credit risk. Address might be useful if the mining were done to market a new product of the company. Age, previous credit information, and income can all help determine a customer's credit risk.

5. Customer (cust_ID, name, address, age, occupation). Given that Customer is a table, the items within the parentheses are known as ______ in RDBMS terminology.
a) Primary keys
b) Attributes
c) Tuples
d) Unique keys

Answer: b
Explanation: The items within the parentheses are the attributes (columns) of the Customer table. cust_ID is the primary key. Tuples are the rows of the table. A unique key also enforces uniqueness on a column but, unlike a primary key, may allow null values.

6. Which of the following is not a relational operation in SQL?
a) Selection
b) Projection
c) Join
d) Addition

Answer: d
Explanation: Addition is an arithmetic operation, not a relational one. Selection returns the rows (tuples) that satisfy a given condition. Projection returns only the chosen columns (attributes). Join combines rows from two tables that match on a join condition; it is equivalent to a Cartesian product followed by a selection.
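As a rough illustration of the three relational operations named in the options, the sketch below runs each one against an in-memory SQLite database. The purchase table, the column names other than cust_ID, name, and age, and all the rows are made up for the example.

```python
import sqlite3

# In-memory database with two small, made-up tables (illustrative data only).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (cust_ID INTEGER PRIMARY KEY, name TEXT, age INTEGER);
    CREATE TABLE purchase (trans_ID INTEGER PRIMARY KEY, cust_ID INTEGER, amount REAL);
    INSERT INTO customer VALUES (1, 'Asha', 34), (2, 'Ben', 19), (3, 'Chen', 52);
    INSERT INTO purchase VALUES (100, 1, 25.0), (101, 3, 40.0), (102, 3, 15.0);
""")

# Selection: keep only the rows (tuples) that satisfy a condition.
selection = conn.execute(
    "SELECT * FROM customer WHERE age > 30 ORDER BY cust_ID"
).fetchall()

# Projection: keep only the chosen columns (attributes).
projection = conn.execute(
    "SELECT name FROM customer ORDER BY cust_ID"
).fetchall()

# Join: combine rows from two tables that match on a shared column.
join = conn.execute(
    """
    SELECT c.name, p.amount
    FROM customer AS c
    JOIN purchase AS p ON p.cust_ID = c.cust_ID
    ORDER BY p.trans_ID
    """
).fetchall()

print(selection)
print(projection)
print(join)
```

Selection narrows the rows, projection narrows the columns, and the join pairs each purchase with its customer; none of them performs arithmetic on the values.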

7. Given below is the process which is used in Pattern Extraction. Identify and choose the name of the process.


a) Data Mining
b) Data Warehousing
c) Data Cleaning
d) Data Pre-Processing

Answer: d
Explanation: Data pre-processing is the step that precedes data warehousing; it includes data cleaning, data transformation, and data loading. Data mining, by contrast, is the process of extracting interesting patterns.

8. Which of the following is not an operation in data warehousing?
a) Drill down
b) Roll up
c) Sticking
d) Dice

Answer: c
Explanation: Sticking is not an operation at all; it is a deliberate misspelling of slicing, meant to confuse. Drill down moves to a finer level of detail. Roll up aggregates data to a coarser level of detail. Slice selects a single value on one dimension, and dice selects a sub-cube by restricting two or more dimensions.
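A minimal sketch of these operations, using the usual OLAP definitions and a handful of made-up sales facts held in a plain Python list. The dimensions (year, quarter, city, item) and the helper name aggregate are assumptions chosen for the example, not part of any particular warehouse product.

```python
from collections import defaultdict

# Made-up sales facts: (year, quarter, city, item, units). Illustrative only.
facts = [
    (2023, "Q1", "Pune", "phone", 10),
    (2023, "Q1", "Delhi", "phone", 7),
    (2023, "Q2", "Pune", "laptop", 4),
    (2024, "Q1", "Pune", "phone", 12),
    (2024, "Q1", "Delhi", "laptop", 5),
]

def aggregate(rows, key):
    """Sum the units column, grouped by key(row)."""
    totals = defaultdict(int)
    for row in rows:
        totals[key(row)] += row[4]
    return dict(totals)

# Roll up: aggregate to a coarser level (quarters are summed into years).
by_year = aggregate(facts, lambda r: r[0])

# Drill down: move to a finer level (years are split into quarters).
by_year_quarter = aggregate(facts, lambda r: (r[0], r[1]))

# Slice: fix one dimension to a single value (city = "Pune").
pune_slice = [r for r in facts if r[2] == "Pune"]

# Dice: restrict two or more dimensions (year = 2023 and item = "phone").
diced = [r for r in facts if r[0] == 2023 and r[3] == "phone"]

print(by_year)
print(by_year_quarter)
print(len(pune_slice), len(diced))
```

Roll up and drill down change the level of aggregation, while slice and dice only filter which cells of the cube are kept.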

9. Which kind of data generally goes into Flat files?
a) Database data
b) Data Warehouse data
c) Transactional data
d) File data

Answer: c
Explanation: Transactional data is generally stored in flat files. Database data is stored in tables. Data warehouse data is generally modeled as data cubes. A file system can hold many kinds of files, and each file stores its own type of data; for example, an .exe file stores binary data.

10. There are 4 basic operations that can be performed on any data. Three of them are reading, writing, and deleting. The fourth one is ________
a) Refreshing
b) Reloading
c) Updating
d) Concatenating

Answer: c
Explanation: Updating is the fourth basic operation on data, alongside reading, writing (creating), and deleting. Refreshing and reloading are terms generally used for browsers and websites.
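The four basic operations can be sketched against a tiny in-memory store. The function names and the dictionary-based store below are hypothetical, chosen only to make the distinction between writing and updating concrete.

```python
# A tiny in-memory "store" keyed by record ID; names and values are made up.
store = {}

def create(record_id, value):
    """Write a new record."""
    store[record_id] = value

def read(record_id):
    """Read a record, or return None if it does not exist."""
    return store.get(record_id)

def update(record_id, value):
    """Change an existing record; refuse to create a new one."""
    if record_id not in store:
        raise KeyError(record_id)
    store[record_id] = value

def delete(record_id):
    """Remove a record if it is present."""
    store.pop(record_id, None)

create(1, "draft")
update(1, "final")
result_after_update = read(1)
delete(1)
result_after_delete = read(1)

print(result_after_update, result_after_delete)
```

In this sketch, update differs from create only in that it requires the record to exist already, which is the distinction the question relies on.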

11. How many steps are there in the Knowledge Discovery from Data (KDD) process?
a) 5
b) 7
c) 6
d) 8

Answer: b
Explanation: There are 7 steps in the KDD process: data cleaning, data integration, data selection, data transformation, data mining, pattern evaluation, and knowledge presentation.
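To make the seven steps concrete, here is a toy pipeline in which each step is one small function, applied in the order listed above. The records, field names, and the support threshold are all invented for illustration; a real KDD process would use far richer techniques at every stage.

```python
from collections import Counter

# Two made-up sources of purchase records (illustrative data only).
source_a = [
    {"cust": "c1", "item": "Milk", "qty": 2},
    {"cust": "c2", "item": None, "qty": 1},      # missing item
]
source_b = [
    {"cust": "c1", "item": "bread", "qty": 1},
    {"cust": "c3", "item": "milk", "qty": -4},   # invalid quantity
    {"cust": "c3", "item": "Milk", "qty": 3},
]

def clean(rows):
    # 1. Data cleaning: drop rows with missing or invalid values.
    return [r for r in rows if r["item"] and r["qty"] > 0]

def integrate(*sources):
    # 2. Data integration: combine several sources into one collection.
    return [row for src in sources for row in src]

def select(rows):
    # 3. Data selection: keep only the fields relevant to the task.
    return [{"cust": r["cust"], "item": r["item"]} for r in rows]

def transform(rows):
    # 4. Data transformation: put values into one consistent form.
    return [{"cust": r["cust"], "item": r["item"].lower()} for r in rows]

def mine(rows):
    # 5. Data mining: count how many distinct customers bought each item.
    pairs = {(r["cust"], r["item"]) for r in rows}
    return Counter(item for _, item in pairs)

def evaluate(patterns, min_support=2):
    # 6. Pattern evaluation: keep only patterns that meet the threshold.
    return {item: n for item, n in patterns.items() if n >= min_support}

def present(patterns):
    # 7. Knowledge presentation: render the result in a readable form.
    return [f"{item}: bought by {n} customers" for item, n in sorted(patterns.items())]

integrated = integrate(clean(source_a), clean(source_b))
patterns = mine(transform(select(integrated)))
report = present(evaluate(patterns))
print(report)
```

Each function stands in for a whole family of techniques; the point of the sketch is only the order in which the steps feed into one another.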

Sanfoundry Global Education & Learning Series – Data Mining.

Manish Bhojasia - Founder & CTO at Sanfoundry
Manish Bhojasia, a technology veteran with 20+ years at Cisco and Wipro, is Founder and CTO at Sanfoundry. He lives in Bangalore and focuses on development of Linux Kernel, SAN Technologies, Advanced C, Data Structures & Algorithms.