This set of Data Science Multiple Choice Questions & Answers (MCQs) focuses on “Big Data”.
1. Which of the following terms is appropriate for the figure below?
a) Large Data
b) Big Data
c) Dark Data
d) None of the mentioned
Explanation: Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate.
2. Point out the correct statement.
a) Machine learning focuses on prediction, based on known properties learned from the training data
b) Data Cleaning focuses on prediction, based on known properties learned from the training data
c) Representing data in a form that mere mortals can both understand and draw valuable insights from is as much a science as it is an art
d) None of the mentioned
Explanation: Visualization is becoming a very important aspect of working with data.
3. Which of the following characteristics of big data is of relatively greater concern to data science?
a) Velocity
b) Variety
c) Volume
d) None of the mentioned
Explanation: Big data enables organizations to store, manage, and manipulate vast amounts of disparate data at the right speed and at the right time.
4. Which of the following analytical capabilities is provided by an information management company?
a) Stream Computing
b) Content Management
c) Information Integration
d) All of the mentioned
Explanation: With stream computing, an organization can store less, analyze more, and make better decisions faster.
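The "store less, analyze more" idea can be illustrated with a minimal sketch. It assumes a hypothetical stream of numeric sensor readings and a helper named `running_mean` (neither comes from the quiz), and it is not how any particular stream computing product works: each value is analyzed on arrival, and only a count and a running total are kept.

```python
from typing import Iterable, Iterator


def running_mean(stream: Iterable[float]) -> Iterator[float]:
    """Yield the mean of all values seen so far, keeping only two numbers in memory."""
    count = 0
    total = 0.0
    for value in stream:
        count += 1
        total += value
        # The raw value is not stored; only the aggregate state survives.
        yield total / count


# Hypothetical readings standing in for a live data stream.
readings = iter([10.0, 20.0, 30.0, 40.0])
means = list(running_mean(readings))
print(means)
```

An updated mean is available after every reading, so a decision does not have to wait for the full data set to be stored and processed in a batch.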
5. Point out the wrong statement.
a) The big volume indeed represents Big Data
b) The data growth and social media explosion have changed how we look at the data
c) Big Data is just about lots of data
d) All of the mentioned
Explanation: Big Data is actually a concept that provides an opportunity to find new insights in your existing data, as well as guidelines for capturing and analyzing your future data.
6. Which of the following steps is performed by a data scientist after acquiring the data?
a) Data Cleansing
b) Data Integration
c) Data Replication
d) All of the mentioned
Explanation: Data cleansing, data cleaning or data scrubbing is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database.
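As a rough illustration of detecting and correcting (or removing) bad records, here is a minimal sketch in plain Python. The record layout, the `clean_record` helper, and the 0 to 120 age range are assumptions made for this example only.

```python
from typing import Optional

# Hypothetical raw records: one has stray whitespace, two are corrupt.
raw_records = [
    {"id": "1", "age": " 34 "},
    {"id": "2", "age": "abc"},
    {"id": "3", "age": "-5"},
    {"id": "4", "age": "51"},
]


def clean_record(record: dict) -> Optional[dict]:
    """Return a corrected record, or None if it cannot be repaired."""
    try:
        age = int(record["age"].strip())  # correct: drop stray whitespace
    except ValueError:
        return None  # remove: age is not a number
    if not 0 <= age <= 120:  # assumed plausibility range
        return None  # remove: age is out of range
    return {"id": int(record["id"]), "age": age}


cleaned = [r for r in map(clean_record, raw_records) if r is not None]
print(cleaned)
```

Records 1 and 4 are kept with typed fields, while records 2 and 3 are dropped as inaccurate.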
7. The 3 V’s are not sufficient to describe big data.
a) True
b) False
Explanation: IBM data scientists break big data into four dimensions: volume, variety, velocity and veracity.
8. Which of the following focuses on the discovery of (previously) unknown properties in the data?
a) Data mining
b) Big Data
c) Data wrangling
d) Machine Learning
Explanation: Data munging or data wrangling is loosely the process of manually converting or mapping data from one “raw” form into another format that allows for more convenient consumption of the data with the help of semi-automated tools.
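To make the idea of mapping data from a "raw" form into a more convenient one concrete, here is a minimal sketch using only the Python standard library. The semicolon-delimited input and its `name` and `score` columns are invented for illustration.

```python
import csv
import io
import json

# Hypothetical raw export: semicolon-delimited text with every field as a string.
raw = "name;score\nAda;91\nLinus;78\n"

# Parse the raw text and convert each row into a typed dictionary.
reader = csv.DictReader(io.StringIO(raw), delimiter=";")
rows = [{"name": row["name"], "score": int(row["score"])} for row in reader]

# Emit JSON, a format that downstream tools can consume more easily.
as_json = json.dumps(rows)
print(as_json)
```

The same content is now available as typed records and as JSON, which is the kind of reshaping the explanation above describes.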
9. Which of the following languages should replace the question mark in the figure below?
a) Java
b) PHP
c) COBOL
d) None of the mentioned
Explanation: Java is used for processing data in big data analytics.
10. Beyond volume, variety, and velocity, veracity is also an issue for big data.
a) True
b) False
Explanation: Data veracity concerns how uncertain or imprecise the data is.
Sanfoundry Global Education & Learning Series – Data Science.