Data Science Questions and Answers – Big Data

This set of tough Data Science Questions and Answers focuses on “Big Data”.

1. Which of the following terms fits the description below?
A broad term for data sets so large or complex that traditional data processing applications are inadequate.
a) Large Data
b) Big Data
c) Dark Data
d) None of the mentioned

Answer: b
Explanation: Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate.

2. Point out the correct statement.
a) Machine learning focuses on prediction, based on known properties learned from the training data
b) Data Cleaning focuses on prediction, based on known properties learned from the training data
c) Representing data in a form that mere mortals can understand and draw valuable insights from is as much a science as it is an art
d) None of the mentioned

Answer: c
Explanation: Visualization, that is, representing data in a form people can readily understand, is becoming a very important aspect of data science.

3. Which of the following characteristics of big data is relatively more relevant to data science?
a) Velocity
b) Variety
c) Volume
d) None of the mentioned

Answer: b
Explanation: Big data enables organizations to store, manage, and manipulate vast amounts of disparate data at the right speed and at the right time.

4. Which of the following analytical capabilities are provided by an information management company?
a) Stream Computing
b) Content Management
c) Information Integration
d) All of the mentioned

Answer: d
Explanation: With stream computing, organizations can store less, analyze more, and make better decisions faster.
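
As a rough illustration of the "store less, analyze more" idea, the sketch below updates summary statistics one reading at a time and never keeps the readings themselves. The `RunningStats` class, the `sensor_stream` generator, and the sample values are hypothetical names and data chosen for this example; they do not come from any particular stream computing product.

```python
class RunningStats:
    """Keeps a count, mean, and maximum without storing past values."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self.maximum = None

    def update(self, value):
        # Incremental mean: shift the old mean toward the new value.
        self.count += 1
        self.mean += (value - self.mean) / self.count
        if self.maximum is None or value > self.maximum:
            self.maximum = value


def sensor_stream():
    # Hypothetical stand-in for a live feed; values are made up.
    for reading in (20.0, 22.0, 21.0, 25.0):
        yield reading


stats = RunningStats()
for reading in sensor_stream():
    stats.update(reading)  # each reading is analyzed, then discarded

print(stats.count, stats.mean, stats.maximum)
```

Memory use stays constant no matter how many readings arrive, which is the trade-off the explanation alludes to.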

5. Point out the wrong statement.
a) The big volume indeed represents Big Data
b) The data growth and social media explosion have changed how we look at the data
c) Big Data is just about lots of data
d) All of the mentioned

Answer: c
Explanation: Big Data is actually a concept that provides an opportunity to find new insights in your existing data, as well as guidelines to capture and analyze your future data.

6. Which of the following steps is performed by a data scientist after acquiring the data?
a) Data Cleansing
b) Data Integration
c) Data Replication
d) All of the mentioned

Answer: a
Explanation: Data cleansing, data cleaning or data scrubbing is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database.
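
A minimal sketch of what detecting and correcting (or removing) bad records can look like in practice is given below. The record layout (`name`, `age`), the validity rules, and the `clean_records` helper are assumptions made up for this illustration, not a standard procedure.

```python
def clean_records(records):
    """Split records into corrected ones and ones rejected as unusable."""
    cleaned, rejected = [], []
    for rec in records:
        name = (rec.get("name") or "").strip()
        try:
            age = int(rec.get("age"))
        except (TypeError, ValueError):
            rejected.append(rec)  # age is missing or not a number
            continue
        # Assumed rules: a name is required and age must be plausible.
        if not name or not (0 <= age <= 120):
            rejected.append(rec)
            continue
        # Correct what can be corrected: trim whitespace, fix casing, cast age.
        cleaned.append({"name": name.title(), "age": age})
    return cleaned, rejected


raw = [
    {"name": "  alice ", "age": "34"},   # fixable formatting issues
    {"name": "Bob", "age": "-5"},        # impossible age
    {"name": "", "age": "41"},           # missing name
    {"name": "carol", "age": "n/a"},     # non-numeric age
]
cleaned, rejected = clean_records(raw)
print(cleaned)
print(len(rejected))
```

Keeping the rejected records separate, rather than silently dropping them, makes it possible to inspect why they failed.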

7. 3V’s are not sufficient to describe big data.
a) True
b) False

Answer: a
Explanation: IBM data scientists break big data into four dimensions: volume, variety, velocity and veracity.

8. Which of the following focuses on the discovery of (previously) unknown properties on the data?
a) Data mining
b) Big Data
c) Data wrangling
d) Machine Learning

Answer: a
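Explanation: Data mining focuses on the discovery of previously unknown properties in the data, whereas machine learning focuses on prediction based on known properties learned from training data. Data munging or data wrangling, by contrast, is loosely the process of manually converting or mapping data from one “raw” form into another format that allows for more convenient consumption of the data with the help of semi-automated tools.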
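
To make the data wrangling part of this explanation concrete, here is a small sketch that maps a "raw" semicolon-delimited export into typed, nested records that are easier to consume. The column names, the sample rows, and the `wrangle` function are hypothetical and exist only for this example.

```python
import csv
import io
import json

# Hypothetical raw export: semicolon-delimited text where everything is a string.
RAW = "id;name;signup\n1;Alice;2021-03-04\n2;Bob;2020-11-30\n"


def wrangle(raw_text):
    """Convert raw delimited text into a list of typed dictionaries."""
    reader = csv.DictReader(io.StringIO(raw_text), delimiter=";")
    rows = []
    for row in reader:
        year, month, day = (int(part) for part in row["signup"].split("-"))
        rows.append({
            "id": int(row["id"]),
            "name": row["name"],
            "signup": {"year": year, "month": month, "day": day},
        })
    return rows


records = wrangle(RAW)
as_json = json.dumps(records)  # a format convenient for downstream tools
print(as_json)
```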

9. Which of the following languages should replace the question mark in the figure below?
[Figure: language used for processing data in Big Data analytics, marked with a question mark]
a) Java
b) PHP
c) COBOL
d) None of the mentioned

Answer: a
Explanation: Java is used for processing data in Big data Analytics.

10. Beyond volume, variety, and velocity, veracity is also an issue for big data.
a) True
b) False

Answer: a
Explanation: Data veracity refers to the uncertainty or imprecision of data.

Sanfoundry Global Education & Learning Series – Data Science.
