Hadoop Questions and Answers – Storm

«
»

This set of Hadoop Multiple Choice Questions & Answers (MCQs) focuses on “Storm”.

1. ____________ is a distributed real-time computation system for processing large volumes of high-velocity data.
a) Kafka
b) Storm
c) Lucene
d) BigTop
View Answer

Answer: b
Explanation: Storm on YARN is powerful for scenarios requiring real-time analytics, machine learning and continuous monitoring of operations.

2. Point out the correct statement.
a) A Storm topology consumes streams of data and processes those streams in arbitrarily complex ways
b) Apache Storm is a free and open source distributed real-time computation system
c) Storm integrates with the queueing and database technologies you already use
d) All of the mentioned
View Answer

Answer: d
Explanation: Storm has many use cases: real-time analytics, online machine learning, continuous computation, distributed RPC, ETL, and more.

advertisement

3. Storm integrates with __________ via Apache Slider.
a) Scheduler
b) YARN
c) Compaction
d) All of the mentioned
View Answer

Answer: c
Explanation: Impala is open source (Apache License), so you can self-support in perpetuity if you wish.

4. For Apache __________ users, Storm utilizes the same ODBC interface.
a) cTakes
b) Hive
c) Pig
d) Oozie
View Answer

Answer: b
Explanation: You don’t have to worry about re-inventing the implementation wheel.

5. Point out the wrong statement.
a) Storm is difficult and can be used with only Java
b) Storm is fast: a benchmark clocked it at over a million tuples processed per second per node
c) Storm is scalable, fault-tolerant, guarantees your data will be processed
d) All of the mentioned
View Answer

Answer: a
Explanation: Storm is simple, can be used with any programming language.

6. Storm is benchmarked as processing one million _______ byte messages per second per node.
a) 10
b) 50
c) 100
d) 200
View Answer

Answer: c
Explanation: Storm is a distributed real-time computation system.

advertisement

7. Apache Storm added open source, stream data processing to _________ Data Platform.
a) Cloudera
b) Hortonworks
c) Local Cloudera
d) MapR
View Answer

Answer: b
Explanation: The Storm community is working to improve capabilities related to three important themes: business continuity, operations and developer productivity.

8. How many types of nodes are present in Storm cluster?
a) 1
b) 2
c) 3
d) 4
View Answer

Answer: c
Explanation: A storm cluster has three sets of nodes.

9. __________ node distributes code across the cluster.
a) Zookeeper
b) Nimbus
c) Supervisor
d) None of the mentioned
View Answer

Answer: b
Explanation: Nimbus node is master node, similar to the Hadoop JobTracker.

10. ____________ communicates with Nimbus through Zookeeper, starts and stops workers according to signals from Nimbus.
a) Zookeeper
b) Nimbus
c) Supervisor
d) None of the mentioned
View Answer

Answer: c
Explanation: ZooKeeper nodes coordinate the Storm cluster.

advertisement

Sanfoundry Global Education & Learning Series – Hadoop.

Here’s the list of Best Reference Books in Hadoop.

advertisement
advertisement
advertisement
Manish Bhojasia, a technology veteran with 20+ years @ Cisco & Wipro, is Founder and CTO at Sanfoundry. He is Linux Kernel Developer & SAN Architect and is passionate about competency developments in these areas. He lives in Bangalore and delivers focused training sessions to IT professionals in Linux Kernel, Linux Debugging, Linux Device Drivers, Linux Networking, Linux Storage, Advanced C Programming, SAN Storage Technologies, SCSI Internals & Storage Protocols such as iSCSI & Fiber Channel. Stay connected with him @ LinkedIn