This set of Hadoop Multiple Choice Questions & Answers (MCQs) focuses on “Hadoop libraries-1”.
1. Apache __________ is a data repository containing device information, images and other relevant information for all sorts of mobile devices.
a) DirectMemory
b) Directory
c) DeviceMap
d) Drill
Answer: c
Explanation: DeviceMap is the device-information repository described in the question. Drill, by contrast, is a distributed system for interactive analysis of large-scale datasets.
2. Point out the correct statement:
a) Drill is a build system based on Apache Ant and Apache Ivy
b) DirectMemory’s main purpose is to act as a second-level cache
c) EasyAnt is inspired by Google’s Dremel
d) None of the mentioned
Answer: b
Explanation: DirectMemory is used to store large amounts of data without filling up the Java heap and thus avoiding long garbage collection cycles.
3. ____________ is a secure and highly scalable microsharing and micromessaging platform.
a) ESME
b) Directory
c) Empire-db
d) All of the mentioned
Answer: a
Explanation: ESME allows people to discover and meet one another and get controlled access to other sources of information, all in a business process context.
4. Which of the following frameworks is used for building and consuming network services?
a) ESME
b) DirectoryMap
c) Empire-db
d) Etch
Answer: d
Explanation: Etch is a cross-platform, language- and transport-independent framework.
5. Point out the wrong statement:
a) Felix is an implementation of the OSGi R4 specification
b) Falcon is a data processing and management solution
c) Flex is an application framework for building Flash-based applications
d) None of the mentioned
Answer: d
Explanation: All three statements are correct. Falcon is used for coordination of data pipelines, lifecycle management, and data discovery.
6. _____________ is an open source system for expressive, declarative, fast, and efficient data analysis.
a) Flume
b) Flink
c) Flex
d) ESME
Answer: b
Explanation: Flink, originally known as Stratosphere, combines the scalability and programming flexibility of distributed MapReduce-like platforms with the efficiency, out-of-core execution, and query optimization capabilities found in parallel databases.
7. ________________ is a complete FTP server based on the MINA I/O system.
a) Giraph
b) Gereition
c) FtpServer
d) Oozie
Answer: c
Explanation: FtpServer is the FTP server described in the question. Giraph, by contrast, is a large-scale, fault-tolerant, Bulk Synchronous Parallel (BSP)-based graph processing framework.
8. _____________ is a distributed computing framework based on BSP.
a) HCataMan
b) HCatalog
c) Hama
d) All of the mentioned
Answer: c
Explanation: BSP stands for Bulk Synchronous Parallel.
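As an illustration of the BSP model only, and not of Hama's actual API, the following minimal Python sketch simulates supersteps on a single machine. Each vertex computes locally, sends messages to its neighbors, and waits at a barrier before the next superstep begins. The function name `bsp_max` and the graph representation are assumptions made for this example.

```python
def bsp_max(values, neighbors):
    """Spread the maximum value through a graph using BSP-style supersteps.

    values:    dict mapping vertex -> initial number
    neighbors: dict mapping vertex -> list of adjacent vertices
    (Both structures are hypothetical; Hama itself is a Java framework.)
    """
    state = dict(values)

    # Superstep 0: every vertex sends its own value to its neighbors.
    inbox = {v: [] for v in state}
    for v, val in state.items():
        for n in neighbors[v]:
            inbox[n].append(val)

    while True:
        # Barrier: messages sent in the previous superstep are now visible.
        outbox = {v: [] for v in state}
        changed = False
        for v, msgs in inbox.items():
            # Local computation: keep the largest value seen so far.
            best = max(msgs, default=state[v])
            if best > state[v]:
                state[v] = best
                changed = True
                # Communication: tell neighbors about the new value.
                for n in neighbors[v]:
                    outbox[n].append(best)
        if not changed:
            # No vertex updated its value, so the computation halts.
            return state
        inbox = outbox
```

Calling `bsp_max({"a": 3, "b": 6, "c": 2, "d": 1}, {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]})` leaves every vertex holding 6. A real BSP framework runs the per-vertex work in parallel across machines and uses a distributed barrier instead of a loop boundary.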
9. Apache __________ is a generic cluster management framework used to build distributed systems.
a) Helix
b) Gereition
c) FtpServer
d) None of the mentioned
Answer: a
Explanation: Helix provides automatic partition management, fault tolerance and elasticity.
10. The __________ Data Mapper framework makes it easier to use a database with Java or .NET applications.
a) iBix
b) Helix
c) iBATIS
d) iBAT
Answer: c
Explanation: iBATIS couples objects with stored procedures or SQL statements using an XML descriptor.
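To make the idea of coupling objects to SQL statements through an XML descriptor concrete, here is a rough Python sketch of the pattern. It is not iBATIS itself, which targets Java and .NET. The descriptor layout, the `Person` class, and the helper names `load_statements` and `query_for_object` are all hypothetical and only loosely modeled on the concept of a SQL map.

```python
import sqlite3
import xml.etree.ElementTree as ET
from dataclasses import dataclass

# Hypothetical descriptor: maps a statement id to SQL and a result class.
# This is an illustrative layout, not the real iBATIS schema.
DESCRIPTOR = """
<sqlMap>
  <select id="getPersonById" resultClass="Person">
    SELECT id, name FROM person WHERE id = ?
  </select>
</sqlMap>
"""


@dataclass
class Person:
    id: int
    name: str


# Registry that resolves class names used in the descriptor.
RESULT_CLASSES = {"Person": Person}


def load_statements(xml_text):
    """Parse the descriptor into {statement_id: (sql, result_class)}."""
    root = ET.fromstring(xml_text)
    return {
        el.get("id"): (el.text.strip(), RESULT_CLASSES[el.get("resultClass")])
        for el in root.findall("select")
    }


def query_for_object(conn, statements, statement_id, *params):
    """Run the named statement and map the first row onto its result class."""
    sql, result_class = statements[statement_id]
    row = conn.execute(sql, params).fetchone()
    return result_class(*row) if row is not None else None
```

With an in-memory SQLite table `person(id, name)` containing the row `(1, 'Ada')`, `query_for_object(conn, load_statements(DESCRIPTOR), "getPersonById", 1)` returns `Person(id=1, name='Ada')`. The point of the pattern is that the SQL lives in the descriptor, so application code refers to statements only by id.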