This set of Hadoop Multiple Choice Questions & Answers (MCQs) focuses on “Hadoop libraries – 1”.
1. Apache __________ is a data repository containing device information, images, and other relevant information for all sorts of mobile devices.
a) DirectMemory
b) Directory
c) DeviceMap
d) Drill
View Answer
Explanation: Drill is a distributed system for interactive analysis of large-scale datasets.
2. Point out the correct statement.
a) Drill is a build system based on Apache Ant and Apache Ivy
b) DirectMemory’s main purpose is to act as a second-level cache
c) Easyant is inspired by Google’s Dremel
d) None of the mentioned
View Answer
Explanation: DirectMemory is used to store large amounts of data without filling up the Java heap and thus avoiding long garbage collection cycles.
3. ____________ is a secure and highly scalable micro sharing and micro-messaging platform.
a) ESME
b) Directory
c) Empire-db
d) All of the mentioned
View Answer
Explanation: ESME allows people to discover and meet one another and get controlled access to other sources of information, all in a business process context.
4. Which of the framework is used for building and consuming network services?
a) ESME
b) DirectoryMap
c) Empire-db
d) Etch
View Answer
Explanation: Etch is a cross-platform, language- and transport-independent framework.
5. Point out the wrong statement.
a) Felix is implementation of the OSGi R4 specification
b) Falcon is a data processing and management solution
c) Flex is application framework for building Flash-based applications
d) None of the mentioned
View Answer
Explanation: Falcon is used for coordination of data pipelines, lifecycle management, and data discovery.
6. _____________ is an open source system for expressive, declarative, fast, and efficient data analysis.
a) Flume
b) Flink
c) Flex
d) ESME
View Answer
Explanation: Stratosphere combines the scalability and programming flexibility of distributed MapReduce-like platforms with the efficiency, out-of-core execution.
7. ________________ is complete FTP Server based on Mina I/O system.
a) Giraph
b) Gereition
c) FtpServer
d) Oozie
View Answer
Explanation: Giraph is a large-scale, fault-tolerant, Bulk Synchronous Parallel (BSP)-based graph processing framework.
8. _____________ is a distributed computing framework based on BSP.
a) HCataMan
b) HCatlaog
c) Hama
d) All of the mentioned
View Answer
Explanation: BSP stands for Bulk Synchronous Parallel.
9. Apache __________ is a generic cluster management framework used to build distributed systems.
a) Helix
b) Gereition
c) FtpServer
d) None of the mentioned
View Answer
Explanation: Helix provides automatic partition management, fault tolerance and elasticity.
10. The __________ data Mapper framework makes it easier to use a database with Java or .NET applications.
a) iBix
b) Helix
c) iBATIS
d) iBAT
View Answer
Explanation: iBATIS couples objects with stored procedures or SQL statements using an XML descriptor.
Sanfoundry Global Education & Learning Series – Hadoop.
Here’s the list of Best Books in Hadoop.
- Check Hadoop Books
- Apply for Computer Science Internship
- Check Programming Books
- Practice Programming MCQs