Hadoop Questions and Answers – Lucene with Hadoop – 1

This set of Hadoop Multiple Choice Questions & Answers (MCQs) focuses on “Lucene with Hadoop – 1”.

1. ___________ provides Java-based indexing and search technology.
a) Solr
b) Lucene Core
c) Lucy
d) All of the mentioned
View Answer

Answer: b
Explanation: Lucene provides spellchecking, hit highlighting and advanced analysis/tokenization capabilities.

2. Point out the correct statement.
a) Building PyLucene requires GNU Make, a recent version of Ant capable of building Java Lucene and a C++ compiler
b) PyLucene is supported on Mac OS X, Linux, Solaris and Windows
c) Use of setuptools is recommended for Lucene
d) All of the mentioned
View Answer

Answer: d
Explanation: PyLucene requires Python version 2.x (x >= 3.5) and Java version 1.x (x &t;= 5).

3. ___________ is a high performance search server built using Lucene Core.
a) Solr
b) Lucene Core
c) Lucy
d) PyLucene
View Answer

Answer: a
Explanation: Solr provides hit highlighting, faceted search, caching, replication, and a web admin interface.

4. ____________ is a subproject with the aim of collecting and distributing free materials.
a) OSR
b) OPR
c) ORP
d) ORS
View Answer

Answer: c
Explanation: Open Relevance Project is used for relevance testing and performance.

advertisement

5. Point out the wrong statement.
a) PyLucene is a Lucene port
b) PyLucene embeds a Java VM with Lucene into a Python process
c) The PyLucene Python extension, a Python module called lucene is machine-generated by JCC
d) PyLucene is built with JCC
View Answer

Answer: a
Explanation: PyLucene is not a Lucene port but a Python wrapper around Java Lucene.

6. _______ is a Python port of the Core project.
a) Solr
b) Lucene Core
c) Lucy
d) PyLucene
View Answer

Answer: d
Explanation: PyLucene is a Python extension for accessing Java LuceneTM.

Free 30-Day C Certification Bootcamp is Live. Join Now!

7. The Lucene _________ is pleased to announce the availability of Apache Lucene 5.0.0 and Apache Solr 5.0.0.
a) PMC
b) RPC
c) CPM
d) All of the mentioned
View Answer

Answer: a
Explanation: PyLucene was previously hosted at the Open Source Applications Foundation.

8. ___________ is a technology suitable for nearly any application that requires full-text search, especially cross-platform.
a) Lucene
b) Oozie
c) Lucy
d) All of the mentioned
View Answer

Answer: a
Explanation: Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java.

9. Lucene provides scalable, high-Performance indexing over ______ per hour on modern hardware.
a) 1 TB
b) 150GB
c) 10 GB
d) None of the mentioned
View Answer

Answer: b
Explanation: Lucene offers powerful features through a simple API.

10. Lucene index size is roughly _______ the size of text indexed.
a) 10%
b) 20%
c) 50%
d) 70%
View Answer

Answer: b
Explanation: Lucene provides incremental indexing as fast as batch indexing.

advertisement

Sanfoundry Global Education & Learning Series – Hadoop.

Here’s the list of Best Books in Hadoop.

advertisement
advertisement
Subscribe to our Newsletters (Subject-wise). Participate in the Sanfoundry Certification contest to get free Certificate of Merit. Join our social networks below and stay updated with latest contests, videos, internships and jobs!

Youtube | Telegram | LinkedIn | Instagram | Facebook | Twitter | Pinterest
Manish Bhojasia - Founder & CTO at Sanfoundry
I’m Manish - Founder and CTO at Sanfoundry. I’ve been working in tech for over 25 years, with deep focus on Linux kernel, SAN technologies, Advanced C, Full Stack and Scalable website designs.

You can connect with me on LinkedIn, watch my Youtube Masterclasses, or join my Telegram tech discussions.

If you’re in your 40s–60s and exploring new directions in your career, I also offer mentoring. Learn more here.