Data Science Questions and Answers – Predicting with Regression

This set of Data Science Multiple Choice Questions & Answers (MCQs) focuses on “Predicting with Regression”.

1. Predicting with trees evaluate _____________ within each group of data.
a) equality
b) homogeneity
c) heterogeneity
d) all of the mentioned
View Answer

Answer: b
Explanation: Predicting with trees is easy to interpret.

2. Point out the wrong statement.
a) Training and testing data must be processed in different way
b) Test transformation would mostly be imperfect
c) The first goal is statistical and second is data compression in PCA
d) All of the mentioned
View Answer

Answer: a
Explanation: Training and testing data must be processed in same way.

3. Which of the following method options is provided by train function for bagging?
a) bagEarth
b) treebag
c) bagFDA
d) all of the mentioned
View Answer

Answer: d
Explanation: Bagging can be done using bag function as well.

4. Which of the following is correct with respect to random forest?
a) Random forest are difficult to interpret but often very accurate
b) Random forest are easy to interpret but often very accurate
c) Random forest are difficult to interpret but very less accurate
d) None of the mentioned
View Answer

Answer: a
Explanation: Random forest is top performing algorithm in prediction.

advertisement
advertisement

5. Point out the correct statement.
a) Prediction with regression is easy to implement
b) Prediction with regression is easy to interpret
c) Prediction with regression performs well when linear model is correct
d) All of the mentioned
View Answer

Answer: d
Explanation: Prediction with regression gives poor performance in non linear settings.

6. Which of the following library is used for boosting generalized additive models?
a) gamBoost
b) gbm
c) ada
d) all of the mentioned
View Answer

Answer: a
Explanation: Boosting can be used with any subset of classifier.

Note: Join free Sanfoundry classes at Telegram or Youtube

7. The principal components are equal to left singular values if you first scale the variables.
a) True
b) False
View Answer

Answer: b
Explanation: The principal components are equal to left singular values if you first scale the variables.

8. Which of the following is statistical boosting based on additive logistic regression?
a) gamBoost
b) gbm
c) ada
d) mboost
View Answer

Answer: a
Explanation: mboost is used for model based boosting.

advertisement

9. Which of the following is one of the largest boost subclass in boosting?
a) variance boosting
b) gradient boosting
c) mean boosting
d) all of the mentioned
View Answer

Answer: b
Explanation: R has multiple boosting libraries.

10. PCA is most useful for non linear type models.
a) True
b) False
View Answer

Answer: b
Explanation: PCA is most useful for linear type models.

advertisement

Sanfoundry Global Education & Learning Series – Data Science.

Here’s the list of Best Books in Data Science.

If you find a mistake in question / option / answer, kindly take a screenshot and email to [email protected]

advertisement
advertisement
Subscribe to our Newsletters (Subject-wise). Participate in the Sanfoundry Certification contest to get free Certificate of Merit. Join our social networks below and stay updated with latest contests, videos, internships and jobs!

Youtube | Telegram | LinkedIn | Instagram | Facebook | Twitter | Pinterest
Manish Bhojasia - Founder & CTO at Sanfoundry
Manish Bhojasia, a technology veteran with 20+ years @ Cisco & Wipro, is Founder and CTO at Sanfoundry. He lives in Bangalore, and focuses on development of Linux Kernel, SAN Technologies, Advanced C, Data Structures & Alogrithms. Stay connected with him at LinkedIn.

Subscribe to his free Masterclasses at Youtube & discussions at Telegram SanfoundryClasses.