Data Science Questions and Answers – Probability and Statistics

This set of Data Science Multiple Choice Questions & Answers (MCQs) focuses on “Probability and Statistics”.

1. The expected value or _______ of a random variable is the center of its distribution.
a) mode
b) median
c) mean
d) bayesian inference

Answer: c
Explanation: The expected value of a random variable is its mean, which marks the center of mass of its distribution.
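As an illustration of the expected value as a probability-weighted average, here is a minimal sketch. The fair six-sided die and the helper name expected_value are assumptions chosen for the example, not part of the question.

```python
from fractions import Fraction

# Hypothetical example: a fair six-sided die, each face with probability 1/6.
pmf = {face: Fraction(1, 6) for face in range(1, 7)}

def expected_value(pmf):
    """Return the sum of x * P(X = x) over the support of a discrete variable."""
    return sum(x * p for x, p in pmf.items())

mean = expected_value(pmf)
print(mean)  # 7/2
```

The result, 3.5, is not a face of the die; it is the balance point of the distribution.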

2. Point out the correct statement.
a) Some cumulative distribution function F is non-decreasing and right-continuous
b) Every cumulative distribution function F is decreasing and right-continuous
c) Every cumulative distribution function F is increasing and left-continuous
d) None of the mentioned

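Answer: a
Explanation: Every cumulative distribution function F is non-decreasing and right-continuous, so the property in option a holds (in fact for all CDFs, not only some). Options b and c are false because a CDF is never decreasing and need not be left-continuous.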
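The two properties can be checked on a small step-function CDF. The sketch below assumes a fair six-sided die as the example distribution; die_cdf is a hypothetical helper name.

```python
import math

def die_cdf(x):
    """F(x) = P(X <= x) for a fair six-sided die (illustrative assumption)."""
    if x < 1:
        return 0.0
    return min(math.floor(x), 6) / 6

# Non-decreasing: F never drops as x grows along a grid from -1.0 to 8.0.
grid = [i / 10 for i in range(-10, 81)]
is_non_decreasing = all(die_cdf(a) <= die_cdf(b) for a, b in zip(grid, grid[1:]))

# Right-continuous at the jump x = 3: the value just to the right equals F(3),
# while the value just to the left is strictly smaller.
eps = 1e-9
right_limit_matches = die_cdf(3 + eps) == die_cdf(3)
left_limit_differs = die_cdf(3 - eps) < die_cdf(3)

print(is_non_decreasing, right_limit_matches, left_limit_differs)
```

At each jump the function takes the upper value, which is what right-continuity means for a discrete distribution.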

3. Which of the following properties of a random variable is a measure of spread?
a) variance
b) standard deviation
c) empirical mean
d) all of the mentioned

Answer: a
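Explanation: The variance measures how far a random variable spreads around its mean, so densities with a higher variance are more spread out than densities with a lower variance. The empirical mean measures location rather than spread, so option d does not apply.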

4. The square root of the variance is called the ________ deviation.
a) empirical
b) mean
c) continuous
d) standard

Answer: d
Explanation: The standard deviation (SD) is the square root of the variance. It measures the spread of the numbers in a data set around their mean and is expressed in the same units as the data.
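The relationship between variance and standard deviation can be sketched with Python's standard statistics module. The data set below is made up for illustration, and the population versions (pvariance, pstdev) are used as an assumption; the sample versions divide by n - 1 instead.

```python
import math
import statistics

# Hypothetical data set chosen so that the numbers come out round.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(data)
variance = statistics.pvariance(data)   # average squared distance from the mean
std_dev = statistics.pstdev(data)       # square root of the variance

print(mean, variance, std_dev)
print(math.isclose(std_dev, math.sqrt(variance)))
```

Because the standard deviation is in the same units as the data, it is usually easier to interpret than the variance.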

5. Point out the wrong statement.
a) A percentile is simply a quantile with α expressed as a percent
b) There are two types of random variables
c) R cannot approximate quantiles for you for common distributions
d) None of the mentioned

Answer: c
Explanation: R can approximate quantiles for common distributions, for example with qnorm for the normal distribution, so statement c is the wrong one.
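The question refers to R; as a rough analog, the sketch below uses Python's standard library to look up quantiles of a standard normal distribution. The choice of Python and of the 0.975 and 0.5 levels are assumptions made for illustration.

```python
from statistics import NormalDist

# Standard normal distribution (mean 0, standard deviation 1).
z = NormalDist(mu=0.0, sigma=1.0)

# The quantile at level alpha is the point x with P(X <= x) = alpha.
q975 = z.inv_cdf(0.975)    # the 97.5th percentile
median = z.inv_cdf(0.5)    # the 50th percentile

print(round(q975, 2), round(median, 2))
```

Expressing the level 0.975 as 97.5% is all that separates a quantile from a percentile.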

6. Which of the following inequalities is useful for interpreting variances?
a) Chebyshev
b) Stautaory
c) Testory
d) All of the mentioned

Answer: a
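Explanation: Chebyshev’s inequality (also spelled Tchebysheff’s inequality) states that the probability that a random variable lies k or more standard deviations from its mean is at most 1/k². This gives the variance a concrete interpretation for any distribution with a finite variance.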

7. For continuous random variables, the CDF is the derivative of the PDF.
a) True
b) False

Answer: b
Explanation: For continuous random variables, the PDF is the derivative of the CDF.
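A quick numerical check of this relationship, as a sketch: the standard normal distribution, the evaluation point 0.5, and the step size h are all assumptions picked for the example.

```python
from statistics import NormalDist

def numerical_derivative(f, x, h=1e-5):
    """Central-difference approximation to f'(x)."""
    return (f(x + h) - f(x - h)) / (2 * h)

z = NormalDist()   # standard normal
x = 0.5

slope = numerical_derivative(z.cdf, x)   # slope of the CDF at x
density = z.pdf(x)                       # value of the PDF at x

print(slope, density)
```

The two printed numbers should agree to several decimal places, since differentiating the CDF recovers the PDF.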

8. Chebyshev’s inequality states that the probability of a “Six Sigma” event is less than ___________
a) 10%
b) 20%
c) 30%
d) 3%

Answer: d
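Explanation: Chebyshev’s inequality bounds the probability of a deviation of six or more standard deviations by 1/6² = 1/36, which is about 2.8% and therefore less than 3%. The bound is conservative: if a bell curve is assumed, the probability of a “six sigma” event is on the order of one ten millionth of a percent.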
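The gap between the distribution-free bound and the bell-curve value can be computed directly. This is a minimal sketch; the function names are hypothetical, and the normal tail is evaluated with math.erfc to avoid subtracting two nearly equal numbers.

```python
import math

def chebyshev_bound(k):
    """Upper bound on P(|X - mu| >= k * sigma) from Chebyshev's inequality."""
    return 1.0 / k**2

def normal_two_sided_tail(k):
    """P(|Z| >= k) for a standard normal Z, using the complementary error function."""
    return math.erfc(k / math.sqrt(2))

k = 6
bound = chebyshev_bound(k)
tail = normal_two_sided_tail(k)

print(f"Chebyshev bound: {bound:.4f}")
print(f"Normal tail:     {tail:.3e}")
```

The Chebyshev bound holds for any distribution with a finite variance, which is why it is so much looser than the value obtained under a normality assumption.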

9. Which of the following random variables are the default model for random samples?
a) iid
b) id
c) pmd
d) all of the mentioned

Answer: a
Explanation: Random variables are said to be iid if they are independent and identically distributed.
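A simulated iid sample can be sketched as repeated, independent draws from one fixed distribution. The standard normal distribution, the sample size, and the seed below are arbitrary assumptions for illustration.

```python
import random
from statistics import fmean

rng = random.Random(42)   # fixed seed, an arbitrary choice for reproducibility

# Each draw comes from the same N(0, 1) distribution (identically distributed)
# and does not depend on the earlier draws (independent).
sample = [rng.gauss(0.0, 1.0) for _ in range(10_000)]

sample_mean = fmean(sample)
print(len(sample), sample_mean)
```

With this many draws the sample mean should land close to the population mean of 0.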

10. Cumulative distribution functions are used to specify the distribution of multivariate random variables.
a) True
b) False

Answer: a
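Explanation: A cumulative distribution function F(x) = P(X ≤ x) specifies a distribution, and the definition extends to multivariate random variables through the joint CDF. In the case of a continuous distribution, the CDF gives the area under the probability density function from minus infinity to x.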
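The "area under the density" reading of the CDF can be checked numerically in the univariate case. The sketch below assumes a standard normal distribution and replaces minus infinity with a lower limit of -10, below which the density is negligible; cdf_by_integration is a hypothetical helper.

```python
from statistics import NormalDist

def cdf_by_integration(pdf, x, lower=-10.0, steps=20_000):
    """Approximate the integral of pdf from `lower` to x with the midpoint rule."""
    width = (x - lower) / steps
    return sum(pdf(lower + (i + 0.5) * width) for i in range(steps)) * width

z = NormalDist()   # standard normal

approx = cdf_by_integration(z.pdf, 1.0)   # area under the PDF up to x = 1
exact = z.cdf(1.0)                        # library value of the CDF at x = 1

print(approx, exact)
```

Both numbers should be close to 0.84, the probability that a standard normal variable is at most 1.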

Sanfoundry Global Education & Learning Series – Data Science.
