This set of Basic Data Science questions and answers focuses on “Statistical Inference”.

1. Which of the following types of testing is concerned with making decisions using data?

a) Probability

b) Hypothesis

c) Causal

d) None of the mentioned

Explanation: In hypothesis testing, the null hypothesis is assumed true, and statistical evidence is required to reject it in favor of a research (alternative) hypothesis.

2. Point out the correct statement:

a) Power of a one-sided test is lower than the power of the associated two-sided test

b) Power of a two-sided test is greater than the power of the associated one-sided test

c) Hypothesis testing is less commonly used

d) None of the Mentioned

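Explanation: At the same significance level, the power of a one-sided test is greater than the power of the associated two-sided test, provided the true effect lies in the direction of the one-sided alternative.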
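
The comparison can be checked numerically. The following is a minimal sketch for a z-test with known standard deviation; the function name `z_test_power` and the numbers (effect 0.5, sigma 1, n = 16, alpha = 0.05) are illustrative assumptions, not part of the question.

```python
from statistics import NormalDist

def z_test_power(effect, sigma, n, alpha=0.05, two_sided=False):
    """Power of a z-test of H0: mu = mu0 when the true mean is mu0 + effect."""
    z = NormalDist()
    # Mean of the standardized test statistic under the alternative.
    shift = effect * n ** 0.5 / sigma
    if two_sided:
        crit = z.inv_cdf(1 - alpha / 2)
        # Reject when |Z| > crit: add both rejection tails.
        return (1 - z.cdf(crit - shift)) + z.cdf(-crit - shift)
    crit = z.inv_cdf(1 - alpha)
    # Reject when Z > crit (upper-tailed alternative).
    return 1 - z.cdf(crit - shift)

one_sided = z_test_power(effect=0.5, sigma=1.0, n=16)
two_sided = z_test_power(effect=0.5, sigma=1.0, n=16, two_sided=True)
print(f"one-sided power: {one_sided:.3f}, two-sided power: {two_sided:.3f}")
```

With these assumed values the one-sided power is roughly 0.64 and the two-sided power roughly 0.52, because the two-sided test spends half of alpha on a tail where the effect does not lie.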

3. Which of the following values is the most common measure of “statistical significance”?

a) P

b) A

c) L

d) All of the Mentioned

Explanation: The P-value is the probability, under the null hypothesis, of obtaining evidence as extreme as or more extreme than the evidence actually observed.
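
As a small illustration of that definition, the sketch below computes the P-value of an observed z statistic from the standard normal distribution. The helper name `z_p_value` and the observed value 1.96 are assumptions chosen for the example.

```python
from statistics import NormalDist

def z_p_value(z_stat, two_sided=True):
    """P-value of an observed z statistic under a standard normal null."""
    if two_sided:
        # Probability of a statistic at least as far from 0 in either direction.
        return 2 * (1 - NormalDist().cdf(abs(z_stat)))
    # Upper-tailed test: probability of a statistic at least this large.
    return 1 - NormalDist().cdf(z_stat)

p_two = z_p_value(1.96)
p_one = z_p_value(1.96, two_sided=False)
print(f"two-sided: {p_two:.4f}, one-sided: {p_one:.4f}")
```

For z = 1.96 the two-sided P-value is about 0.05 and the one-sided P-value about 0.025.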

4. What is the purpose of multiple testing in statistical inference?

a) Minimize errors

b) Minimize false positives

c) Minimize false negatives

d) All of the Mentioned

Explanation: A false positive is an error in which a condition being tested for is mistakenly reported as detected. Multiple testing corrections limit how many such errors arise when many hypotheses are tested at once.

5. Point out the wrong statement with respect to FDR:

a) FDR is difficult to calculate

b) FDR is relatively less conservative

c) FDR allows for more false positives

d) None of the Mentioned

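Explanation: FDR stands for false discovery rate. Controlling the FDR is still fairly easy to calculate and is less conservative than controlling the family-wise error rate, at the cost of allowing more false positives.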
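
To show that FDR control is not hard to compute, here is a minimal sketch of the Benjamini-Hochberg step-up procedure, one common way to control the FDR. The function name and the list of P-values are made up for illustration; the last line applies a Bonferroni threshold to the same values for comparison.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return a list of booleans: True where the hypothesis is rejected at FDR level q."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k (1-based) with p_(k) <= k * q / m.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * q / m:
            cutoff_rank = rank
    # Reject every hypothesis ranked at or below the cutoff.
    rejected = [False] * m
    for idx in order[:cutoff_rank]:
        rejected[idx] = True
    return rejected

# Hypothetical P-values from eight tests.
p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]
bh_rejected = benjamini_hochberg(p_values)
bonferroni_rejected = [p <= 0.05 / len(p_values) for p in p_values]
print(sum(bh_rejected), "rejections with BH;", sum(bonferroni_rejected), "with Bonferroni")
```

On these assumed values the procedure rejects two hypotheses where the Bonferroni threshold rejects only one, which is what "less conservative" means in practice.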

6. Which of the following is the oldest multiple testing correction ?

a) Bonferroni correction

b) Bernoulli correction

c) Likelihood correction

d) All of the Mentioned

Explanation: The Bonferroni correction is easy to calculate: with m tests at overall level alpha, each individual test is carried out at level alpha/m.
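
A minimal sketch of the calculation, assuming an overall level of 0.05 and four hypothetical P-values; the function name `bonferroni` and its return shape are illustrative choices.

```python
def bonferroni(p_values, alpha=0.05):
    """Per-test threshold, adjusted P-values, and rejection decisions."""
    m = len(p_values)
    threshold = alpha / m
    # Equivalent view: multiply each P-value by m and cap at 1.
    adjusted = [min(1.0, p * m) for p in p_values]
    rejected = [p <= threshold for p in p_values]
    return threshold, adjusted, rejected

threshold, adjusted, rejected = bonferroni([0.004, 0.02, 0.3, 0.6])
print(threshold, adjusted, rejected)
```

Only the first P-value falls below the per-test threshold of 0.0125, so only that hypothesis is rejected.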

7. The pooled estimator is a mixture of the group variances, placing greater weight on whichever has a larger sample size.

a) True

b) False

Explanation: True. If the sample sizes are the same, the pooled variance estimate is the average of the group variances.
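
The weighting can be seen directly from the usual pooled variance formula, in which each group variance is weighted by its degrees of freedom. The sketch below uses made-up data; `pooled_variance` is an illustrative helper name.

```python
from statistics import variance

def pooled_variance(x, y):
    """Pooled variance: group variances weighted by degrees of freedom."""
    nx, ny = len(x), len(y)
    return ((nx - 1) * variance(x) + (ny - 1) * variance(y)) / (nx + ny - 2)

a = [1.0, 2.0, 3.0, 4.0]
b = [2.0, 4.0, 6.0, 8.0]
c = [2.0, 4.0, 6.0, 8.0, 10.0, 12.0, 14.0, 16.0]

# Equal sample sizes: the result is the plain average of the two variances.
equal_n = pooled_variance(a, b)
# Unequal sample sizes: the result is pulled toward the larger group's variance.
unequal_n = pooled_variance(a, c)
print(equal_n, (variance(a) + variance(b)) / 2, unequal_n)
```

In the unequal case the pooled value sits closer to the variance of `c`, the larger group, than a simple average of the two variances would.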

8. Which of the following tools is used for constructing confidence intervals and calculating standard errors for difficult statistics?

a) baggyer

b) bootstrap

c) jackknife

d) None of the Mentioned

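Explanation: The bootstrap procedure follows from the so-called bootstrap principle: the empirical distribution of the observed data is used as a stand-in for the unknown population distribution, and the statistic is recomputed on samples drawn from it with replacement.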
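
As a rough sketch of the idea, the code below bootstraps the sample median, a statistic with no simple standard error formula. The data, the number of resamples, the fixed seed, and the choice of a percentile interval are all assumptions made for the example.

```python
import random
from statistics import median, stdev

def bootstrap_median(data, n_resamples=2000, seed=0):
    """Bootstrap standard error and 95% percentile interval for the median."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(data)
    # Resample with replacement and recompute the median each time.
    medians = sorted(
        median(rng.choices(data, k=n)) for _ in range(n_resamples)
    )
    se = stdev(medians)
    # Percentile interval: 2.5th and 97.5th percentiles of the resampled medians.
    lo = medians[n_resamples * 25 // 1000]
    hi = medians[n_resamples * 975 // 1000 - 1]
    return se, (lo, hi)

sample = [2.1, 3.4, 1.9, 5.6, 4.2, 3.8, 2.7, 4.9, 3.1, 3.6]
se, (lo, hi) = bootstrap_median(sample)
print(f"SE: {se:.3f}, 95% interval: ({lo}, {hi})")
```

The percentile interval is only one of several bootstrap interval constructions; it is used here because it is the shortest to write down.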

9. Which of the following tools is used for estimating standard errors and the bias of estimators?

a) knitr

b) jackknife

c) ggplot2

d) All of the Mentioned

Explanation: The jackknife involves resampling the data: it deletes one observation at a time and recomputes the estimate on each reduced sample.
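
The leave-one-out idea can be sketched in a few lines. The helper names `jackknife` and `plug_in_variance` and the data are assumptions for illustration; the formulas are the standard jackknife bias and standard error estimates.

```python
from statistics import mean, stdev

def jackknife(data, estimator):
    """Jackknife estimates of the bias and standard error of an estimator."""
    n = len(data)
    full = estimator(data)
    # Recompute the estimate with each observation left out in turn.
    loo = [estimator(data[:i] + data[i + 1:]) for i in range(n)]
    loo_mean = mean(loo)
    bias = (n - 1) * (loo_mean - full)
    se = ((n - 1) / n * sum((t - loo_mean) ** 2 for t in loo)) ** 0.5
    return bias, se

def plug_in_variance(xs):
    """Biased variance estimator that divides by n instead of n - 1."""
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

data = [2.0, 4.0, 4.0, 5.0, 7.0, 9.0]
bias_mean, se_mean = jackknife(data, mean)
bias_var, _ = jackknife(data, plug_in_variance)
classic_se = stdev(data) / len(data) ** 0.5  # textbook SE of the mean
print(bias_mean, se_mean, classic_se, bias_var)
```

For the sample mean the jackknife reports essentially zero bias and reproduces the textbook standard error, while for the plug-in variance it reports a negative bias, matching the known downward bias of dividing by n.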

10. Power is the probability of rejecting the null hypothesis when it is true.

a) True

b) False

Explanation: False. Power is the probability of rejecting the null hypothesis when it is false; rejecting a null hypothesis that is true is a Type I error.

