21. A data professional determines the best fit line by calculating the difference between observed values and the predicted value of a regression line. What is this calculation?
- Notion
- Coefficient
- Residual
- Parameter
22. A data professional minimizes the sum of squared residuals to estimate parameters in a linear regression model. What method are they using?
- Ordinary least squares
- Mean absolute error
- Residual coefficients
- R squared
23. FIll in the blank: A scatterplot matrix is a series of scatterplots that show the _____ between pairs of variables.
- discrepancies
- distances
- variability
- relationships
24. Fill in the blank: A _____ is the area surrounding a line that describes the uncertainty around the predicted outcome at every value of X.
- confidence band
- interval band
- interval slope
- confidence slope
25. What measures the proportion of variation in the dependent variable Y explained by the independent variable X?
- Mean squared error (MSE)
- Mean absolute error (MAE)
- P-value
- R squared
26. Which of the following statements accurately describe a randomized, controlled experiment? Select all that apply.
- The differences between the control and treatment groups must be observable and measurable.
- It is a study design that randomly assigns participants into an experimental group or a control group.
- As the study is conducted, the only expected similarity between the control and experimental groups is the outcome variable being studied.
- To be successful, data professionals must control for every factor in the experiment.
27. What is the difference between observed or actual values and the predicted values of a regression line?
- Slope
- Parameter
- Residual
- Beta
28. FIll in the blank: A scatterplot matrix is a series of scatterplots that show the relationships between pairs of _____.
- coordinates
- variables
- models
- lines
29. Fill in the blank: A confidence band is the area surrounding a line that describes the uncertainty around the predicted outcome at every value of _____.
- X
- intercept
- slope
- Y
30. Which of the following statements accurately describe running a randomized, controlled experiment? Select all that apply.
- To be successful, data professionals must control for every factor in the experiment.
- The differences between the control and treatment groups must be observable and measurable.
- It is typically used when arguing for causation between variables.
- It is a study design that systematically and methodically assigns participants into groups.