21. A data professional determines the best fit line by calculating the difference between observed values and the predicted value of a regression line. What is this calculation?

  • Notion
  • Coefficient
  • Residual
  • Parameter

22. A data professional minimizes the sum of squared residuals to estimate parameters in a linear regression model. What method are they using?

  • Ordinary least squares
  • Mean absolute error
  • Residual coefficients
  • R squared

23. FIll in the blank: A scatterplot matrix is a series of scatterplots that show the _____ between pairs of variables.

  • discrepancies
  • distances
  • variability
  • relationships

24. Fill in the blank: A _____ is the area surrounding a line that describes the uncertainty around the predicted outcome at every value of X.

  • confidence band
  • interval band
  • interval slope
  • confidence slope

25. What measures the proportion of variation in the dependent variable Y explained by the independent variable X?

  • Mean squared error (MSE)
  • Mean absolute error (MAE)
  • P-value
  • R squared

26. Which of the following statements accurately describe a randomized, controlled experiment? Select all that apply.

  • The differences between the control and treatment groups must be observable and measurable.
  • It is a study design that randomly assigns participants into an experimental group or a control group.
  • As the study is conducted, the only expected similarity between the control and experimental groups is the outcome variable being studied.
  • To be successful, data professionals must control for every factor in the experiment.

27. What is the difference between observed or actual values and the predicted values of a regression line?

  • Slope
  • Parameter
  • Residual
  • Beta

28. FIll in the blank: A scatterplot matrix is a series of scatterplots that show the relationships between pairs of _____.

  • coordinates
  • variables
  • models
  • lines

29. Fill in the blank: A confidence band is the area surrounding a line that describes the uncertainty around the predicted outcome at every value of _____.

  • X
  • intercept
  • slope
  • Y

30. Which of the following statements accurately describe running a randomized, controlled experiment? Select all that apply.

  • To be successful, data professionals must control for every factor in the experiment.
  • The differences between the control and treatment groups must be observable and measurable.
  • It is typically used when arguing for causation between variables.
  • It is a study design that systematically and methodically assigns participants into groups.

Leave a Reply