regression analysis: simplify complex data relationships coursera weekly challenges 3 answers

Test your knowledge: Understand multiple linear regression

1. Fill in the blank: _____ is a technique that estimates the linear relationship between one continuous dependent variable and two or more independent variables.

  • Singular curved regression
  • Multiple curved regression
  • Singular linear regression
  • Multiple linear regression

2. What are ways to ethically communicate multiple regression results as clearly as possible? Select all that apply.

  • Ordinary least squares
  • One hot encoding
  • Interaction terms
  • Confidence band

Test your knowledge: Model assumptions revisited

3. Which of the following statements is true? Select all that apply.

  • One hot encoding is a data transformation technique.
  • One hot encoding is a categorical transformation technique.
  • One hot encoding allows data professionals to turn several categorical variables into one binary variable.
  • One hot encoding allows data professionals to turn one categorical variable into several binary variables.

4. What is the definition of the no multicollinearity assumption?

  • No two independent variables can be highly correlated with each other.
  • No observation in the dataset can be independent.
  • No variation of the residential can be constant or similar across the model.
  • No predictor variable can be linearly related to the outcome variable.

5. In what ways might a data professional handle data with multicollinearity? Select all that apply.

  • Create new variables using existing data.
  • Square the variables that have high multicollinearity.
  • Turn one categorical variable into several binary variables.
  • Drop one or more variables that have high multicollinearity.

Test your knowledge: Model interpretation

6. Fill in the blank: An interaction term represents how the relationship between two independent variables is associated with the changes in the _____ of the dependent variable.

  • category
  • multicollinearity
  • rate of change
  • mean

7. Which of the following relevant statistics can be found by using statsmodel’s OLS function? Select all that apply.

  • P-values
  • Variance inflation factors
  • Coefficients
  • Standard errors

Test your knowledge: Variable selection and model evaluation

8. Fill in the blank: Adjusted R squared is a variation of the R squared regression evaluation metric that _____ unnecessary explanatory variables.

  • adds
  • rewards
  • eliminates
  • penalizes

Shuffle Q/A 1

9. Which of the following statements accurately describe the differences between adjusted R squared and R squared? Select all that apply.

  • Adjusted R squared is easily interpretable.
  • R squared is more easily interpretable.
  • R squared is used to compare models of varying complexity.
  • Adjusted R squared is used to compare models of varying complexity.

10. What variable section process begins with the full model that has all possible independent variables?

  • Backward elimination
  • Forward selection
  • F-test
  • Extra-sum-of Squares

Leave a Reply