The Nuts and Bolts of Machine Learning: Coursera Week 4 Quiz Answers
Test your knowledge: Additional supervised learning techniques
1. Tree-based learning is a type of unsupervised machine learning that performs classification and regression tasks.
- True
- False
2. Fill in the blank: Similar to a flow chart, a _____ is a classification model that represents various solutions available to solve a given problem based on the possible outcomes of each solution.
- decision tree
- Poisson distribution
- linear regression
- binary logistic regression
3. In a decision tree, which node is the location where the first decision is made?
- Leaf
- Branch
- Root
- Decision
4. In tree-based learning, how is a split determined?
- By the amount of leaves present
- By which variables and cut-off values offer the most predictive power
- By the level of balance present among the predictions made by the model
- By the number of decisions required before arriving at a final prediction
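For context on question 4: a minimal sketch of how a tree chooses its splits, assuming scikit-learn and its bundled iris dataset.

```python
# A minimal sketch, assuming scikit-learn and its bundled iris dataset.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)

# At each node, the tree evaluates candidate variables and cut-off
# values and keeps the split with the most predictive power (by
# default, the largest reduction in Gini impurity).
tree = DecisionTreeClassifier(max_depth=2, random_state=42).fit(X, y)

# export_text prints the chosen variable and cut-off at each split.
print(export_text(tree))
```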
Test your knowledge: Tune tree-based models
5. Fill in the blank: The hyperparameter max depth is used to limit the depth of a decision tree, which is the number of levels between the _____ and the farthest node away from it.
- decision node
- root node
- leaf node
- first split
6. What tuning technique can a data professional use to confirm that a model achieves its intended purpose?
- Classifier
- Min samples leaf
- Grid search
- Decision tree
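Question 6's answer, grid search, can be sketched with scikit-learn's GridSearchCV; the candidate hyperparameter values below are arbitrary illustrations.

```python
# A minimal grid search sketch, assuming scikit-learn; the candidate
# hyperparameter values are arbitrary illustrations.
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Grid search fits a model for every combination of candidate values
# and keeps the combination with the best cross-validated score.
param_grid = {"max_depth": [2, 4, 6], "min_samples_leaf": [1, 5, 10]}
search = GridSearchCV(DecisionTreeClassifier(random_state=42),
                      param_grid, cv=5, scoring="accuracy")
search.fit(X, y)
print(search.best_params_, search.best_score_)
```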
7. During model validation, the validation dataset must be combined with test data in order to function properly.
- True
- False
8. Fill in the blank: Cross validation involves splitting training data into different combinations of _____, on which the model is trained.
- banks
- parcels
- tiers
- folds
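A minimal sketch of the k-fold cross validation in question 8, assuming scikit-learn; cv=5 gives five folds.

```python
# A minimal cross-validation sketch, assuming scikit-learn.
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# cv=5 splits the training data into five folds; each fold takes a
# turn as the validation set while the model trains on the other four.
scores = cross_val_score(DecisionTreeClassifier(random_state=42), X, y, cv=5)
print(scores.mean())
```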
Test your knowledge: Bagging
9. Ensemble learning is most effective when the outputs are aggregated from models that follow the exact same methodology, all using the same dataset.
- True
- False
10. What are some of the benefits of ensemble learning? Select all that apply.
- The predictions have lower variance than other standalone models.
- It requires few base learners trained on the same dataset.
- The predictions have less bias than other standalone models.
- It combines the results of many models to help make more reliable predictions.
11. In a random forest, what type of data is used to train the ensemble of decision-tree base learners?
- Duplicated
- Unstructured
- Bootstrapped
- Sampled
12. Fill in the blank: When using a decision tree model, a data professional can use _____ to control the threshold below which nodes become leaves.
- min_samples_leaf
- max_features
- max_depth
- min_samples_split
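Questions 11 and 12 map directly onto scikit-learn's random forest: bootstrap=True trains each tree on bootstrapped samples, and min_samples_leaf controls the threshold below which nodes become leaves. A minimal sketch, with all hyperparameter values as illustrations (max_features, covered again in the weekly challenge, is included for completeness):

```python
# A minimal random forest sketch, assuming scikit-learn; all
# hyperparameter values are illustrative.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, y = load_iris(return_X_y=True)

# bootstrap=True (the default) trains each tree on a bootstrapped
# sample of the data; min_samples_leaf sets the threshold below which
# nodes become leaves; max_features caps how many attributes each
# tree may consider at a split.
forest = RandomForestClassifier(n_estimators=100, bootstrap=True,
                                min_samples_leaf=5, max_features="sqrt",
                                random_state=42)
forest.fit(X, y)
```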
Test your knowledge: Boosting
13. Fill in the blank: The supervised learning technique boosting builds an ensemble of weak learners _____, then aggregates their predictions.
- in parallel
- repeatedly
- randomly
- sequentially
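A minimal sketch of the sequential boosting in question 13, assuming scikit-learn's AdaBoostClassifier (whose default base learner is a depth-1 decision stump):

```python
# A minimal sequential-boosting sketch, assuming scikit-learn's AdaBoost.
from sklearn.datasets import load_iris
from sklearn.ensemble import AdaBoostClassifier

X, y = load_iris(return_X_y=True)

# Each weak learner (by default a depth-1 decision stump) is fit in
# sequence, with misclassified samples reweighted before the next
# learner trains; the ensemble then aggregates the weighted votes.
ada = AdaBoostClassifier(n_estimators=50, random_state=42)
ada.fit(X, y)
```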
14. When using a gradient boosting machine (GBM) modeling technique, which term describes a model’s ability to predict new values that fall outside of the range of values in the training data?
- Learning rate
- Cross validation
- Grid search
- Extrapolation
15. When using the hyperparameter min_child_weight, a tree will not split a node if it results in any child node with less weight than what is specified. What happens to the node instead?
- It becomes a root.
- It becomes a leaf.
- It gets deleted.
- It duplicates itself to become another node.
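A minimal sketch of question 15's min_child_weight, assuming the xgboost package; the value 3 is an arbitrary illustration.

```python
# A minimal sketch, assuming the xgboost package; the value 3 is an
# arbitrary illustration.
from sklearn.datasets import load_iris
from xgboost import XGBClassifier

X, y = load_iris(return_X_y=True)

# If a candidate split would create a child node whose summed instance
# weight falls below min_child_weight, the split is skipped and the
# node becomes a leaf.
model = XGBClassifier(n_estimators=50, max_depth=4, min_child_weight=3)
model.fit(X, y)
```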
Weekly challenge 4
16. Fill in the blank: In tree-based learning, a decision tree’s _____ represent observations about an item.
- roots
- splits
- leaves
- branches
17. Which of the following statements accurately describe decision trees? Select all that apply.
- Decision trees represent solutions to solve a given problem based on possible outcomes of related choices.
- Decision trees are susceptible to overfitting.
- Decision trees are equally effective at predicting both existing and new data.
- Decision trees require no assumptions regarding the distribution of underlying data.
18. Which section of a decision tree is where the final prediction is made?
- Decision node
- Root node
- Leaf node
- Split
19. In a decision tree model, which hyperparameter specifies the number of attributes that each tree selects randomly from the training data to determine its splits?
- Max depth
- Learning rate
- Number of estimators
- Max features
20. What process uses different portions of the data to test and train a model across several iterations?
- Grid search
- Cross validation
- Model validation
- Proportional verification
21. Which of the following statements correctly describe ensemble learning? Select all that apply.
- If a base learner’s prediction is equally effective as a random guess, it is a strong learner.
- A best practice of ensemble learning is to use very different methodologies for each contributing model.
- Ensemble learning involves building multiple models.
- It is possible to use the same methodology for each contributing model, as long as there are numerous base learners.
22. Fill in the blank: Each base learner in a random forest model has different combinations of features available to it, which helps prevent correlated errors among _____ in the ensemble.
- splits
- learners
- roots
- nodes
23. What are some benefits of boosting? Select all that apply.
- The models used in boosting can be trained in parallel across many different servers.
- Boosting does not require the data to be normalized.
- Boosting is robust to outliers.
- Boosting functions well even with multicollinearity among the features.
24. Fill in the blank: In tree-based learning, the decision tree’s _____ represent an item’s target value.
- leaves
- roots
- splits
- branches
25. What are some disadvantages of decision trees? Select all that apply.
- When new data is introduced, decision trees can be less effective at prediction.
- Preparing data to train a decision tree is a complex process involving significant preprocessing.
- Decision trees require assumptions regarding the distribution of underlying data.
- Decision trees can be particularly susceptible to overfitting.
26. In a decision tree model, which hyperparameter sets the threshold below which nodes become leaves?
- Min samples split
- Min samples leaf
- Min samples tree
- Min child weight
27. What practice uses a validation dataset to verify that models are performing as expected?
- Model validation
- Grid search
- Tree verification
- Cross validation
28. Which of the following statements correctly describe ensemble learning? Select all that apply.
- When building an ensemble using different types of models, each should be trained on different data.
- Predictions using an ensemble of models are accurate, even when the individual models are barely more accurate than a random guess.
- Ensemble learning involves aggregating the outputs of multiple models to make a final prediction.
- If a base learner’s prediction is only slightly better than a random guess, it becomes a weak learner.
29. What is the only section of a decision tree that contains no predecessors?
- Split
- Leaf node
- Root node
- Decision node
30. What practice uses a validation dataset to verify that models are performing as expected?
- Model validation
- Tree verification
- Cross validation
- Grid search
31. Fill in the blank: A random forest model grows trees by taking a random subset of the available features in the training data, then _____ each node at the best feature available to that tree.
- bagging
- tuning
- bootstrapping
- splitting
32. Which of the following statements correctly describe gradient boosting? Select all that apply.
- Each base learner in the sequence is built to predict the residual errors of the model that preceded it.
- Gradient boosting machines have difficulty with extrapolation.
- Gradient boosting models can be trained in parallel.
- Gradient boosting machines can be difficult to interpret.
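The first statement in question 32 can be sketched by hand: fit a weak learner, compute its residuals, fit the next learner on those residuals, and aggregate with a learning rate. A minimal illustration, assuming scikit-learn and squared-error loss:

```python
# A minimal residual-fitting illustration, assuming scikit-learn and
# squared-error loss; data and hyperparameters are arbitrary.
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(42)
X = rng.uniform(0, 10, size=(200, 1))
y = np.sin(X).ravel()

# The first weak learner fits the target directly.
tree1 = DecisionTreeRegressor(max_depth=2).fit(X, y)
residuals = y - tree1.predict(X)

# The next weak learner is built to predict the residual errors of
# the model that preceded it.
tree2 = DecisionTreeRegressor(max_depth=2).fit(X, residuals)

# Predictions are aggregated sequentially, shrunk by a learning rate.
learning_rate = 0.1
y_hat = tree1.predict(X) + learning_rate * tree2.predict(X)
```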
33. Fill in the blank: In tree-based learning, the decision tree’s _____ represent where the first decision is made.
- roots
- branches
- leaves
- splits
34. Fill in the blank: A random forest is an ensemble of decision-tree _____ that are trained on bootstrapped data.
- observations
- variables
- statements
- base learners
35. What are some benefits of boosting? Select all that apply.
- Boosting scales well to very large datasets.
- Boosting can handle both numeric and categorical features.
- Boosting algorithms are easy to understand.
- Boosting does not require the data to be scaled.
36. Which of the following statements correctly describe gradient boosting? Select all that apply.
- Gradient boosting machines cannot handle messy data.
- Gradient boosting machines do not have coefficients or directionality.
- Gradient boosting machines are often called black-box models because their predictions cannot be explained easily.
- Gradient boosting machines have a lot of hyperparameters.