the nuts and bolts of machine learning coursera week 3 quiz answers

Test your knowledge: Explore unsupervised learning and K-means

1. Fill in the blank: K-means is an unsupervised partitioning algorithm used to organize _____ data into clusters.

Answers

unlabeled

presorted
subcategorized

hierarchical

2. In k-means, what term describes the point at which each cluster is defined?

Answers

Commonality
Centroid

Core
Coordinate

3. A data professional is iterating on certain tasks that will enable them to create a k-means model. They continue doing this until the algorithm converges. Which step of the model-building process does this scenario represent?

Answers

Step four

Step three
Step two
Step one

Test your knowledge: Evaluate a K-means model

4. In a k-means model, which evaluation metric represents the sum of the squared distances between each observation and its closest centroid?

Answers

Silhouette score
SMAPE
F1-score

Inertia

5. Fill in the blank: A data professional may use the _____ method to choose an optimal value for k. This is a tool for identifying the point at which the decrease in inertia starts to level off.

Answers

partitioning
elbow

clustering
unsupervised learning

6. A data professional is using Scikit-learn to create a k-means model. Which attribute will enable them to get the cluster assignments?

Answers

Inertia

Labels
Fit
Silhouette score

Weekly challenge 3

7. Which of the following statements correctly describe key aspects of k-means? Select all that apply.

Answers

K-means is an unsupervised partitioning algorithm.
The value of k is a standard that never changes.
K-means clusters are defined by a central point, called a centroid.

To avoid poor clustering, data professionals run a k-means model with different starting positions for the centroids.

8. A data professional is recalculating the centroid of each cluster. Which step of the model-creation process are they working in?

Answers

Step four
Step one

Step three
Step two

9. Fill in the blank: In order to evaluate the intracluster space in a k-means model, a data professional uses the inertia metric. This is the _____ of the squared distances between each observation and its nearest centroid.

Answers

difference

sum
average
ratio

10. When creating a k-means model, what does it mean when an observation has a silhouette score coefficient with a value close to negative one?

Answers

The observation may be in the wrong cluster.
The observation is suitably within its own cluster and well separated from other clusters.
The observation is on the boundary between clusters.

The observation is in the correct cluster.

11. Which Python function fits a k-means model for multiple values of k by calculating the inertia for each value, appending it to a list, and returning that list?

Answers

silhouette score
cluster_image

k-means inertia
labels

12. Which of the following statements accurately describe the elbow method? Select all that apply.

Answers

When using the elbow method, data professionals aim to find the smoothest part of the curve.

The elbow method uses a line plot to visually compare the inertias of different models.
There is not always an obvious elbow.
The sharpest bend in the curve is usually the model that will provide the most meaningful clustering of data.

13. A data professional is assigning each data point to its nearest centroid. Which step of the model-creation process are they working in?

Answers

Step one
Step three
Step four

Step two

14. Fill in the blank: In order to evaluate the _____ space in a k-means model, a data professional uses the inertia metric. This is the sum of the squared distances between each observation and its nearest centroid.

Answers

converged
midpoint

intracluster
intercluster

15. When creating a k-means model, what does it mean when an observation has a silhouette score coefficient with a value of zero?

Answers

The observation is in an appropriate cluster.

The observation may be in the wrong cluster.
The observation is suitably within its own cluster and well separated from other clusters.
The observation is on the boundary between clusters.

16. Which of the following statements correctly describe key aspects of k-means? Select all that apply.

Answers

The k-means clustering process has four steps that repeat until the model converges.
K-means organizes unlabeled data into clusters.
The position of the k-means centroid is the center of the cluster, also known as the mathematical mean.

K-means is a supervised partitioning algorithm.

17. Which of the following statements accurately describe the elbow method? Select all that apply.

Answers

There is always an obvious elbow.
The elbow method uses a line plot to visually compare the inertias of different models.

When using the elbow method, data professionals find the sharpest bend in the curve.
The elbow method helps data professionals decide which clustering gives the most meaningful model.

18. A data professional is choosing the number of centroids to use in a k-means model and placing them in the data space. Which step of the model-creation process are they working in?

Answers

Step one

Step three
Step four
Step two

19. Fill in the blank: In order to evaluate the intracluster space in a k-means model, a data professional uses the _____ metric. This is the sum of the squared distances between each observation and its nearest centroid.

Answers

convergence
inertia
spread

silhouette score

20. Which metric would a data professional use to better understand the intracluster distance between data points and their centroids?

Answers

k-means inertia
silhouette score

cluster_image
labels

Leave a Reply Cancel reply