ibm data analyst capstone project coursera week 2 answers

Graded Quiz: Duplicates

1. How many duplicate rows are there in the dataset?

  • 145
  • 124
  • 154
  • 99

2. How many duplicate values are there in the column Respondent?

  • 99
  • 154
  • 10
  • 96

Graded Quiz: Removing Duplicates

3. After removing the duplicate rows, how many rows are there in the dataset?

  • 9999
  • 12333
  • 11398
  • 13456

4. After removing the duplicate rows, how many unique rows are there in the column Respondent?

  • 11398
  • 11342
  • 11999
  • 11000

Graded Quiz: Missing Values

5. After removing the duplicate rows, how many blank rows are there under the column EdLevel?

  • 231
  • 112
  • 0
  • 280

6. After removing the duplicate rows, how many rows are missing under the column Country?

  • 201
  • 188
  • 280
  • 0

Graded Quiz: Imputing Missing Values

7. What is the majority category under the column Employment?

  • Employed full-time
  • Employed part-time
  • Retired
  • Independent contractor, freelancer, or self-employed

8. Under the column " UndergradMajor", which category has the minimum number of rows?

  • Health Science
  • Fine Arts
  • Information Systems
  • Computer Science

9. The column ‘ConvertedComp’ contains the annual compensation of the survey respondents. What is the best approach to impute the missing values in this column?

  • min
  • max
  • mean
  • median

Graded Quiz: Normalizing Data

10. How many unique values are there in the CompFreq column?

  • 7
  • 3
  • 11398
  • 5

11. After removing the duplicate rows, how many respondents are being paid yearly?

  • 6073
  • 312
  • 3259
  • 9999

12. What is the median NormalizedAnnualCompensation?

  • 100000
  • 9000
  • 6132520
  • 8080

Leave a Reply