the path to insights: data models and pipelines coursera weekly challenge 3 answers
Test your knowledge: Optimize pipelines and ETL processes
1. What is the business intelligence process that involves checking data for defects in order to prevent system failures?
- Query planning
- Business intelligence monitoring
- Quality testing
- Data governance
2. Fill in the blank: Completeness is a quality testing step that involves confirming that the data contains all desired ____ or components.
- Measures
- Columns
- Fields
- Context
3. A business intelligence professional is considering the integrity of their data throughout its life cycle. Which of the following goals do they aim to achieve?
- Data is trustworthy
- Data is consistent
- Data is accurate and complete
- Data is encrypted
Activity: Evaluate a schema using a validation checklist
4. Did you complete this activity?
- Yes
- No
5. The Shipments table is missing a relationship to another table. Which table should it connect to?
- Sales Fact
- Order Details
- Product
- Order Items
6. Which of the following is a convention used in this schema?
- Including the order_sid dimension in every table
- Abbreviating system id as “sid”
- Abbreviating customer as “cust”
- Alphabetizing each dimension name
7. You find an error while trying to connect the Product table to the Order Items table. Which problem(s) would prevent the schema from validating? Select all that apply.
- The Product table has fewer columns than the Order Items table
- The product_id name does not match product_sid.
- The data type of the product ids in the Product table is an integer, but it’s a string in the Order Items table.
- There are product ids in the Order Items table that don’t exist in the Product table.
8. The Customer table should be linked to which of the following tables? Select all that apply.
- Billing
- Order Details
- Order Items
- Sales Fact
Test your knowledge: Data schema validation
9. A team of business intelligence professionals builds schema validation into their workflows. In this situation, what goal do they want to achieve?
- Consolidate data from multiple source systems
- Prevent two or more components from using a single resource in a conflicting way
- Consider the needs of stakeholders in the design of the data schema
- Ensure the source system data schema matches the target system data schema
10. Why is it important to ensure primary and foreign keys continue to function after data has been moved from one database system to another?
- To preserve the existing table relationships
- To evaluate database performance
- To provide more detail and context about the data
- To read and execute coded instructions
11. Fill in the blank: A _____ describes the process of identifying the origin of data, where it has moved throughout the system, and how it has transformed over time.
- Data dictionary
- Business rule
- Quality test
- Data lineage
Test your knowledge: Business rules and performance testing
12. A business intelligence professional considers what data is collected and stored in a database, how relationships are defined, the type of information the database provides, and the security of the data. What does this scenario describe?
- Ensuring the formal management of data assets
- Confirming that data is consistent
- Considering the impact of business rules
- Expanding scope in response to stakeholder requirements
13. At which point in the data-transfer process should incoming data be compared to business rules?
- At the same time as it is being loaded into the database
- As soon as it has been loaded into the database
- Before loading it into the database
- No later than 24 hours after being loaded into the database
Weekly challenge 3
14. Quality testing involves checking data for defects in order to prevent what from happening?
- System failure
- Redundancy
- Fragmentation
- Contention
15. A business intelligence professional is confirming that their data contains all desired components or measures. Which quality testing validation element does this involve?
- Integrity
- Completeness
- Accuracy
- Consistency
16. A business intelligence professional is working with a data warehouse. They perform various tasks to confirm that the data is timely and the pipeline is ingesting the latest information. For what reasons is this an important element of business intelligence? Select all that apply.
- To have the most current information
- To ensure the data is updated properly
- To provide relevant insights
- To map the data correctly
17. Conformity is an aspect of establishing consistent data governance. What are the key tools involved with conformity? Select all that apply.
- Schema validation
- Combined systems
- Data dictionaries
- Data lineages
18. Fill in the blank: Schema validation properties preserve table relationships, ensure consistent conventions, and ensure database _____ are still valid.
- models
- permissions
- interfaces
- keys
19. Fill in the blank: A data _____ describes the process of identifying the origin of data, where it has moved throughout the system, and how it has transformed over time.
- map
- model
- lineage
- dictionary
20. A business intelligence professional establishes what data will be collected, stored, and provided in a database. They also confirm how relationships are defined and the security of the data. What process does this scenario describe?
- Optimization
- Iteration
- Database modeling
- Creating business rules
21. What process involves confirming the validity of database keys, preserving table relationships, and ensuring that conventions are consistent?
- Typecasting
- Sorting
- Schema validation
- Data composition
22. Fill in the blank: A data _____ is a collection of information that describes the content, format, and structure of data objects within a database, as well as their relationships.
- map
- lineage
- model
- dictionary
23. What elements of database design are affected by business rules? Select all that apply.
- The maturity of the data
- The security of the data
- What data is collected, stored, and provided
- How relationships are defined
24. A business intelligence team analyzes current data in order to confirm that stakeholders gain the most up-to-date insights in the future. In this situation, what aspect of data do they consider?
- Timeliness
- Redundancy
- Maturity
- Conformity
25. Fill in the blank: Quality testing is the process of checking data for _____ in order to prevent system failures.
- granularity
- links
- scalability
- defects
26. What are the goals of schema validation? Select all that apply.
- To ensure consistent conventions
- To confirm the validity of database keys
- To establish row-based permissions
- To preserve table relationships
27. Which of the following statements accurately describe data dictionaries and data lineages? Select all that apply.
- A data dictionary describes the process of identifying the origin of data, where it has moved throughout the system, and how it has transformed over time.
- A data dictionary is a collection of information that describes the content, format, and structure of data objects within a database.
- A data lineage is a collection of information that describes the content, format, and structure of data objects within a database, as well as their relationships.
- A data lineage describes the process of identifying the origin of data, where it has moved throughout the system, and how it has transformed over time.