Chapter 2 Handout
docx
keyboard_arrow_up
School
University of the Pacific, Stockton *
*We aren’t endorsed by this school
Course
293E
Subject
Information Systems
Date
Feb 20, 2024
Type
docx
Pages
3
Uploaded by ChefEel4028
Chapter 2 Handout
Part One: Multiple Choices
1. Mastering the data can also be described via the ETL process. The ETL process stands for:
a.
extract, total, and load data.
b.
enter, transform, and load data.
c.
extract, transform, and load data.
d.
enter, total, and load data.
2. The advantages of storing data in a relational database include which of the following?
a.
Help in enforcing business rules.
b.
Increased information redundancy.
c.
Integrating business processes.
d.
Help in enforcing business rules and integrating business processes.
3. The purpose of transforming data is:
a.
to validate the data for completeness and integrity.
b.
to load the data into the appropriate tool for analysis.
c.
to obtain the data from the appropriate source.
d.
to identify which data are necessary to complete the analysis.
4. Which attribute is required to exist in each table of a relational database and serves as the “unique identifier” for each record in a table?
a.
Foreign key
b.
Unique identifier
c.
Primary key
d.
Key attribute
5. The metadata that describe each attribute in a database is which of the following?
a.
Composite primary key
b.
Data dictionary
c.
Descriptive attributes
d.
Flat file
6. As mentioned in the chapter, which of the following is not
a common way that data will need to be cleaned after extraction and validation?
a.
Remove headings and subtotals.
b.
Format negative numbers.
c.
Clean up trailing zeroes.
d.
Correct inconsistencies across data.
7. Why is Supplier ID considered to be a primary key for a Supplier table?
a.
It contains a unique identifier for each supplier.
b.
It is a 10-digit number.
c.
It can either be for a vendor or miscellaneous provider.
d.
It is used to identify different supplier categories.
8. What are attributes that exist in a relational database that are neither primary nor foreign keys?
a.
Nondescript attributes
b.
Descriptive attributes
c.
Composite keys
d.
Relational table attributes
9. Which of the following questions are not suggested by the Institute of Business Ethics to allow
a business to create value from data use and analysis, and still protect the privacy of stakeholders?
a.
How does the company use data, and to what extent are they integrated into firm strategy?
b.
Does the company send a privacy notice to individuals when their personal data are collected?
c.
Does the data used by the company include personally identifiable information?
d.
Does the company have the appropriate tools to mitigate the risks of data misuse?
Part Two:
1: Even though it is preferable to store data in a relational database, storing data across separate tables can make data analysis cumbersome. Describe three reasons it is worth the trouble to store
data in a relational database.
-
Relational databases connect with each other by foreign keys/ primary keys. Easy, efficient 1: completeness
2: save space
3: reenforce business rules and internal controls
4: improve communication
2: Identify which ETL tasks would be considered “Validate” the data, and which would be considered “Cleaning” the data.
ETL Task
Validate or Clean?
1.
Compare the number of records that were extracted to the number of records in the source
database
Validate
2.
Remove headings or subtotals
Cleaning
3.
Remove leading zeroes and nonprintable characters
Cleaning
4.
Compare descriptive statistics for numeric fields
Validate
5.
Format negative numbers
Cleaning
6.
Compare string limits for text fields
Validate
7.
Correct inconsistencies across data, in general
Cleaning
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help