Chapter 2 Handout

docx

School

University of the Pacific, Stockton *

*We aren’t endorsed by this school

Course

293E

Subject

Information Systems

Date

Feb 20, 2024

Type

docx

Pages

Uploaded by ChefEel4028

Chapter 2 Handout Part One: Multiple Choices 1. Mastering the data can also be described via the ETL process. The ETL process stands for: a. extract, total, and load data. b. enter, transform, and load data. c. extract, transform, and load data. d. enter, total, and load data. 2. The advantages of storing data in a relational database include which of the following? a. Help in enforcing business rules. b. Increased information redundancy. c. Integrating business processes. d. Help in enforcing business rules and integrating business processes. 3. The purpose of transforming data is: a. to validate the data for completeness and integrity. b. to load the data into the appropriate tool for analysis. c. to obtain the data from the appropriate source. d. to identify which data are necessary to complete the analysis. 4. Which attribute is required to exist in each table of a relational database and serves as the “unique identifier” for each record in a table? a. Foreign key b. Unique identifier c. Primary key d. Key attribute 5. The metadata that describe each attribute in a database is which of the following? a. Composite primary key b. Data dictionary c. Descriptive attributes d. Flat file

6. As mentioned in the chapter, which of the following is not a common way that data will need to be cleaned after extraction and validation? a. Remove headings and subtotals. b. Format negative numbers. c. Clean up trailing zeroes. d. Correct inconsistencies across data. 7. Why is Supplier ID considered to be a primary key for a Supplier table? a. It contains a unique identifier for each supplier. b. It is a 10-digit number. c. It can either be for a vendor or miscellaneous provider. d. It is used to identify different supplier categories. 8. What are attributes that exist in a relational database that are neither primary nor foreign keys? a. Nondescript attributes b. Descriptive attributes c. Composite keys d. Relational table attributes 9. Which of the following questions are not suggested by the Institute of Business Ethics to allow a business to create value from data use and analysis, and still protect the privacy of stakeholders? a. How does the company use data, and to what extent are they integrated into firm strategy? b. Does the company send a privacy notice to individuals when their personal data are collected? c. Does the data used by the company include personally identifiable information? d. Does the company have the appropriate tools to mitigate the risks of data misuse? Part Two: 1: Even though it is preferable to store data in a relational database, storing data across separate tables can make data analysis cumbersome. Describe three reasons it is worth the trouble to store data in a relational database. - Relational databases connect with each other by foreign keys/ primary keys. Easy, efficient 1: completeness 2: save space 3: reenforce business rules and internal controls

4: improve communication 2: Identify which ETL tasks would be considered “Validate” the data, and which would be considered “Cleaning” the data. ETL Task Validate or Clean? 1. Compare the number of records that were extracted to the number of records in the source database Validate 2. Remove headings or subtotals Cleaning 3. Remove leading zeroes and nonprintable characters Cleaning 4. Compare descriptive statistics for numeric fields Validate 5. Format negative numbers Cleaning 6. Compare string limits for text fields Validate 7. Correct inconsistencies across data, in general Cleaning

Your preview ends here

Eager to read complete document? Join bartleby learn and gain access to the full version

Access to all documents
Unlimited textbook solutions
24/7 expert homework help

Chapter 2 Handout

Related Documents