M10-Assignment

docx

School

Sacramento City College *

*We aren’t endorsed by this school

Course

457

Subject

Computer Science

Date

Jan 9, 2024

Type

docx

Pages

8

Uploaded by w1552680

Report
CISP 457 M10 – Assignment QUESTIONS: 1. Describe the two steps to data storage design. The first step is to select the appropriate format for the data storage. There are several different methods of storing data (files, relational data bases, multidimensional databases, object-oriented databases) and the analyst should select the one that will provide the best approach to storing the system data. Second, the data storage must be designed to optimize its processing efficiency, which involves considering how the data will be used, and making the appropriate design decisions. 2. How are a file and a database different from each other? Files are essentially an electronic list of information that is formatted for a particular transaction. Any programs that are written must be developed to work with the file exactly as it is laid out. If there is a need to combine data in a new way, a new file must be created (usually by extracting data from other files) and a program written to work specifically with that new file. Databases, on the other hand, are made up of a collection of data sets that are related to each other in some way. Database management system software creates these data groupings. The DBMS provides access to the data and can usually provide access to any desired subset of data. It is not necessary to write new programs to build a new file in order to retrieve data from the database in a new way. 3. What is the difference between the end – user database and an enterprise database? Provide an example of each one. An end-user database is one that is designed to run on a PC and is used to create personal database applications. An end-user in sales might develop a Microsoft Access database, for example, to keep track of current and prospective client contacts. An enterprise database is one that is capable of handling huge volumes of information for the entire organization. Applications that serve the entire enterprise can be built upon these enterprise databases. These databases are fast, high capacity, but also complex. Oracle is a vendor of enterprise database management systems. 4. Name five types of files, and describe the primary purpose of each type. Master files store the business's or application's core data. The data in a master file is considered fairly permanent, does not change much, and is usually retained for long periods. Look-up files contain reference information that is used primarily during validation processing. A list of valid code values for a field might be referred to during data entry to ensure that a valid code was entered.
CISP 457 Transaction files contain information that will be used to update a master file. These files are usually temporary in nature; they are used to collect transactions, the transactions update the master file, and then the transaction files are archived. Audit files are used to help trace changes made to files. An image of a record before and after a change may be written to an audit file so that the change is documented. History files serve as archives for older information that is rarely used. These files can be accessed, if necessary, but are usually stored on tape to remove the little- used data from the active data storage areas. 5. Name two types of legacy databases and the main problems associated with each type. Hierarchical databases use hierarchies, or inverted trees, to represent relationships. The main problem with this database model is that it cannot be used efficiently to represent non-hierarchical associations. Network databases avoid this problem but require a considerable amount of programming effort. Programs must be written to follow the database structure, and if the database structure changes, then complex programming must be done to change the application programs as well. 6. What is the most popular kind of database today? Provide three examples of products that are based on this technology. Relational databases are most popular today due to their ease of use and conceptual simplicity. Examples of relational DBMSs on the market include MS Access, Oracle, DB2, Sybase, Informix, and MS SQL Server. 7. What is referential integrity, and how is it implemented in a relational database? Referential integrity refers to the need to make sure that the values linking the table together through the primary and foreign keys are valid and correctly synchronized. For example, if a customer is placing an order, we need to have information on the customer in the customer table. The RDBMS will check to see if there is a record for that customer in the Customer table before it will let an order be entered. Checking for known required relationships helps assure referential integrity. 8. What is the biggest strength of the object database? Describe two of its weaknesses. The biggest strength of object databases is the reusability of objects. This accelerates system development and helps keep costs manageable. Object databases are also very suitable to store complex data (e.g., graphics, video, and sound). Two weaknesses of object databases and the lack of experienced developers and the steep learning curve associated with OODBMSs. 9. How does the multidimensional database store data? Multidimensional databases store data using several dimensions. Data may be aggregated and/or detailed, depending upon the access needs of the users. 10. What are the two most important factors in determining the type of data storage that should be adopted for a system. Why are these factors so important? First, evaluate the type of data that will be stored. Relational databases are the standard for simple data such as numbers, text, and dates. If the data is more
CISP 457 complex (video, images, or audio), then object databases may be required. If the data needs to be aggregated, then multidimensional databases are recommended. The second factor is the type of system being developed. Transaction processing systems require rapid update and retrieval capability, and will best be constructed using files, relational databases, or object databases. Decision support types of applications require rapid access to data in ad hoc ways. These types of systems are best implemented using relational or multidimensional databases. These two factors are very important because you must select a data storage format that is suitable for the data the system will include and the uses planned for that data. 11. What should you consider the storage formats that already exist in an organization when deciding on a storage format for a new system? This factor is important because the project team needs to be aware of the existing base of technical skills that are available to work with the data storage format. If a data storage format is chosen that is new to the organization, then the team must allocate training and learning time into the project schedule. 12. What are the differences between the logical and physical ERD? The logical ERD represents the data required by the application and presents a 'business view' of the data without including implementation details. The physical ERD includes all elements of a logical ERD but includes implementation details which aid in describing characteristics of the system and presenting a 'systems view' of the new system. 13. Describe the metadata associated with the physical ERD. Metadata included in the physical ERD includes information regarding attributes such as data type, field size, format, default values, primary keys, and foreign keys. 14. Describe the purpose of the primary and foreign keys. A primary key serves as the unique identifier for each record to be stored in a table, and one is required for identified for each table. A foreign key is one in which an attribute in one table is the primary key in another. Identifying foreign keys is important in enforcing referential integrity. 15. Name three ways that null values in a database can be interpreted. Why is this problematic? A null value in a field can indicate that there should not be a value in the field (i.e., blank is correct). It can also mean that an error was made, and a value that should have been entered was incorrectly omitted. It can also indicate that a value for the field has been deleted, which may or may not be correct. The difficulty in really knowing why the null exists is the major problem with nulls. EXERCISES: A. Using the Web or other resources, identify a product that can be classified as an end – user database and products that can be classified as an enterprise database. How are the
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
CISP 457 products described and marketed? What kinds of applications and users do they support? In what kinds of situations would an organization choose to implement an end – user database over an enterprise database? One of the more famous type of end- user database is Microsoft Access. Microsoft Access is part of the Microsoft 365 and is bundled with the package. The Microsoft 365 is a popular desktop application that is used by most workstations around the world. Naturally, being a bundle, Access is identified as a productivity utility that end users can use to create local applications. It is meant to help them make their day-to-day work easier. When it comes to applications, MS Access can be used with VBA to create light desktop applications that the employees can use. It requires minimal planning, and some basic knowledge of VBA is enough to build these applications. These applications usually work as data entry, sorting, tabulation, and reporting tools. Enterprise database is a more robust database that is used for large scale purposes. Consider Oracle. Their database is solely targeted towards enterprises. End users do not use Oracle or cannot afford it. Also given the scale at which Oracle DB can operate, it is mostly used by large enterprises and BFSI industry. Oracle is very popular among banks because the software is extremely robust and stable. For banks, it is critical to maintain high security and stable financial records and Oracle provides just that. B. Visit a commercial website (e.g., CDnow, Amazon.com). If files were being used to store the data supporting the application, what types of files would be needed? What data would they contain? If the data is being stored in databases, these are the types of files someone may encounter: o Text files o CSV files o Excel Files o Database Files o JSON Files For amazon.com or for most customer centric websites, if the file contains information about product listings, then that file will be considered “public.” If the file contains or stores customer information or seller information, then it must be “private.” The files would contain data about product listings, customer account information, personal information, and order details. C. Using the Web, review one of the products listed at the end of this exercise. What are the main features and functions of the software? In what companies has the database management system (DBMS) been implemented, and for what purposes? According to the information that you found, what are three strengths and weaknesses of the product? Relational DBMS o Main Features and Functions of Relational DBMS:
CISP 457 o Tables and Relations: Relational DBMS stores data in tables, where each table represents an entity, and relations define how tables are related to each other. o Structured Query Language (SQL): SQL is used to interact with relational DBMS. o Data Integrity: Relational DBMS enforces data integrity through constraints like primary keys, foreign keys, and unique constraints, ensuring data accuracy. o ACID Compliance: It ensures that database transactions are Atomic, Consistent, Isolated, and Durable, guaranteeing data consistency and reliability. o Indexing: It uses indexes to speed up data retrieval operations, making queries faster. Relational DBMS like MySQL, PostgreSQL, and Microsoft SQL Server are widely used across various industries. For instance: o MySQL: Used by companies like Facebook, Airbnb, and Twitter for data storage and retrieval in their web applications. o PostgreSQL: Trusted by organizations like Apple, Cisco, and Fujitsu for managing data in various applications, including e-commerce and financial systems. Strengths: o Data Integrity: Relational DBMS ensures data accuracy and consistency through constraints. o ACID Compliance: Transactions are guaranteed to be reliable and recoverable. o Standardized Query Language: SQL is a widely accepted standard, making it easier to learn and use. Weaknesses: o Scalability: Scaling relational DBMS for large datasets can be challenging and costly. o Schema Changes: Modifying the database schema can be complex and time-consuming. o Performance: D. You have been given a file that contains fields relating to CD information. Using the steps of normalization, create a logical data model that represents this file in third normal form. The fields include the following: Musical group name Musicians in group Date group was formed Group’s agent CD title 1 CD title 2
CISP 457 CD title 3 CD 1 length CD 2 length CD 3 length The assumptions are as follows: Musicians in group contains a list of the members in the musical group. Musical groups can have more than one CD, so both group name and CD title are needed to uniquely identify a particular CD. E. Jim Smith’s dealership sells Fords, Hondas, and Toyotas. The dealership keeps information about each car manufacturer with whom it deals so that employees can get in touch with manufacturers easily. The dealership staff also keeps information about the models of cars that the dealership carries from each manufacturer. They keep such information as list price, the price the dealership paid to obtain the model, and the model’s name and series (e.g., Honda Civic LX). They also keep information about all sales that they have made. (For instance, they will record the buyer’s name, the car he or she bought, and the amount he or she paid for the car.) So the staff can contact the buyers in the future, contact information is also kept (e.g., address, phone number). Create a logical data model. Apply the normalization on the model to check for processing efficiency.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
CISP 457 F. Describe how you would denormalize the model that you created in question 29. Draw the new physical model on the basis if your suggested changes. How would performance be affected by your suggestions? MINICASE #1 The analysts use the structure chart for the program design. The structure chart is divided into two parts: Transaction Structure o It contains modules which call its lower modules and perform the tasks assigned to it. Transform Structure o It identified the primary functional components and its high level input and output components VOLUMETRICS ANALYSIS
CISP 457 Table Name Average Recorded Size Overhead Factor Total Record Size Initial Table Size Initial Table Volume New Vehicle 65 35% 88 10,000 880,000 Tade – in Vehicle 48 35% 65 7,500 487,500 Sales Invoice 76 35% 103 16,000 1,648,000 Customer 61 35% 83 13,000 1,079,000 Salesperson 34 35% 46 100 4,600 Installed Option 16 35% 22 25,000 550,000 Option 28 35% 38 500 19,000 Total initial volume: 4,668,100 Growth rate: 10% Total volume in three years: 6,213,241 Database requires to allowing 6,213,241, so hardware is require to allow that much size of character data.