What Is a Data Warehouse? Warehousing Data, Data Mining Explained

pdf

School

Marshall University *

*We aren’t endorsed by this school

Course

CS-582

Subject

Information Systems

Date

Oct 30, 2023

Type

pdf

Pages

10

Uploaded by KidSnow15259

9/16/23, 9:33 AM
https://www.investopedia.com/terms/d/data-warehousing.asp

What Is a Data Warehouse?

A data warehouse is the secure electronic storage of information by a business or other organization. The goal of a data warehouse is to create a trove of historical data that can be retrieved and analyzed to provide useful insight into the organization's operations.

A data warehouse is a vital component of business intelligence. That wider term encompasses the information infrastructure that modern businesses use to track their past successes and failures and inform their decisions for the future.

Key Takeaways

  • A data warehouse is the storage of information over time by a business or other organization.
  • New data is periodically added by people in various key departments such as marketing and sales.
  • The warehouse becomes a library of historical data that can be retrieved and analyzed in order to inform decision-making in the business.
  • The key factors in building an effective data warehouse include defining the information that is critical to the organization and identifying the sources of the information.
  • A database is designed to supply real-time information. A data
    warehouse is designed as an archive of historical information.

How a Data Warehouse Works

The need to warehouse data evolved as businesses began relying on computer systems to create, file, and retrieve important business documents. The concept of data warehousing was introduced in 1988 by IBM researchers Barry Devlin and Paul Murphy.[1]

Data warehousing is designed to enable the analysis of historical data. Comparing data consolidated from multiple heterogeneous sources can provide insight into the performance of a company. A data warehouse is designed to allow its users to run queries and analyses on historical data derived from transactional sources.

Data added to the warehouse does not change and cannot be altered. The warehouse is the source that is used to run analytics on past events, with a focus on changes over time. Warehoused data must be stored in a manner that is secure, reliable, easy to retrieve, and easy to manage.

Maintaining a Data Warehouse

Maintaining a data warehouse involves several steps. The first is data extraction, which involves gathering large amounts of data from multiple source points. After a set of data has been compiled, it goes through data cleaning, the process of combing through it for errors and correcting or excluding any that are found.

The cleaned-up data is then converted from a database format to a warehouse format. Once stored in the warehouse, the data is sorted, consolidated, and summarized so that it is easier to use. Over time, more data is added to the warehouse as the various data sources are
updated.

A key book on data warehousing is W. H. Inmon's Building the Data Warehouse, a practical guide that was first published in 1990 and has been reprinted several times.[2]

Today, businesses can invest in cloud-based data warehouse software services from companies including Microsoft, Google, Amazon, and Oracle, among others.[3]

Data Mining

Businesses warehouse data primarily for data mining. That involves looking for patterns of information that will help them improve their business processes.

A good data warehousing system makes it easier for different departments within a company to access each other's data. For example, a marketing team can assess the sales team's data in order to make decisions about how to adjust their sales campaigns.

The 5 Steps of Data Mining

The data mining process breaks down into five steps:

1. An organization collects data and loads it into a data warehouse.
2. The data are then stored and managed, either on in-house servers or in a cloud service.
3. Business analysts, management teams, and information technology professionals access and organize the data.
4. Application software sorts the data.
5. The end-user presents the data in an easy-to-share format, such as a
   graph or table.

Image: Investopedia / Theresa Chiechi

The concept of the data warehouse was introduced by two IBM researchers in 1988.[4]

Data Warehouse Architecture

The design of a data warehouse is known as its data warehouse architecture. Depending on the needs of the warehouse, the architecture can come in a variety of tiers. Typically there are single-tier, two-tier, and three-tier architecture designs.

Single-tier Architecture: Single-tier architecture is rarely used to build data warehouses for real-time systems. It is often used for
batch and real-time processing of operational data. A single-tier design is composed of a single layer of hardware with the goal of keeping data space at a minimum.

Two-tier Architecture: In a two-tier architecture design, the analytical process is separated from the business process. The point of this is to increase levels of control and efficiency.

Three-tier Architecture: A three-tier architecture design has a top, middle, and bottom tier; these are known as the source layer, the reconciled layer, and the data warehouse layer. This design is suited for systems with long life cycles. When changes are made in the data, an extra layer of review and analysis of the data is completed to ensure there have been no errors.

Regardless of the number of tiers, every data warehouse architecture must have the same five properties: separation, scalability, extensibility, security, and administrability.

Data Warehouse vs. Database

A data warehouse is not the same as a database:

  • A database is a transactional system that monitors and updates real-time data in order to have only the most recent data available.
  • A data warehouse is programmed to aggregate structured data over time.

For example, a database might only have the most recent address of a customer, while a data warehouse might have all the addresses of the customer for the past 10 years.

Data mining relies on the data warehouse. The data in the warehouse is
sifted for insights into the business over time.

Data Warehouse vs. Data Lake

Both data warehouses and data lakes hold data for a variety of needs. The primary difference is that a data lake holds raw data whose purpose has not yet been determined. A data warehouse, on the other hand, holds refined data that has been filtered to be used for a specific purpose.

Data lakes are primarily used by data scientists, while data warehouses are most often used by business professionals. Data lakes are also more easily accessible and easier to update, while data warehouses are more structured and any changes to them are more costly.

Data Warehouse vs. Data Mart

A data mart is a smaller version of a data warehouse. A data mart collects data from a small number of sources and focuses on one subject area. Data marts are faster and easier to use than data warehouses.

Data marts typically function as a subset of a data warehouse to focus on one area for analytical purposes, such as a specific department within an organization. Data marts are used to help make business decisions by helping with analysis and reporting.

Advantages and Disadvantages of Data Warehouses

A data warehouse is intended to give a company a competitive advantage. It creates a resource of pertinent information that can be tracked over time and analyzed in order to help a business make more informed decisions.
It also can drain company resources and burden its current staff with routine tasks intended to feed the warehouse machine.

Some other disadvantages include the following:

  • It takes considerable time and effort to create and maintain the warehouse.
  • Gaps in information, caused by human error, can take years to surface, damaging the integrity and usefulness of the information.
  • When multiple sources are used, inconsistencies between them can cause information losses.

Advantages

  • Provides fact-based analysis on past company performance to inform decision-making.
  • Serves as a historical archive of relevant data.
  • Can be shared across key departments for maximum usefulness.

Disadvantages

  • Creating and maintaining the warehouse is resource-heavy.
  • Input errors can damage the integrity of the information archived.
  • Use of multiple sources can cause inconsistencies in the data.

What Is a Data Warehouse and What Is It Used for?

A data warehouse is an information storage system for historical data that can be analyzed in numerous ways. Companies and other organizations draw on the data warehouse to gain insight into past performance and plan
improvements to their operations.

What Is a Data Warehouse Example?

Consider a company that makes exercise equipment. Its best seller is a stationary bicycle, and it is considering expanding its line and launching a new marketing campaign to support it.

It goes to its data warehouse to understand its current customers better. It can find out whether its customers are predominantly women over 50 or men under 35. It can learn more about the retailers that have been most successful in selling its bikes, and where they're located. It might be able to access in-house survey results and find out what its past customers have liked and disliked about its products.

All of this information helps the company decide what kind of new model bicycles it wants to build and how it will market and advertise them. It's hard information rather than seat-of-the-pants decision-making.

What Are the Stages of Creating a Data Warehouse?

There are at least seven stages to the creation of a data warehouse, according to ITPro Today, an industry publication. They include:

  • Determining the business objectives and the key performance indicators.
  • Collecting and analyzing the appropriate information.
  • Identifying the core business processes that contribute the key data.
  • Constructing a conceptual data model that shows how the data are displayed to the end-user.
  • Locating the sources of the data and establishing a process for feeding data into the warehouse.
  • Establishing a tracking duration. Data warehouses can become unwieldy. Many are built with levels of archiving, so that older information is retained in less detail.
  • Implementing the plan.[5]

Is SQL a Data Warehouse?

SQL, or Structured Query Language, is a computer language that is used to interact with a database in terms that it can understand and respond to. It contains a number of commands such as "select," "insert," and "update." It is the standard language for relational database management systems.[6]

A database is not the same as a data warehouse, although both are stores of information. A database is an organized collection of information. A data warehouse is an information archive that is continuously built from multiple sources.[7]

What Is ETL in a Data Warehouse?

"ETL" stands for "extract, transform, and load." ETL is a data process that extracts data from multiple sources, transforms it into a single consistent format, and loads it into a data warehouse or similar data system. It is used in data analytics and machine learning.

The Bottom Line

The data warehouse is a company's repository of information about its business and how it has performed over time. Created with input from employees in each of its key departments, it is the source for analysis that reveals the company's past successes and failures and informs its decision-making.
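
To make the extract, transform, and load steps described under "What Is ETL in a Data Warehouse?" more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not how any particular warehouse product works: the two source systems, their field names, and the sales_history table are all hypothetical, and an in-memory SQLite database stands in for a real warehouse.

```python
import sqlite3

# Hypothetical source systems; the names and fields are made up for illustration.
SALES_SYSTEM = [
    {"customer": "Ada", "amount": "120.50", "day": "2023-01-05"},
    {"customer": "ada ", "amount": "80.00", "day": "2023-02-11"},
    {"customer": "Bob", "amount": "", "day": "2023-02-12"},  # missing amount
]
WEB_SHOP = [
    {"customer": "Bob", "amount": "40.25", "day": "2023-03-01"},
]


def extract():
    """Extract: gather raw rows from every source, tagged with their origin."""
    for source_name, rows in (("sales", SALES_SYSTEM), ("web", WEB_SHOP)):
        for row in rows:
            yield source_name, row


def transform(tagged_rows):
    """Transform: exclude rows with errors and convert the rest to one format."""
    for source_name, row in tagged_rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # data cleaning: skip rows whose amount cannot be parsed
        yield (row["customer"].strip().title(), amount, row["day"], source_name)


def load(conn, clean_rows):
    """Load: append the cleaned rows to the (hypothetical) warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales_history "
        "(customer TEXT, amount REAL, day TEXT, source TEXT)"
    )
    conn.executemany("INSERT INTO sales_history VALUES (?, ?, ?, ?)", clean_rows)
    conn.commit()


def run_etl():
    conn = sqlite3.connect(":memory:")  # stand-in for a real warehouse
    load(conn, transform(extract()))
    return conn


def totals_by_customer(conn):
    """Summarize the stored history with a SQL "select" query."""
    query = (
        "SELECT customer, SUM(amount) FROM sales_history "
        "GROUP BY customer ORDER BY customer"
    )
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    print(totals_by_customer(run_etl()))
```

The sketch only appends rows and never updates them, which mirrors the idea that warehoused data is a historical record; the final query shows the kind of summary an analyst might run over that history.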
