Project 1 -Hughes

docx

School

Southern New Hampshire University *

*We aren’t endorsed by this school

Course

300

Subject

Industrial Engineering

Date

Feb 20, 2024

Type

docx

Pages

4

Uploaded by ChancellorExplorationLeopard29

Report
DAT-300 Data Valid: Getting Right Data Dr. Francine Adams Patricia Hughes July 21, 2023 Project One
Preliminary Data The company National Motors has acquired Kansas City Motors, a small firm. We have found that National Motors and Kansas City Motors are using different databases to organize their data, they are both also using different formats to store this data. The Kansas City and Warehouse data set is an Excel spreadsheet that includes 5 other city’s data. The Kansas City Monthly Totals data is on a word document that has the month number and the sold motor numbers on it. These will need to be merged to have everything together. The organizational problem is to merge the inventory data of National Motors and Kansas City Motors while we maintain data integrity and reliability. Since these data sets are in different formats and databases, this can lead to data integration issues when trying to merge them. Due to the different formats, there can be inconsistencies in the data structure and data types. Data Usability After examining the data set for analysis, there does appear to be some inconsistent data. Since the data is in two different formats it will be more difficult to maintain the data integrity. Being able to use good or healthy data is extremely important to the business. “Dirty data” can lead to large negative impact that will have larger consequences with the gap analysis. A gap analysis needs to have accurate data in order to be able to identify the areas in which things need to be improved. For this to happen accurate and clean data need to be used. This will give us the highest quality data for better decision-making. The different databases will make for a challenge when merging the data. We will need to transform the data into one standardized format. This is an important step because once the data is in one format it will facilitate the merge and have a more consistent data set. Here we will see if there are any discrepancies, missing or duplicate values, or any quality issues. Completeness and Accuracy During the review of the data, it does look like there is an inconsistency with the Kansas City Monthly Totals word document. This document shows that there was an addendum made for the 28 th month, which shows to have sold 286 motors. This value could be an outlier, anomaly, or error made. This would be something to go back and double check to make sure this value is correct. That is a drastic change from the 1 st to the 27 th months that range from 2900- 3120 motors sold. Both data sets do show the same number of months for data recorded. They do not show which months. Since both data sets are recorded in different formats, they will need to be merged into one format to be properly analyzed. Any errors found would need to be corrected before continuing and they would need documented to show the correction. These errors could have been simply done by human error. Afterwards the different formats could either transform it or omit it from the merger. That is why having accurate and clean data is so important. After the merger this issue can be prevented by
ensuring clean data is being used. As well as having the same format will help tremendously. Data quality audits will help ensure the reports have the best data available. Data for Organizational Challenge The data that should be retained and that will have the most relevance is will need to be analyzed for the gap analysis. After converting the data and using one format and database, we will want to make sure that all the data in the 28 months is transferred over correctly. This includes the 28 th month that look like an outlier or error. We would need to transfer the month and motors sold for each month. This would need to be converted into the Kansas City and Warehouse Data Set Excel spreadsheet, it will be placed with the other city’s data. Limitations for Data Usage In regards to limitations for data usage regarding this data for regulatory, ethical, and legal considerations there are none. This data set does not contain any personal or private information about a customer. This data set is regarding the inventory of a company, it does not show to who it was sold to, just the number of motors sold in a month. The only thing that would warrant any regulatory usage of this data is that they would need to keep these numbers within the company. Only employees within the company and/or department would really need to know this data, depending on company policies. If only a certain department would need this information, then only, they should have access. Access limitations for employees would need to be placed. Summary This preliminary assessment for a gap analysis shows that there will be organizational challenges with the merger of National Motors and Kansas City Motors, due to the different databases and formats for data sets. These challenges can be overcome by careful examination of the data set and ensuring that the highest quality and clean data is being transferred. If a problem arises with missing, incomplete, or erroneous data, that would need to be looked at and fixed. Also, ensuring there no limitations regarding the data, will help keep the data safe and protected. The merger could be achieved with little to no issues if these problems are addressed.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Resources Angie. (2022). Dirty Data: How Much it Can Cost Your Business and How to Get Rid of it. SyncApps, the #1 Integration Cloud . https://www.cazoomi.com/blog/dirty-data-how- much-it-can-cost-your-business-and-how-to-get-rid-of-it/#:~:text=On%20average%2C %20organizations%20believe%20that,%2C%20and%20unreliable%20decision %2Dmaking. Guides: Data Management: Privacy and Ethics in Data Management . (n.d.). https://libguides.library.cpp.edu/c.php?g=1026882&p=8989928