Ted Tenhagen DAT-500-Q4409 4-3 Journal: Data Validation and Discovery Walkthrough
After completing the data assessment, why do you think running a summary function on the data set is important (or not important)?
I do believe that running a summary function on the data set you are about to analyze is important. After performing the summary, you as the analyst would get a baseline for the information you are going to be working with and can help you better make conclusions about what the data means. How does reviewing the minimum, maximum, and average of a field in the data help describe the data?
Like I stated above, when performing the data assessment, you want to get as much information on the data you are working with. The minimum, maximum, and average help to give you even more understanding of the data. You would be able to tell if there are any outliers with these three stats, allowing you to gain more insight into what you are looking at and help to explain in your reporting why some numbers are the way they are. Describe in your own words what data assessment is.
To me, data assessment is when the data analyst gets an overview of the data that they as going to be analyzing. Doing this sets the baseline for the analysis you are about to do and can help guide you in your reporting. When performing the analysis, you as the analyst should be following the same procedure that the other analyst on your team follow in order to make sure that every report from the team has the same credibility.