2
CYBER ANALYTICS – DATA SET SELECTION
Cyber Analytics – Data Set Selection
A summary of the data set you have chosen
During COVID-19 in the United States, a lot happened, some of which were
hospitalizations of the patients. The selected data is about COVID-19 hospitalizations. The data
is from the California COVID-19 State Dashboard which is at
https://covid19.ca.gov/state-
dashboard/
. As the data is specifically about hospitalization, the counts includes all patients who
were diagnosed with COVID-19 during the stay. However, from the data, it does not necessary
mean the patients were hospitalized because of COVID-19 complications or they had
experienced complications as a result of COVID-19. However, the data does not show the
cumulative totals because hospitals report were reporting the total number of patients each day
and not necessarily new patients.
A web link or directions for access this data set
https://catalog.data.gov/dataset/covid-19-hospital-data-7af67
Identification of the dimensions and measure fields that are part of this data set
A dataset is characterized by dimensions and measures, which are significantly different.
The selected dataset has both dimensions and measures. Dimensions are the type of fields
showing people, places, things, time, or events. Dimensions also use an identification number
and in this dataset, dimensions are only the county. There are also measures which are the fields
that store numbers capable of being totaled and averaged. Measures in the dataset are
hospitalized_covid_confirmed_patients, hospitalized_suspected_covid_patients,
all_hospital_beds, icu_covid_confirmed_patients, icu_suspected_covid_patients, and
icu_available_beds.