Summary and Descriptive Statistics

docx

School

University of Nairobi *

*We aren’t endorsed by this school

Course

1001

Subject

Nursing

Date

Nov 24, 2024

Type

docx

Pages

5

Uploaded by Carl2030

Report
1 Summary and Descriptive Statistics Grace Wanjiru College of Nursing & Healthcare Professionals, Grand Canyon University HLT-362V-O500: Applied Statistics for Health Care Professionals Prof. Cruz May 14, 2023
2 Descriptive Statistics Measures of Central Tendency: Mean, Median, Mode This paper will use the descriptive statistics on the Excel sheet attached on breast cancer (female only) obtained from the National Cancer Institute (2023). The data compiled was from the years 2000 to 2020 as shown in the table. Mean Mean, which is also called average, refers to the overall summation of the values provided that is then divided by the number of items in a given set of data (Khorana et al., 2023). The formula is expressed as, Mean formula = (Sum of items) ÷ (Total numbers of items) For the given data set, the summation will depict the total of breast cancer cases for each ethnic group from 2000 to 2020. For Hispanics (any race), the total sum of breast cancer cases is = 2013.2 while the total number of items is 21 (years 2000 to 2020) 2013.2/21 = 95.87 Note that I have automatically calculated these formulas on the Excel sheet using the Median formula =AVERAGE (B3:B23). The means for all the ethnic groups are as follows; Non-Hispanic American Indian / Alaska Native = 104.53 Non-Hispanic Asian / Pacific Islander = 98.45 Non-Hispanic Black = 125.59 Non-Hispanic White = 136.15 Median The median refers to the middle number in a given data (Khorana et al., 2023). When the data set uses an even number of items, the median is calculated by computing the average of the two middle numbers. However, in this case, the data set contains an odd number of items (21) meaning that the middle number is the median. The diagnoses for 2008 are the median for this data set and are summarized below; Hispanics (any race) = 94.8 Non-Hispanic American Indian / Alaska Native = 104.7 Non-Hispanic Asian / Pacific Islander = 96.5 Non-Hispanic Black = 124.9 Non-Hispanic White = 135.4 Mode
3 Mode refers to the number in the data set that is repeated the most. In case a modal value cannot be established, one can apply some set of formulas to identify it. However, in most cases, when there is no number that appears most, it is just stated as not applicable. In the data used for this descriptive statistics, the mode for Hispanics (any race) is 91.5 and for Non-Hispanic Asians/Pacific Islanders, the mode is 90.3. These numbers appear more than once but the other ethnic groups did not have any number that appears the most hence the response is N/A. Measures of Variation Variance Variance refers to the measurement of how numbers are distributed in a given set of data (Smith, 2022). Variance is calculated using the following formula; Variance = (Standard deviation)2= σ2 The variance for the ethnic groups is calculated automatically on the Excel sheet using the formula =VAR.S(B3:B23) for Hispanics (any race) and the variance is 18.41 . The variance for the other ethnic groups is as follows; Non-Hispanic American Indian / Alaska Native = 161.97 Non-Hispanic Asian / Pacific Islander = 76.58 Non-Hispanic Black = 34.26 Non-Hispanic White = 16.37 Standard Deviation Standard deviation refers to a measure of how numbers are spread out. Its symbol is σ (the Greek letter sigma) (Smith, 2022). Its formula is simply the square root of the variance. Standard deviation is also calculated automatically using the formula =STDEV.S (B3:B23) and the results are as follows; Hispanics (any race) = 4.29 Non-Hispanic American Indian / Alaska Native = 12.73 Non-Hispanic Asian / Pacific Islander = 8.75 Non-Hispanic Black = 5.85 Non-Hispanic White = 4.04 Range Range refers to the difference between the lowest and the highest values in a data set (Smith, 2022). For example, in (4,5, 7, 2, 8, 10), the lowest value is 2 and the highest value is 10. So the range is 10-2= 8
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
4 Using the same formula, the ranges for the five ethnic groups are as follows; Hispanics (any race) = 14.60 Non-Hispanic American Indian / Alaska Native = 49.7 Non-Hispanic Asian / Pacific Islander = 29.7 Non-Hispanic Black = 21.4 Non-Hispanic White = 13.3 Analysis of the Descriptive Statistics This descriptive analysis is based on the data collected from the National Cancer Institute (2023) precisely on breast cancer cases (female only) diagnosed in the US from 2000 to 2020. The data is categorized based on the five ethnic groups observed above. With regard to the measures of central tendency, Non-Hispanic Whites had the highest mean of 136.15 while Hispanics (any race) had the lowest mean of 95.87. This shows that Whites reported the highest number of breast cancer cases for the 21 years compared to the other ethnic groups in the US. One of the contributing factors is the population of whites compared to the other ethnic groups. The highest median number is 135.40 representing Whites and the lowest median is 94.80 representing Hispanics (any race). The mode is only applicable to Hispanics and Asians/Pacific Islanders who had numbers appearing more than once. The other races did not have any modes hence indicated with N/A. With regard to the measures of variations, Non-Hispanic American Indian / Alaska Native had the highest variance of 161.97 followed by Non-Hispanic Asian / Pacific Islander (76.57) while the Non-Hispanic Whites had the lowest variance of 16.37. Non-Hispanic American Indian / Alaska Native had the highest standard deviation of 12.72 followed by Non- Hispanic Asian / Pacific Islander at 8.75 while Non-Hispanic Whites had the lowest standard deviation of 4.04. the highest range was depicted by Non-Hispanic American Indian / Alaska Native at 49.7 while Non-Hispanic Whites had the lowest range of 13.3.
5 References National Cancer Institute (2023). Breast Cancer: Recent Trends in SEER Age-Adjusted Incidence Rates 2000 – 2020. Retrieved from https://seer.cancer.gov/statistics-network/explorer/application.html? site=55&data_type=1&graph_type=2&compareBy=race&chk_race_1=1&chk_race_6=6 &chk_race_5=5&chk_race_4=4&chk_race_9=9&chk_race_8=8&rate_type=2&sex=3&a ge_range=1&stage=101&advopt_precision=1&hdn_view=1&advopt_show_apc=on&adv opt_display=1 Khorana, A., Pareek, A., Ollivier, M., Madjarova, S. J., Kunze, K. N., Nwachukwu, B. U., Karlsson, J., Marigi, E. M., & Williams, R. J., 3rd (2023). Choosing the appropriate measure of central tendency: mean, median, or mode? Knee surgery, sports traumatology, arthroscopy: official journal of the ESSKA , 31 (1), 12–15. https://doi.org/10.1007/s00167-022-07204-y Smith D. J. (2022). Basic statistical techniques for medical and other professionals a course in statistics to assist in interpreting numerical data . Routledge.