a) Read the Excel file as a pandas DataFrame, and print the bottom 5 rows in the file. b) Select the "Name", "Gender", "Age" and "Weight" columns for female patients whose age is above 50. Print the top 5 rows in this dataset. c) Find the mean and the sample standard deviation of the weight of all patients in the dataset and print them out. Round the numbers to 2 decimal places. d) Extract the patients whose weight is 1.2 standard deviations above the mean, i.e., with weight greater than (mean + 1.2 * sample standard deviations). Write the name, age, gender and weight of these patients to an Excel file named "overweight patients.xlsx" without row labels.

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
Question

Hope to solve this question *{{D}}* requirements, this question I find in internet, Hope to explain in the code using #. The Excel file is in the link. Thanks a lot https://we.tl/t-cs1Lxia7S6

You are given an Excel file ("Patient_Information.xlsx") that stores the Patient ID, Name,
gender, age, weight and systolic blood pressure of patients. Write python codes to complete
the following tasks. Name your python program as "LA3_2.py".
b)
a) Read the Excel file as a pandas DataFrame, and print the bottom 5 rows in the file.
Select the "Name", "Gender", "Age" and "Weight" columns for female patients whose age
is above 50. Print the top 5 rows in this dataset.
c) Find the mean and the sample standard deviation of the weight of all patients in the dataset
and print them out. Round the numbers to 2 decimal places.
d) Extract the patients whose weight is 1.2 standard deviations above the mean, i.e., with
weight greater than (mean + 1.2 * sample standard deviations). Write the name, age, gender
and weight of these patients to an Excel file named "overweight_patients.xlsx" without
row labels.
e) Create a new column "Age Group". If the "Age" of a patient is from 30 to 39 (including
the boundaries), the value of "Age Group" will be "30-39". Do similarly for the following
"Age Group" values: “40 – 49", "50 – 59", "60 or above". Print a table to show the number
of values in each age group.
f) Construct a pivot table that summarises the mean systolic blood pressure of patients in
different gender and age groups. Print the pivot table.
g) Mask the surname of the patient, except the first letter, with five "*" characters. For
example, if the patient has a name “John Smith”, the Name value should store “John
S*****". Print the top 5 rows of the modified "Name" column.
Transcribed Image Text:You are given an Excel file ("Patient_Information.xlsx") that stores the Patient ID, Name, gender, age, weight and systolic blood pressure of patients. Write python codes to complete the following tasks. Name your python program as "LA3_2.py". b) a) Read the Excel file as a pandas DataFrame, and print the bottom 5 rows in the file. Select the "Name", "Gender", "Age" and "Weight" columns for female patients whose age is above 50. Print the top 5 rows in this dataset. c) Find the mean and the sample standard deviation of the weight of all patients in the dataset and print them out. Round the numbers to 2 decimal places. d) Extract the patients whose weight is 1.2 standard deviations above the mean, i.e., with weight greater than (mean + 1.2 * sample standard deviations). Write the name, age, gender and weight of these patients to an Excel file named "overweight_patients.xlsx" without row labels. e) Create a new column "Age Group". If the "Age" of a patient is from 30 to 39 (including the boundaries), the value of "Age Group" will be "30-39". Do similarly for the following "Age Group" values: “40 – 49", "50 – 59", "60 or above". Print a table to show the number of values in each age group. f) Construct a pivot table that summarises the mean systolic blood pressure of patients in different gender and age groups. Print the pivot table. g) Mask the surname of the patient, except the first letter, with five "*" characters. For example, if the patient has a name “John Smith”, the Name value should store “John S*****". Print the top 5 rows of the modified "Name" column.
Expert Solution
steps

Step by step

Solved in 4 steps with 2 images

Blurred answer
Knowledge Booster
Form
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education