Peform this in Python. Utilize vectorized calculations and not use any form of loops. Make a data frame consisting of 20 and 10 columns. Each column j should consist of 20 values from a normal distribution with mean (j-1) and standard deviation 0.5j. For example, the third column should be normal(mean=2, sd=1.5). Using this data frame, do each of the following (using code, of course): Find the mean and standard deviation for each column. Write code that counts the number of columns for which the sample mean and sample standard deviation are within 20% of the values used to generate the data. Write code that writes the columns from part b to a new data frame. For each value in the new data frame, subtract its column mean and divide by the column standard deviation.
Peform this in Python. Utilize vectorized calculations and not use any form of loops. Make a data frame consisting of 20 and 10 columns. Each column j should consist of 20 values from a normal distribution with mean (j-1) and standard deviation 0.5j. For example, the third column should be normal(mean=2, sd=1.5). Using this data frame, do each of the following (using code, of course): Find the mean and standard deviation for each column. Write code that counts the number of columns for which the sample mean and sample standard deviation are within 20% of the values used to generate the data. Write code that writes the columns from part b to a new data frame. For each value in the new data frame, subtract its column mean and divide by the column standard deviation.
Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
Related questions
Question
Peform this in Python. Utilize
column j should consist of 20 values from a normal distribution with mean (j-1) and standard deviation 0.5j. For example, the third column should be normal(mean=2, sd=1.5). Using this data frame, do each of the following (using code, of course):
- Find the mean and standard deviation for each column.
- Write code that counts the number of columns for which the sample mean and sample standard deviation are within 20% of the values used to generate the data.
- Write code that writes the columns from part b to a new data frame.
- For each value in the new data frame, subtract its column mean and divide by the column standard deviation.
Expert Solution
This question has been solved!
Explore an expertly crafted, step-by-step solution for a thorough understanding of key concepts.
This is a popular solution!
Trending now
This is a popular solution!
Step by step
Solved in 2 steps with 5 images
Knowledge Booster
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.Recommended textbooks for you
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education