An E-Commerce portal has collected data regarding the customer activity on the website, app, and purchase portal. The portal management wanted to study the relationship between customers' length of Memberships and the yearly money amount spent by the customer buying stuff on their portal (The last two columns in the datasheet). Unfortunately, some information regarding some customers is missing e.g. Length of Membership. A. You have three approaches that needed to be tackled: 1) Fill the missing data using 'zeros' 2) Fill the missing data using the mean of their column 3) Fill the missing data using the median of their column

MATLAB: An Introduction with Applications
6th Edition
ISBN:9781119256830
Author:Amos Gilat
Publisher:Amos Gilat
Chapter1: Starting With Matlab
Section: Chapter Questions
Problem 1P
icon
Related questions
Question
Story:
An E-Commerce portal has collected data regarding the customer activity on the website, app,
and purchase portal. The portal management wanted to study the relationship between
customers' length of Memberships and the yearly money amount spent by the customer buying
stuff on their portal (The last two columns in the datasheet). Unfortunately, some information
regarding some customers is missing e.g. Length of Membership.
A. You have three approaches that needed to be tackled:
1) Fill the missing data using 'zeros'
2) Fill the missing data using the mean of their column
3) Fill the missing data using the median of their column
After trying each of them you need to study the effect of your new data proposition on
the correlation between "Length of Membership" and "Yearly Amount Spent
B. Using Principal Components Analysis (PCA), Reduce the number of independent
variables to only two(use any library you want for PCA)
Transcribed Image Text:Story: An E-Commerce portal has collected data regarding the customer activity on the website, app, and purchase portal. The portal management wanted to study the relationship between customers' length of Memberships and the yearly money amount spent by the customer buying stuff on their portal (The last two columns in the datasheet). Unfortunately, some information regarding some customers is missing e.g. Length of Membership. A. You have three approaches that needed to be tackled: 1) Fill the missing data using 'zeros' 2) Fill the missing data using the mean of their column 3) Fill the missing data using the median of their column After trying each of them you need to study the effect of your new data proposition on the correlation between "Length of Membership" and "Yearly Amount Spent B. Using Principal Components Analysis (PCA), Reduce the number of independent variables to only two(use any library you want for PCA)
Expert Solution
steps

Step by step

Solved in 2 steps

Blurred answer
Similar questions
Recommended textbooks for you
MATLAB: An Introduction with Applications
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
Elementary Statistics: Picturing the World (7th E…
Elementary Statistics: Picturing the World (7th E…
Statistics
ISBN:
9780134683416
Author:
Ron Larson, Betsy Farber
Publisher:
PEARSON
The Basic Practice of Statistics
The Basic Practice of Statistics
Statistics
ISBN:
9781319042578
Author:
David S. Moore, William I. Notz, Michael A. Fligner
Publisher:
W. H. Freeman
Introduction to the Practice of Statistics
Introduction to the Practice of Statistics
Statistics
ISBN:
9781319013387
Author:
David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:
W. H. Freeman