Some of the bars in the following graph cannot be clearly seen due to the large difference in values in the #of URLS corresponding to different amount of traffic (website visits). What kind of transformation to the data would make those columns more visible and the differences between remaining bar lengths clear? Histogram showing positive skew of population of URLS 20000 40000 60000 80000 Amount of traffic O a. log O b. Arcsine O c. square O d. linear # of URLS 500 1000 1500 2000 2500 T
Correlation
Correlation defines a relationship between two independent variables. It tells the degree to which variables move in relation to each other. When two sets of data are related to each other, there is a correlation between them.
Linear Correlation
A correlation is used to determine the relationships between numerical and categorical variables. In other words, it is an indicator of how things are connected to one another. The correlation analysis is the study of how variables are related.
Regression Analysis
Regression analysis is a statistical method in which it estimates the relationship between a dependent variable and one or more independent variable. In simple terms dependent variable is called as outcome variable and independent variable is called as predictors. Regression analysis is one of the methods to find the trends in data. The independent variable used in Regression analysis is named Predictor variable. It offers data of an associated dependent variable regarding a particular outcome.
Step by step
Solved in 2 steps