19828121_BUS5CA_Assignment 3
docx
keyboard_arrow_up
School
Swinburne University of Technology *
*We aren’t endorsed by this school
Course
70017
Subject
Information Systems
Date
Nov 24, 2024
Type
docx
Pages
16
Uploaded by Deep2029
ASSIGNMENT-03
Churn Analysis
SUBMITTED BY: RUHI MIGLANI
19828121
TABLE OF CONTENTS
Task 1
Task-2A
Task2B
Task 2.B1
Task 2 B2
Task2 B3
Task3
Introduction
The Assignment is based on the Churn Analysis, here churn analysis is carried out for the Alpha Telco
in order to get the clear insights of the ratio of churned people and also it asked to develop various
predictive models in order to do something for the retention of customer as if churn ratio is going high
it is not beneficial for the company. However, with the help of SAS Miner Enterprise three different
models are created in order to check which model performs better and good for the company along
with that Excel is used in order to get clear picture about the values of churned and non- churned
customers for this few steps are taken into consideration in order to filter the data effectively and
efficiently as only after data filtration it is possible to get the overall churn and churn based on
different categories. Then for Task 3, based on the analysis we carried out in part 1 and 2 we gave the
recommendations.
Task-01
In this task it is asked to conduct the descriptive analysis based on the customer data given to us and then different profiles are constructed for each customer group. Here, the three main categories are:
Loyal Customers
Churned Customers
Non-churned Customers
Loyal – Customers
We get the data of Loyal customers after filtering the data or as it is given loyal customers are the
subset of non- churned customers and from them top 10% are taken into consideration. After all the
below mentioned results are obtained:
331
1053
0
250
500
750
1000
Yes
No
Partner
T o t a l N
u m
b e r
Partner
673
711
0
200
400
600
Male
Female
Gender
T o t a l N
u m
b e r
Gender
515
582
287
0
200
400
600
Fiber optic
DSL
No
InternetService
T
o t a l N
u m
b e r
InternetService
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
100
296
988
0
250
500
750
1000
Two year
One year
Month-to-month
Contract
T o t a l N
u m
b e r
Contract
365
287
732
0
200
400
600
Yes
No
No internet service
TechSupport
T o t a l N u m
b e r
TechSupport
First if we look onto variable Partner
, we have two categories Yes & No. Here, Yes means
the count of people who are loyal to the company have partners and count value is 1053
whereas, no means these loyal customers don’t have partners and their count is 331. It can be
easily analysed that in loyalty segment for variable partner those who have partners are more
loyal in compare to those who don’t have partners. It is necessary for the company to target
and focus those who have partners as they are giving more profit to the company.
Then, considering another variable named Gender, here also we have two categories MALE
is 711and FEMALES is 673. With the plot it can easily observe that the Males customers are
more loyal in compare to the females, so it is essential to focus on females as they are less
loyal and giving less profit to the company whereas, it is essential to maintain the loyalty of
Males customers as well’
Next variable is Payment Method
with the graph we can observe that here major preference
of the loyal customers is the Bank Transfer i.e. 525 following with Credit card and they are
least preferring Electronic check and mailed check. But it is important for the bank to
promote all services so for this an action needs o be taken.
Next is Internet services offered by the company to their customers. Here company is
offering two types of services DSL and Fiber optic, if we see the graph of Internet services it
can be easily observed that in the
loyal customer segment the count of opting Fiber optic is
high
its count is 582
whereas, the count of opting DSL is less and here few loyal customers
are not opting any of the services so it is essential to do something so that people will opt
more services and for those who are not opting anything some initiatives needs to be taken so
that they will encourage to opt any of the services offered.
Then next variable is Contract here the customers are more loyal towards the long term plans
this states that people are happy with the long -term services so here we can do something for
the contracts of short- term nature.
Another is Tech -Support those who opt tech support are more loyal
and their count is 732
those who not -opt their count is less than it is essential to promote these services to those
who didn’t opt that.
All-in-all, to maintain the customer base it is essential to have a good relation with the customers who
are generating profit for the company along with that it is also essential to focus on those who are not
generating much profit if both will taken into account the profit and the customer base automatically
improve.
Churned (Segment-02)-
These are those customers who are no more- happy with the services provided by the company and
they don’t want to use more services of the company as they are using earlier. This data set is obtained
after filtering the data and following results are obtained:
1200
669
0
250
500
750
1000
1250
No
Yes
Partner
T o ta l N u m be r
Partner
939
930
0
250
500
750
Female
Male
Gender
T o ta l N u m b e r
Gender
459
1297
113
0
500
1000
Fiber optic
DSL
No
InternetService
T o ta l N u m b e r
InternetService
1655
166
48
0
500
1000
1500
Month-to-month
One year
Two year
Contract
T o ta l N u m b e r
Contract
1446
113
310
0
500
1000
1500
No
Yes
No internet service
TechSupport
T ota l N um be r
TechSupport
The firs variable taken here is partner
, there are two categories in it YES and NO here also
we can see that the count of NO is more in compare to YES means those who don’t have
partners are less happy with the services provided by the company and they are less or not
using the services or they are not going to use their services for longer period of time. So, it is
essential for the company to focus on those who are not happy and comes under the category
of churn so that some initiative can be taken to retain these customers.
Next, is Gender
, in this the number of Females are more in compare to Males the difference
is not big but still females churning count is more in respect to males so Females needs to be
targeted in order to check why their churning count is high and then according to all such
answers the action needs to be taken.
Then Payment Method
, in this the count of Electronic transfer is more in respect to other
mode of payments as its count is 1071, it is essential to know the reason behind this and also
essential to focus on the all modes and to check the reasons why the churning ratio is more
and why these services are least preferred.
Next is Internet Services if we see the graph of it we can easily observe that here trend is
same the count of using Fiber optic is more and DSL is less here also we have few peoples
who are not using anything means they are already churned .So it is essential to know the
causes why people are churned why the count of services used are less then accordingly steps
needs to be taken because the count of churn in this segment is high.
Other variable is contract if we see the those who comes under churned category the count
of churned are more who are using monthly subscribtion this means company is not focusing
on those who are opting short term services or their short term services are not effective
thener fore churn count is more for that.
Last is Tech support,those who are not using or who doesnot opt the Tech support are more
liable to churn means they are facing some issues but because of lack of tech support they are
not able to solve their issues or their issues are not solved and they are leaving the company
or no more want to use the services of the company.
Over all it is essential to consider on those who have no partners as their churn out ratio is
high along with that Females, mode of payments and Internet services Tech support and
Tenure of the contract all needs to be taken into consideration to check why churn ratio is
high and to check how it can be reduced as soon as possible.
Non-churned
2439
2724
0
1000
2000
Yes
No
Partner
T otal N um ber
Partner
2544
2619
0
1000
2000
Male
Female
Gender
T o ta l N u m b e r
Gender
1957
1799
1407
0
500
1000
1500
2000
DSL
Fiber optic
No
InternetService
Total N um ber
InternetService
This category contains those customers who are not churned and enjoying the services provided by
the company. YES, means they have partners and No means they don’t have partner. So, those who
have partners the non- churned count is high for those who are single their churned rate is high means
non churned count of those customers are less in compare to those have partners.so, with some
effective measures it would be easy to retain those who are happy and also to those who don’t have
partners. Then next is Gender
, the count of Males is high, and females are less as discussed above as
well kit is essential to target females in order to increase profit also to make Females loyal customers.
Then next is Payment Method
in this segment all services are enjoyed and used by the customers and
count of opting all methods are near to each other but mailed checks count is high so this reflects with
help of customer relationship management we can take feedbacks and advice from customers and can
implement the things accordingly to convert churned customers to the potential customers for the
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
company. Last is,
Internet services here, DSL is most preferred its count is high in compare to Fiber
optic but here also those who are with company are not using any of the services among two so we
can do something for that so that they will also opt one of the service among those two. Conclusion
By analysing all three different customers we can easily conclude that in each type variables are
playing different roles but after analysing all the segments or profiles we can say that we need to look
out those who don’t have partners, in gender we need to focus on females , in mode of payment all
services needs to be taken into consideration whereas, in internet services we need to check why DSL
is not famous in loyal customers and why there are few customers who are not using any of the
services offered by the company.
Task-02
A.
Over- all churn rate:
In this part it is asked to calculate over all churn rate and the churn rate of the categorical variable.
The results are given below:
No; 73.4
2%
Yes; 26.5
8%
Total Churn Rate(%)
No
Yes
Here with the table and graph we can easily conclude that overall rate of churn is 26.57% means the ratio of the customers who are churned is 26.57% and rest 73.42% are the loyal ones.
Group churn Rate
Gender
Males= (
930/3549) *100 = 26.20%
Females = (939)/3483) *100=26.95%
Senior citizens
Yes= (476/1142) *100= 41.68%
No
= (1394/5891} *100= 23.66%
Dependent
Yes = (669/3393) *100 = 19.71%
No = (1200/3639) * 100 = 32.97%
Phone Services
Yes = (1687/6340) *100 = 26.60%
No = (170/680) *100 = 25%
Paperless Billing
Yes = (1400/4168) *100 = 33.58%
Row Labels
Count of Churn
Churn Rate (%)
No
5163
73.42150171
Yes
1869
26.57849829
Grand Total
7032
No = (469/2864) * 100 = 16.37%
Above mentioned are the calculation of Group churn.
Task-02
B.1 why we apply standardization (z-score normalization) on the continuous variables?
When all the Continuous variable are measured at different scales, we need to apply standardization
(z-score normalization) this states that in the analysis there is no equal contribution of all the
continuous variables. for instance, if we conduct the customer segmentation, in which we seek to
group customers based on their diverse qualities. A' transaction amount ' variable, which is in b/w $10
and $10,000, has a greater weighting than the number of transactions between 0 and $20. The variable
is called a transaction amount. It is therefore necessary to transform the data into weights which can
be compared. It is a matter of rescaling an original parameter to the same scope and/or variability
.
B.2 What are the selected variables used for building the prediction models?
With the above- mentioned diagram, we can easily identify the variables in order to build prediction
models and this can be get after making changes in running the transformation node by making
changes in the properties of Transformation node where we did Interval Inputs and in sample
properties taken method as Random and Size as max. Selected variables are given below:
B.3 What are the predictive performance of various models and how they rank against one
another?
Model 1
Neural network Model The neural network model helps in understanding the complex and non-linear relations in between the
different variables.The first graph is the graph of Score Ranking Matrix with this we can observe that
3.7 is the best score whereas our score is 2.9and it is closer to the Best Commulative Scorewhich is a
good indication. Then second graph is of Misclassification Rate if we see the value of
Misclassification rate is 0.208 means there are less errors in the model.
As, good Commulative score and less value of Misclassification Rate means the performance of
model is good.
Confusion Matrix:
Accuracy
: If we calculate the accuracy of the model it is showing 0.79
Model 2 Decision Tree
The second model taken is Decision Tree Model in which few propertioes are chnaged like Leaf size
taken as 25 and split size is 50 this is done to prevent the very small size of the leaves creation and
selection method is taken as Misclassification Rate in order to check the misclassification rate
however, below are the two different plots one is Score ranking overlay and second is Subtree
Assessment plot, the major things obtained are:
Here commulative lift score is 2.8 and the optimal number of leaves are 8.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
With the above-mentioned fit statistics diagram, it is easy to check the performance of the model as it
depicts all the values which is essential to check the overall performance of the model.
Regression Model
The Regression is used in order to get the existing relationship between the different variables,to get
the clear picture here, also few properties are changed like in Regression Type we take Logistic
Regression then in Link Function, Logit is there, Selection Model is Backwars and the criteria of
selection is taken as Misclassification Validation. However, the results after running the Regression
Node are given below:
Here Blue Bar represents the Negative Relation and Red bars represents the Positive Relation, so
here few variables are representing positive and few are representing negative relations.Then next is
the graph of Score Ranking Matric with this we will able to get the commulative score the value for
bes Commulative Score is approximately 3.7 whereas, our commulative score is approximately 2.9
this means the our commulative score is very close to the best commulative score this is a positive
sign means model is good.
Then first graph is of Score Ranking Matric with this we will able to get the commulative score the
value for best Commulative Score is approximately 3.7 whereas, our commulative score is
approximately 2.8 this means the our commulative score is very close to the best commulative score
this is a positive sign means model is good. Then second graph is of Iteration plot which gives out the
value of Misclassification Rate its 0.211
Model Comparison & selection
All three models are compared where the Selection statistics is taken as Misclassification Rate and
below are the results:Here, Neural Network
is selected as best model because it has less value of
Misclassification in comparision with the other two models and less would be the misclassification
rate more would be the model is accurate or good.Then to clear this thing another chart i.e ROC chart
is taken into consideration:
ROC charts depicts the trade-off in between the sensitivity and the specificity, however, the classifier
which is much closer to the top left corner is the indicator of the best performance and if curve is
close to the base line means the performance is poor and with the diagram we can easily observe that
the the curve for neural Network is much closer to the top left corner in resoect to other two models so
we can say that the Neural Network here is best.
Then comes Lift score, the graph presents below shows the commulative lift score of all three
models:
Here, the Commulative Score for Neural Network is 3.5 whereas, the commulative lift score of
Regression is 3.4 and Decision Tree is 3.2 this reflects that the performance of Neural Network is
good in respect to other models. So by above mentioned graps we can easily conclude that The neural
Network is the best model. For more clarity below is the picture which depicts how Neural network is
best.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
If we see this here the misclassification rate for Neural Network is less in compare to all other models
weather in case of training or validation.
Confusion Matrix:
Model
Partition
of data
True +
True
-
False+
False-
Total
Accuracy
Decision Tree
Training
650
3251
362
657
4920
0.79
Validation
258
1410
140
304
2112
0.78
Neural Network
Training
690
3264
349
617
4920
0.80
Validation
274
1397
153
288
2112
0.79
Regression Training
668
3272
341
639
4920
0.80
Validation
264
1402
148
298
2112
0.78
TP + TN / TP + TN + FP + FN.
With the help of confusion matrix, the accuracy of the model is calculated and we can clearly observe
that The validation accuracy of Neural Network is high i.e. 0.79 in compare to other two models this
means this is the best model among all.
Conclusion
After analysing all three graph the below mention conclusions are drawn:
The value of Average square error is less of Neural Network model in compare to all other
models.
The ROC curve of Neural network is closer to the top left corner menas the performance of
this model is good in compare to other models.
If we focus on the lift score the lift score of Neural Network model is high in compare to
other models.
Value of accuracy of Neural network is more with value 0.79, means in compare to other this
model is more accurate and reliable.
All-in-all the Neural Network is best in compare to Decision tree model and Regression model
Task-03
In this part it is asked to give recommendations based on the analysis done in previous task(1,2).so,
recommendations are mainly based on the variables, means which is the important variable for the
Alpha Telecom so that they will focus on that first in order to retained their customers for longer
period of time.After deep analysis the major insights drawn are, In case of loyal customers they are
providing the profit to the company means they are happy with the services offered and they are
happily using the services offered by the company.Based on the insights drawn and our understanding
below mentioned are the few recommendations which Alpha Telco needs to take into consideration in
order reduce the churn rate of their customers:
The churn out ratio of the people who don’t have partners are high, means those who are
single are churrning out more, so it is essential for the company to introduce some extra
benefits and discounts to those who are single, like 10% off on fees of the service ,as retaining
them is essential for the company in order to reduce the churn out ratio. And to retained those
who have partners and loyal family plans are the best tactics but for singles who are churrning
at a good pace some initial discounts and schemes or some lomg term benefits and dioscounts
in the price of service if they use that service for longer period this will attract them and
helped to stay with the company for ;longer period of time.
Then the churn out ratio is high for the contact of 2 year, so for this some attractive long
term benefits needs to offer to the customer so they finf fruitful to invest for long term
contract and this will reduce the churn ratio for long term contracts. Here company can offere
few monetary discounts for those who adopt 2 year contract or some rewards points to use
later this will attract people to invest more in longer period of contracts.
The churn ratio is high for the customers in case of Electronic check whereas, there is less
count of churn of all other modes of payment so it essential for the company to provide some
discount vouchers for opting Electronic checks this will attracts new customers also this will
help to retain the existing customers as every one wants to opt those services which give
monetary benefit in return. Also it is good to opt because this will reduce the handling of
paper bills as every thing will present online.So, the introduction of Discount vouchers on the
Electronic cheks is fruitfull for the company to overcome the churn out ratio.
The churn out ratio of Female is more in respect to Male customers this is a sign to do
something for females so, company can compe up with buy one and get one offer only for
females means if they opt any one service they are eligible to opt any other service free of
cost so this will attract more females to opt the service as every body want more satisfaction
in less proice.
Then there is need to focus on the customers who are not satisfy with the Fiber Optic Internet
service as the churn count is more of those customers are using Fiber optic type this means
there is some problem in the fiber optic that’s why people are not happy and churnning out so
for that company needs to improve the quality of the service along with that to rertain
customers and to reduce churn out ratio they needs to reduce the price of the Fibre optic
service along with that also build customer relationship so that it will becom easy for the
company to resolve all the issues faced by the customer this will automatically help in
reduction of the churn rate.
If we focus on the variable Tech- support, it is important because people who are not getting
tech support or not subscribed to it ,their churrning ratio is high, means people want these
value added services to stay back with the company so, it is mandatory for the company to
provide Tech support in order to work on the problems faced by the customers due to which
they are not happy with the services and resolve their issue in less time. Also, if any of the
problem is taking time to get resolve then they can give some monetary benefit to the
customer as compensation. Moreover, improved Technical support is essential to retained the
existing customers as well as to attratct new customers and to look those who are not satisfy.
Other major thing is the Phone service as those who are using phone service are more likely
to churn means there is some problem with the quality of service or may be in the price which
is forcing the people to churn so, company needs to improve the quality of phone service they
are providing to their customer along with that few schemes needs to be launch for phone
services then only more customers will attract towards this service and if problems of
customers are solved on real time basis then the churn out ratio can be reduced.
Moreover, it is also essential for the company to focus on those who are loyal because
retention is necessary, if they not focus on them it will nlot good for the company so company
needs to provide extra benefits to those who are loyal just like the vodaphone is giving as if
your contract with vodaphone is of 6 months or more than 6 months provide discounts in the
fees of the plan and even if more services are opt then they are providing a huge concession in
the fees of the service.
It is also analyzed that there are some loyal customers who are not using much of the services
of the company so kit is essential to find out what they want and why they are not using or
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
opting mofre number of services and accordingly some iniotiatives needs to be taken so that
they will use the maximum services of the company and will remain loyal for the longer
period of time.
Integrated data management is essential to get rid from the churn out ratio as this will help to
understand the activities of the customer and accordingly company can provide special
discounts and offers to the interface of the customer.
Real time communiaction or personal contact with the customer is essential and accordingly
we can offer value services like extra minutes talk time just like vodaphone is doing in their
pack they are providing 1000 free international minutes.These type of offeres attract more
customers and also help in retaining the existing base of customer.
It is essential to track the the performance of the services offered this will help to know the
lacking areas and help to know what initiative is required to tackle the problem.
All-in -all the churning ratio of the customers is 27%, so it is essential for the company to take all
above mentioned initiatives in order to reduce this percentage and to convert them as loyal or
potential customers, because for the success of the firm it is essential to satisfy the customers through
their services better would be the service more would be the potential customer along with that
customer frelationship management is important now-a-days only with this company can gety to
know the demand and preference of the customer and accordingly they can deliever the services to
them this is good for both company and the customers.