
Question

Spam Email Filters.  A study by Forbes indicated that the five most common words appearing in spam emails are shipping!, today!, here!, available, and fingertips!.  Many spam filters separate spam from ham (email not considered to be spam) by applying Bayes' theorem.  Suppose that for one email account, 1 in every 10 messages is spam, and that the proportions of spam messages containing each of these five words are as given below.

shipping! 0.051
today! 0.045
here! 0.034
available 0.014
fingertips! 0.014

Also suppose that the proportions of ham messages that have these words are:

shipping! 0.0015
today! 0.0022
here! 0.0022
available 0.0041
fingertips! 0.0011

a.  If a message includes the word shipping!, what is the probability the message is spam?  If a message includes the word shipping!, what is the probability the message is ham?  Should messages that include the word shipping! be flagged as spam?

b.  If a message includes the word today!, what is the probability the message is spam?  If a message includes the word here!, what is the probability the message is spam?  Which of these two words is a stronger indicator that a message is spam?  Why?

c.  If a message includes the word available, what is the probability the message is spam?  If a message includes the word fingertips!, what is the probability the message is spam?  Which of these two words is a stronger indicator that a message is spam?  Why?

d.  What insights do the results of parts (b) and (c) yield about what enables a spam filter that uses Bayes' theorem to work effectively?
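
One way to check parts (a) through (c) is to apply Bayes' theorem to each word directly: P(spam | word) = P(word | spam) P(spam) / [P(word | spam) P(spam) + P(word | ham) P(ham)], with P(spam) = 0.1 and P(ham) = 0.9 from the problem statement. The Python sketch below is only an illustration of that calculation; the function and variable names are made up for this example and are not part of the original problem.

```python
# Prior from the problem: 1 in every 10 messages is spam.
P_SPAM = 0.10
P_HAM = 1.0 - P_SPAM

# P(word | spam) and P(word | ham), copied from the two lists in the problem.
P_WORD_GIVEN_SPAM = {
    "shipping!": 0.051,
    "today!": 0.045,
    "here!": 0.034,
    "available": 0.014,
    "fingertips!": 0.014,
}
P_WORD_GIVEN_HAM = {
    "shipping!": 0.0015,
    "today!": 0.0022,
    "here!": 0.0022,
    "available": 0.0041,
    "fingertips!": 0.0011,
}


def posterior_spam(word):
    """Return P(spam | word) using Bayes' theorem (hypothetical helper name)."""
    numerator = P_WORD_GIVEN_SPAM[word] * P_SPAM
    denominator = numerator + P_WORD_GIVEN_HAM[word] * P_HAM
    return numerator / denominator


for word in P_WORD_GIVEN_SPAM:
    p = posterior_spam(word)
    print(f"{word:12s} P(spam | word) = {p:.4f}   P(ham | word) = {1 - p:.4f}")
```

With the figures given, this works out to roughly 0.79 for shipping!, 0.69 for today!, 0.63 for here!, 0.28 for available, and 0.59 for fingertips!. Note that available and fingertips! appear in spam at the same rate (0.014), yet their posteriors differ sharply because available is far more common in ham. That points toward part (d): a word is a useful indicator when it is much more likely in spam than in ham, not merely when it is frequent in spam.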
