Question

In: Statistics and Probability

Please show and answer all the parts of this question. This is More About Tests and...

Please show and answer all the parts of this question. This is More About Tests and Intervals section in Statistics. Textbook: Stats Data And Models (4th Edition). De Veaux Velleman Bock

Suppose that a particular spam filter uses a points-based system in which various aspects of an email trigger an accumulation of points – with 100 points being the maximum and strongly indicating spam. So, more points for a particular email becomes stronger evidence that it is spam. After accumulating a sufficient number of points, the spam filter classifies the email as spam and it does not reach your inbox.

This process is similar to hypothesis testing in the following way for each email it reviews:    

H0: The email is a real message (not spam)

HA: The email is spam

Using the above hypothesis setting context, answer the following questions using language/terms related to hypothesis testing:

  1. When the filter allows spam to slip through into your inbox, which kind of error is that? Explain in terms of the hypotheses above.
  2. Which kind of error is it when a real (i.e., non-spam) email gets classified as spam and does not get to your inbox? Explain in terms of the hypotheses above.
  3. Suppose that this particular spam filter classifies spam as any email getting 50 points or higher. However, you reset the filter to use 60 points or higher before classifying it as spam. Is that analogous to choosing a higher or lower alpha level for a hypothesis test. Explain in terms of the hypotheses above.
  4. What impact does this change in the spam cutoff value have on the chance of each type of error in hypothesis testing? Explain.
  5. What does “power” mean in this context of the spam filter, and how is it related to one of the two types of errors? Explain in terms of the hypotheses above.

Solutions

Expert Solution

1)

When the filter allows spam to slip through into your inbox, it is a type 2 error. That is, a spam mail is classified as a real message and reaches your inbox. Type 2 error is the non rejection of a false null hypothesis.

2)

When a real email gets classified as spam and does not get to your inbox, it is a type 1 error. Here, a real mail is classified as a spam. Type 1 error is the rejection of true null hypothesis.

3)

The spam filter classifies spam as any email getting 50 points or higher. If you reset the filter to use 60 points or higher before classifying it as spam, the number of messages classified as spam will reduce. It is also the case when the significance level alpha is increased. When alpha is higher, the number of messages classified as spam will reduce, because the acceptance region will be bigger.

4)

When the spam cutoff value increases, probability of type 1 error decreases, because the probability of acceptance is more. Also the probability of type 2 error increases because acceptance region is bigger.

When the spam cutoff value decreases, probability of type 1 error increases, because the probability of acceptance is less. Also the probability of type 2 error decreases because acceptance region is smaller.

5)

Power is the probability of rejecting the null hypothesis when it is false. Here, it is the probability of classifying a message as spam when it is actually a spam.

Power is related to type 2 error as :

power = 1 - (Probability of type 2 error)


Related Solutions

Please show and answer all the parts of this question. This is Testing Hypotheses About Proportions...
Please show and answer all the parts of this question. This is Testing Hypotheses About Proportions in Statistics Suppose that in manufacturing a very sensitive electronic component, a company and its customers have tolerated a 2% defective rate. Recently, however, several customers have been complaining that there seem to be more defectives than in the past. Given that the company has made recent modifications to its manufacturing process, it is wondering if in fact the defective rate has increased from...
Please answer all the parts of this question Please answer all the parts of this question...
Please answer all the parts of this question Please answer all the parts of this question Question: a) Where is the flow accelerating in the control volume? b) Will the pressure be greater or less than hydrostatic in a region of accelerating flow? c) Is the hydrostatic estimate of force on the gate; larger, smaller, or the same, as that obtained from the Momentum Equation? Explain. d) Where does the change in flow momentum go in this control volume? Explain.
Please answer all parts of the question. Please show all work and all steps. 1a.) Show...
Please answer all parts of the question. Please show all work and all steps. 1a.) Show that the solutions of x' = arc tan (x) + t cannot have maxima 1b.) Find the value of a such that the existence and uniqueness theorem applies to the ivp x' = (3/2)((|x|)^(1/3)), x(0) = a. 1c.) Find the limits, as t approaches both positive infinity and negative infinity, of the solution Φ(t) of the ivp x' = (x+2)(1-x^4), x(0) = 0
Please show and answer all the parts of this question. Suppose that in manufacturing a very...
Please show and answer all the parts of this question. Suppose that in manufacturing a very sensitive electronic component, a company and its customers have tolerated a 2% defective rate. Recently, however, several customers have been complaining that there seem to be more defectives than in the past. Given that the company has made recent modifications to its manufacturing process, it is wondering if in fact the defective rate has increased from 2%. For quality assurance purposes, you decide to...
this as a whole question 1, answer all parts please a)Show that the derivative of f(x)...
this as a whole question 1, answer all parts please a)Show that the derivative of f(x) = 6+4x^2 is f(x)'=8x by using the definition of the derivative as the limit of a difference quotient. b)If the area A = s^2 of an expanding square is increasing at the constant rate of 4 square inches per second, how fast is the length s of the sides increasing when the area is 16 square inches? c)Find the intervals where the graph of...
Please answer all parts, thank you, and please type your answer and show all work including...
Please answer all parts, thank you, and please type your answer and show all work including excel formulas Exercise 15-15 The following data were taken from the balance sheet accounts of Shamrock Corporation on December 31, 2016. Current assets $554,000 Debt investments 596,000 Common stock (par value $10) 455,000 Paid-in capital in excess of par 148,000 Retained earnings 800,000 Prepare the required journal entries for the following unrelated items. (Credit account titles are automatically indented when amount is entered. Do...
Please answer both parts of this question. Please answer both parts of this question. Question: a)...
Please answer both parts of this question. Please answer both parts of this question. Question: a) The channel in question 2 now has a 0.3m high smooth extended shelf built across its base to cover a submerged pipeline but still carries 50m3/s. Plot the specific energy diagram for 0 < y < 4m and calculate the critical depth and minimum specific energy. What are now the two possible flow depths for a specific energy of 4m upstream of the obstructions?...
Please solve all parts of the following question. Please show all work and all steps. 1a.)...
Please solve all parts of the following question. Please show all work and all steps. 1a.) Solve x' = x + 3y + 2t y' = x - y + t^2 1b.) Solve x' + ty = -1 y' + x' = 2 1c.) Solve x' + y = 3t y' - tx' = 0
Please answer all parts of this question for both parts!! Prompt: In our study of the...
Please answer all parts of this question for both parts!! Prompt: In our study of the skeletal system this week it should be readily apparent to you how important synovial joints are in the normal function of the human body. This week you will pick two synovial joints to discuss. Pick one from each category below. For each selected joint you should provide the following information: the movements that are possible at that joint (use appropriate anatomical terminology), identification of...
Please answer the problem below for all parts. Please show all work and write clearly. Thanks....
Please answer the problem below for all parts. Please show all work and write clearly. Thanks. (Apply total probability and Bayes’ rules) A large industrial firm uses three local motels to provide overnight accommodations for its clients. From past experience it is known that 22% of the clients are assigned rooms at the Ramada Inn, 50% at the Sheraton, and 28% at the Lakeview Motor Lodge. If the plumbing is faulty in 5% of the rooms at the Ramada Inn,...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT