Question

In: Statistics and Probability

Spam Spam filters try to sort your e-mails, deciding which are real messages and which are...

Spam Spam filters try to sort your e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming message and assigns points to the sender, the subject, key words in the message and so on. The higher the point total, the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than the cutoff passes through to your inbox, and the rest, suspected to be spam, are directed to the junk mailbox. We can think of the filter’s decision as a hypothesis test. The null hypothesis is that the e-mail is a real message and should go to your inbox. A higher point total provides evidence that the message may be spam; when there is sufficient evidence, the filter rejects the null, classifying the message as junk. This ususally works pretty well, but, of course, sometimes the filter makes a mistake. a. (1 mark) When the filter allows spam to slip through into your inbox, which kind of error is that? b. (1 mark) Which kind of error is it when a real message gets classified as junk? c. Some filters allow the user (that’s you) to adjust the cutoff. Suppose your filter has a default cutoff of 50 points, but you reset it to 40. Is that similar to choosing a larger value or similar to choosing a smaller value of α for a hypothesis test? Explain

Solutions

Expert Solution


Related Solutions

Spam filters try to sort your incoming e-mails, deciding which are real messages and which are...
Spam filters try to sort your incoming e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming e-mail and assigns points according to the sender, the subject, key words in the message, and so on. The higher the point total the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than that cutoff passes...
If you want to know how important spam filters are to your online experience, try turning...
If you want to know how important spam filters are to your online experience, try turning them off for a day. You’ll quickly see why these tools we tend to take for granted are so essential. Generally speaking, a filtering solution applied to your email system uses a set of protocols to determine which incoming messages are spam and which are not. What the filters checks on can vary, but often they all do basically the same thing: scan header...
If you want to know how important spam filters are to your online experience, try turning...
If you want to know how important spam filters are to your online experience, try turning them off for a day. You’ll quickly see why these tools we tend to take for granted are so essential. Generally speaking, a filtering solution applied to your email system uses a set of protocols to determine which incoming messages are spam and which are not. What the filters checks on can vary, but often they all do basically the same thing: scan header...
(15) Imagine the time it takes your boss to answer your e-mails is uniformly distributed from...
(15) Imagine the time it takes your boss to answer your e-mails is uniformly distributed from 15 – 120 minutes. What is the average time it takes her to respond? What is the standard deviation for her response time? (16) What is the probability your boss will respond to an e-mail within one hour? (17) What is the probability it will take your boss longer than 30 minutes to respond? (18) What is the probability your boss will respond to...
Which of the following business messages would not use the direct strategy? a. An e-mail message...
Which of the following business messages would not use the direct strategy? a. An e-mail message to a staff introuducing a new employee b. A letter a customer denying his or her request for credit c. A letter to a coworker congratulating him or her on a recent promotion d. An oral presentation detailing the specifics of a new company wellness initiative
Pick one or two real cases happened in your real life, try to find and analyze...
Pick one or two real cases happened in your real life, try to find and analyze the economic law issues in them by using the rules we mentioned in Economic Law.
Design a program which uses functions to sort a list and perform a binary search. Your...
Design a program which uses functions to sort a list and perform a binary search. Your program should: Iinitialize an unsorted list (using the list provided) Display the unsorted list Sort the list Display the sorted list. Set up a loop to ask the user for a name, perform a binary search, and then report if the name is in the list. Use a sentinel value to end the loop. Do not use the Python built in sort function to...
a) Which of the following reactions occur when (E,Z,E)- octa-2,4,6-triene is heated? Circle your choice: -...
a) Which of the following reactions occur when (E,Z,E)- octa-2,4,6-triene is heated? Circle your choice: - Diels-Alder -Cycloaddition - Cope rearrangement -Electrocyclic -Sigmatropic b) Sketch the MAJOR product that is formed when (E,Z,E)-octa-2,4,6-triene is heated. Be sure clearly show features such as the correct regio- and stereochemistry.
In your case study, discuss the following aspects of the real company in the world which...
In your case study, discuss the following aspects of the real company in the world which must have offering bonds. you can chose any company which you like. 1. Provide a brief introduction of the company, including its name, headquarters, products/services offered, and approximate net worth. 2. Explain how the company is doing with respect to the ratios. Consider debt-to-equity, return on equity, current and quick ratio, working capital ratio, price earnings ratio, and the earnings per share. (chap 2)...
Describe a real-world prediction problem using urban data for which interpretability of your models and results...
Describe a real-world prediction problem using urban data for which interpretability of your models and results is essential, and for which it might be preferable to use decision trees rather than random forests. Argue why this is the case.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT