Define which are real messages and which are unwanted

Assignment Help Basic Statistics
Reference no: EM131745710

Question: Spam. Spam filters try to sort your e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming e-mail and assigns points to the sender, the subject, key words in the message, and so on. The higher the point total, the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than that cutoff passes through to your inbox, and the rest, suspected to be spam, are diverted to the junk mailbox.

We can think of the filter s decision as a hypothesis test. The null hypothesis is that the e-mail is a real message and should go to your inbox. A higher point total provides evidence that the message may be spam; when there's sufficient evidence, the filter rejects the null, classifying the message as junk. This usually works pretty well, but, of course, sometimes the filter makes a mistake.

a) When the filter allows spam to slip through into your inbox, which kind of error is that?

b) Which kind of error is it when a real message gets classified as junk?

c) Some filters allow the user (that's you) to adjust the cutoff. Suppose your filter has a default cutoff of 50 points, but you reset it to 60. Is that analogous to choosing a higher or lower value of for a hypothesis test? Explain.

d) What impact does this change in the cutoff value have on the chance of each type of error?

Reference no: EM131745710

Questions Cloud

Explain three reasons for utilizing professional networking : Explain three reasons for utilizing professional networking during the job-hunting process. If you do not have experience with professional networking.
Regarding the insurer duty to defend the insured : Which of the following statements is not true regarding the insurer's duty to defend the insured?
Which type of error did the bank make : Loans. Before lending someone money, banks must decide whether they believe the applicant will repay the loan. One strategy used is a point system.
Calculate the load-distance score of preliminary layout : Describe the main criteria that you will use to evaluate alternate layout designs. Calculate the load-distance score of the preliminary layout.
Define which are real messages and which are unwanted : Spam. Spam filters try to sort your e-mails, deciding which are real messages and which are unwanted. One method used is a point system.
Manufacturer plans on using debt to finance the project : If the manufacturer plans on using debt to finance the project, should the estimated project cash flows be changed to reflect these interest charges
Three kinds of molded fiberglass recreational boats : The Skimmer Boat Company manufactures three kinds of molded fiberglass recreational boats a bass fishing boat, a ski boat, and a speedboat.
What are the clues that the women find and the men do not : What are the clues that the women find and the men do not? What does foreshadowing mean and what is an example? What do the women find in the box?
Define aspects of an applicant financial condition : Second loan. Exercise describes the loan score method a bank uses to decide which applicants it will lend money. Only if the total points awarded for various.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd