What could you do to increase the filter’s power

Reference no: EM131381616

More spam. Consider again the points-based spam filter described in Exercise 16. When the points assigned to various components of an e-mail exceed the cutoff value you've set, the filter rejects its null hypothesis (that the message is real) and diverts that e-mail to a junk mailbox.

a) In this context, what is meant by the power of the test?

b) What could you do to increase the filter's power?

c) What's the disadvantage of doing that?
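One way to make the idea of power concrete is to compute, for a few cutoffs, the fraction of known spam that the filter would divert. The sketch below is only an illustration: the point totals in `SPAM_POINTS` and the function name `power` are assumptions invented for this example, not part of the exercise, and it uses the rule from Exercise 16 that a message rated lower than the cutoff passes through.

```python
# Hypothetical point totals for messages known to be spam (made-up data,
# not taken from the exercise).
SPAM_POINTS = [45, 55, 57, 62, 70, 81, 90, 95]


def power(cutoff, spam_points=SPAM_POINTS):
    """Fraction of true spam that the filter diverts to junk at this cutoff.

    Following Exercise 16, a message rated lower than the cutoff passes
    through, so a message is diverted when its points are >= cutoff.
    """
    caught = sum(1 for p in spam_points if p >= cutoff)
    return caught / len(spam_points)


for cutoff in (40, 50, 60):
    print(f"cutoff={cutoff}: power={power(cutoff):.3f}")
```

On this toy sample the computed fraction changes as the cutoff moves, which is the relationship parts b) and c) ask you to reason about.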

Exercise 16

Spam. Spam filters try to sort your e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming e-mail and assigns points to the sender, the subject, key words in the message, and so on. The higher the point total, the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than that cutoff passes through to your inbox, and the rest, suspected to be spam, are diverted to the junk mailbox. We can think of the filter's decision as a hypothesis test. The null hypothesis is that the e-mail is a real message and should go to your inbox. A higher point total provides evidence that the message may be spam; when there's sufficient evidence, the filter rejects the null, classifying the message as junk. This usually works pretty well, but, of course, sometimes the filter makes a mistake.

a) When the filter allows spam to slip through into your inbox, which kind of error is that?

b) Which kind of error is it when a real message gets classified as junk?

c) Some filters allow the user (that's you) to adjust the cutoff. Suppose your filter has a default cutoff of 50 points, but you reset it to 60. Is that analogous to choosing a higher or lower value of α for a hypothesis test? Explain.

d) What impact does this change in the cutoff value have on the chance of each type of error?
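The point-system filter and its two kinds of mistakes can be sketched in a few lines. This is a minimal illustration under stated assumptions: the score lists `REAL_SCORES` and `SPAM_SCORES` and the helper names `is_junk` and `error_rates` are hypothetical, and the numbers are made up rather than drawn from any real filter.

```python
# Hypothetical point totals; the values are invented for illustration only.
REAL_SCORES = [5, 12, 20, 33, 41, 48, 52, 58]   # messages that are truly real
SPAM_SCORES = [45, 55, 57, 62, 70, 81, 90, 95]  # messages that are truly spam


def is_junk(points, cutoff):
    """Reject the null hypothesis (message is real) unless points are below the cutoff."""
    return points >= cutoff


def error_rates(cutoff):
    """Return (type_i_rate, type_ii_rate) for the sample scores above."""
    # Type I error: a real message is rejected, i.e. diverted to junk.
    type_i = sum(is_junk(s, cutoff) for s in REAL_SCORES) / len(REAL_SCORES)
    # Type II error: spam is not rejected, i.e. it reaches the inbox.
    type_ii = sum(not is_junk(s, cutoff) for s in SPAM_SCORES) / len(SPAM_SCORES)
    return type_i, type_ii


for cutoff in (50, 60):
    t1, t2 = error_rates(cutoff)
    print(f"cutoff={cutoff}: Type I rate={t1:.3f}, Type II rate={t2:.3f}")
```

Running it with the default cutoff of 50 and the reset value of 60 shows how the two error rates move on this sample when the cutoff changes; real e-mail would of course give different numbers.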






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd