What could you do to increase the filter’s power

Reference no: EM131381616

More spam. Consider again the points-based spam filter described in Exercise 16. When the points assigned to various components of an e-mail exceed the cutoff value you've set, the filter rejects its null hypothesis (that the message is real) and diverts that e-mail to a junk mailbox.

a) In this context, what is meant by the power of the test?

b) What could you do to increase the filter's power?

c) What's the disadvantage of doing that?
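One way to explore these questions numerically is sketched below. It assumes, purely for illustration, that point totals for real messages and for spam each follow a normal distribution; the means and standard deviation are made-up values, not part of the exercise. Under that assumption, power is the probability that a spam message is diverted, and alpha is the probability that a real message is diverted.

```python
from statistics import NormalDist

# Hypothetical score distributions (assumptions, not given in the exercise).
REAL = NormalDist(mu=30, sigma=10)   # point totals of real messages
SPAM = NormalDist(mu=65, sigma=10)   # point totals of spam messages


def alpha(cutoff):
    """P(Type I error): a real message scores at or above the cutoff."""
    return 1 - REAL.cdf(cutoff)


def power(cutoff):
    """P(correctly rejecting the null): spam scores at or above the cutoff."""
    return 1 - SPAM.cdf(cutoff)


if __name__ == "__main__":
    for cutoff in (40, 50, 60):
        print(f"cutoff={cutoff}: alpha={alpha(cutoff):.3f}, power={power(cutoff):.3f}")
```

In this sketch, lowering the cutoff raises the power and raises alpha along with it, so more real messages land in the junk mailbox.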

Exercise 16

Spam. Spam filters try to sort your e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming e-mail and assigns points to the sender, the subject, key words in the message, and so on. The higher the point total, the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than that cutoff passes through to your inbox, and the rest, suspected to be spam, are diverted to the junk mailbox. We can think of the filter's decision as a hypothesis test. The null hypothesis is that the e-mail is a real message and should go to your inbox. A higher point total provides evidence that the message may be spam; when there's sufficient evidence, the filter rejects the null, classifying the message as junk. This usually works pretty well, but, of course, sometimes the filter makes a mistake.

a) When the filter allows spam to slip through into your inbox, which kind of error is that?

b) Which kind of error is it when a real message gets classified as junk?

c) Some filters allow the user (that's you) to adjust the cutoff. Suppose your filter has a default cutoff of 50 points, but you reset it to 60. Is that analogous to choosing a higher or lower value of α for a hypothesis test? Explain.

d) What impact does this change in the cutoff value have on the chance of each type of error?
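The point system and its two kinds of mistakes can be sketched in a few lines. This is a minimal illustration, not the filter from the exercise: the keyword list, the point values, and the names `score`, `is_junk`, and `error_type` are all hypothetical. Only the cutoff rule (a total lower than the cutoff passes, the rest are diverted) and the null hypothesis (the message is real) come from the text.

```python
# Hypothetical point values; the exercise does not specify any.
KEYWORD_POINTS = {"free": 20, "winner": 30, "urgent": 15}
UNKNOWN_SENDER_POINTS = 25


def score(message, known_senders):
    """Add up points for the sender and for key words in the subject and body."""
    points = 0
    if message["sender"] not in known_senders:
        points += UNKNOWN_SENDER_POINTS
    text = (message["subject"] + " " + message["body"]).lower()
    for word, value in KEYWORD_POINTS.items():
        if word in text:
            points += value
    return points


def is_junk(message, known_senders, cutoff=50):
    """Reject the null (message is real) when the total reaches the cutoff."""
    return score(message, known_senders) >= cutoff


def error_type(is_spam, flagged):
    """Name the mistake, if any, given the truth and the filter's decision."""
    if flagged and not is_spam:
        return "Type I"    # real message diverted to junk
    if is_spam and not flagged:
        return "Type II"   # spam slipped through to the inbox
    return None            # correct decision


known = {"friend@example.com"}
real_msg = {"sender": "newcontact@example.com",
            "subject": "Raffle winner announced", "body": "See you Friday."}
spam_msg = {"sender": "promo@example.net",
            "subject": "You are a WINNER", "body": "Claim your prize"}

for cutoff in (50, 60):
    for name, msg, truth in (("real", real_msg, False), ("spam", spam_msg, True)):
        flagged = is_junk(msg, known, cutoff)
        print(cutoff, name, score(msg, known), error_type(truth, flagged))
```

Both toy messages score 55 points here, so moving the cutoff from 50 to 60 trades a Type I error on the real message for a Type II error on the spam.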

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd