How many records would you expect to be removed

Reference no: EM131114153

1. A dataset has 1000 records and 50 variables, with 5% of the values missing, spread randomly throughout the records and variables. An analyst decides to remove every record that has a missing value. About how many records would you expect to be removed?
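
One way to reason about question 1 is sketched below. It assumes that each of the 50 values in a record is missing independently with probability 0.05, which is one reading of "spread randomly" and not something the question states outright. Under that assumption a record survives only if all 50 of its values are present. The variable names are illustrative.

```python
# Sketch for question 1, ASSUMING each value is missing independently
# with probability 0.05 (an interpretation of "spread randomly").
n_records = 1000
n_variables = 50
p_missing = 0.05

# A record is complete only if none of its 50 values is missing.
p_complete = (1 - p_missing) ** n_variables

# Every record with at least one missing value is removed.
expected_removed = n_records * (1 - p_complete)

print(f"P(record is complete) = {p_complete:.4f}")
print(f"Expected records removed = {expected_removed:.0f}")
```

Under this independence assumption, roughly 92% of the records contain at least one missing value, so about 923 of the 1000 records would be dropped.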

2. Given a database table containing weather data as follows:

Outlook    Temperature  Humidity  Windy  Class: Play
---------  -----------  --------  -----  -----------
Sunny      Hot          High      False  No
Sunny      Hot          High      True   No
Overcast   Hot          High      False  Yes
Rainy      Mild         High      False  Yes
Rainy      Cool         Normal    False  Yes
Rainy      Cool         Normal    True   No
Overcast   Cool         Normal    True   Yes
Sunny      Mild         High      False  No
Sunny      Cool         Normal    False  Yes
Rainy      Mild         Normal    False  Yes
Sunny      Mild         Normal    True   Yes
Overcast   Mild         High      True   Yes
Overcast   Hot          Normal    False  Yes
Rainy      Mild         High      True   No

Where Outlook, Temperature, Humidity, and Windy are the input variables (predictors), and Play is the output variable (response).

a. Compute the prior probabilities:

P(Play = 'Yes') =
P(Play = 'No') =

b. Compute the conditional probabilities:

P(Outlook = 'Sunny' | Play = 'Yes') =
P(Outlook = 'Sunny' | Play = 'No') =

P(Temperature = 'Mild' | Play = 'Yes') =
P(Temperature = 'Mild' | Play = 'No') =

P(Humidity = 'High' | Play = 'Yes') =
P(Humidity = 'High' | Play = 'No') =

P(Windy = 'False' | Play = 'Yes') =
P(Windy = 'False' | Play = 'No') =
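
The priors and conditionals in parts a and b are relative frequencies, so they can be read off the table by counting. The sketch below is one possible way to do that; the helper names `prior` and `conditional` are illustrative, and the rows are transcribed from the weather table in question 2. Exact fractions are used so the results can be compared with a hand count.

```python
from collections import Counter
from fractions import Fraction

# Rows transcribed from the weather table; the last field is the class (Play).
COLUMNS = ("Outlook", "Temperature", "Humidity", "Windy")
ROWS = [line.split() for line in """
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No
""".strip().splitlines()]

class_counts = Counter(row[-1] for row in ROWS)


def prior(label):
    """P(Play = label): share of all rows that have this class."""
    return Fraction(class_counts[label], len(ROWS))


def conditional(column, value, label):
    """P(column = value | Play = label): share of rows in the class with this value."""
    idx = COLUMNS.index(column)
    matches = sum(1 for row in ROWS if row[idx] == value and row[-1] == label)
    return Fraction(matches, class_counts[label])


for label in ("Yes", "No"):
    print(f"P(Play = {label}) = {prior(label)}")

queries = [("Outlook", "Sunny"), ("Temperature", "Mild"),
           ("Humidity", "High"), ("Windy", "False")]
for column, value in queries:
    for label in ("Yes", "No"):
        print(f"P({column} = {value} | Play = {label}) = "
              f"{conditional(column, value, label)}")
```

`Fraction` reduces its results automatically, so a count such as 3 out of 9 is printed as 1/3.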

3. Use the naïve Bayes classification method to classify the following unknown record and indicate whether to play or not.

(Outlook = 'Sunny', Temperature = 'Mild', Humidity = 'High', Windy = 'False')
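
A minimal sketch of the naïve Bayes decision for this record follows. It multiplies the class prior by the four per-attribute conditional frequencies and picks the class with the larger product. It assumes plain relative frequencies with no Laplace smoothing (the question does not mention smoothing), and the function name `score` is illustrative. The rows are transcribed from the weather table in question 2.

```python
from fractions import Fraction

# Rows transcribed from the weather table: Outlook, Temperature, Humidity,
# Windy, and the class (Play) as the last field.
ROWS = [line.split() for line in """
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No
""".strip().splitlines()]


def score(record, label):
    """Unnormalized naive Bayes score: prior times the product of conditionals.

    ASSUMPTION: raw relative frequencies, no smoothing.
    """
    in_class = [row for row in ROWS if row[-1] == label]
    result = Fraction(len(in_class), len(ROWS))  # prior P(Play = label)
    for idx, value in enumerate(record):
        matches = sum(1 for row in in_class if row[idx] == value)
        result *= Fraction(matches, len(in_class))  # P(attribute = value | label)
    return result


record = ("Sunny", "Mild", "High", "False")
scores = {label: score(record, label) for label in ("Yes", "No")}
prediction = max(scores, key=scores.get)

for label, value in scores.items():
    print(f"score(Play = {label}) = {value} ~ {float(value):.4f}")
print(f"Predicted class: Play = {prediction}")
```

With these counts the score for 'No' (24/875, about 0.0274) exceeds the score for 'Yes' (8/567, about 0.0141), so this sketch classifies the record as Play = 'No'. Dividing each score by their sum would give normalized posterior probabilities, but the comparison alone is enough for the decision.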

4. Association Rule Mining:

Given the following transaction database for mining association rules:

Database D

TID   Items
---   -------
100   A C D
200   B C E
300   A B C E
400   B E

Please use the Apriori algorithm to mine association rules with a minimum support count of 2.

(Please show the derivation process step by step with candidate itemsets.)
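
The level-by-level derivation can be checked with a short sketch of the frequent-itemset phase of Apriori. It prints the candidate sets C1, C2, ... with their support counts and the frequent sets L1, L2, ... that survive the minimum support count of 2. The function names are illustrative, and the sketch stops at frequent itemsets because the question gives no minimum confidence for the rules themselves.

```python
from itertools import combinations

# Transactions copied from Database D.
TRANSACTIONS = {
    100: {"A", "C", "D"},
    200: {"B", "C", "E"},
    300: {"A", "B", "C", "E"},
    400: {"B", "E"},
}
MIN_SUPPORT_COUNT = 2


def support_count(itemset, transactions):
    """Number of transactions that contain every item of the itemset."""
    return sum(1 for items in transactions.values() if itemset <= items)


def apriori(transactions, min_count):
    """Return one (candidates, frequent) pair per level k = 1, 2, ...

    Both members are dicts mapping frozenset -> support count.
    """
    items = sorted(set().union(*transactions.values()))
    candidates = [frozenset([item]) for item in items]
    levels = []
    k = 1
    while candidates:
        counted = {c: support_count(c, transactions) for c in candidates}
        frequent = {c: n for c, n in counted.items() if n >= min_count}
        levels.append((counted, frequent))
        k += 1
        # Join step: combine frequent (k-1)-itemsets into k-itemsets.
        joined = {a | b for a in frequent for b in frequent if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = sorted(
            (c for c in joined
             if all(frozenset(s) in frequent for s in combinations(c, k - 1))),
            key=sorted,
        )
    return levels


def show(itemsets):
    """Render {frozenset: count} with itemsets written as strings like 'BCE'."""
    return {"".join(sorted(s)): n for s, n in itemsets.items()}


for k, (counted, frequent) in enumerate(
        apriori(TRANSACTIONS, MIN_SUPPORT_COUNT), start=1):
    print(f"C{k}: {show(counted)}")
    print(f"L{k}: {show(frequent)}")
```

On this database the sketch ends with the single frequent 3-itemset {B, C, E} (support count 2). Candidate rules such as BC => E can then be scored by dividing the support count of {B, C, E} by the support count of the rule's left-hand side, once a minimum confidence is chosen.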

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd