Decision tree

Reference no: EM131109300

One major issue for any decision tree algorithm is how to choose the attribute on which to split the data set so that a well-balanced tree results. The most traditional approach is the ID3 algorithm, proposed by Quinlan in 1986. The detailed ID3 algorithm is shown in the slides, and the textbook discusses it in Section 18.3. For this problem, follow the ID3 algorithm and manually calculate the values listed below for a data set similar to (but not the same as) the one in the slides (p. 147). This exercise should give you deeper insight into how the ID3 algorithm executes. Note that the concepts discussed here (for example, entropy and information gain) are also very important in information theory and signal processing. The new data set is shown below. In this example, row 10 is removed from the original set and all other rows remain the same.

Following the conventions used in the slides, show the manual process and calculate the following values: Entropy(S), Entropy(S_{weather = sunny}), Entropy(S_{weather = windy}), Entropy(S_{weather = rainy}), Gain(S, weather), Gain(S, parents), and Gain(S, money). Based on the last three values, which attribute should be chosen to split on?

Please show in detail how you obtain the solutions.
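The slides are not reproduced here, so as a reminder, the definitions below are the standard ones used by ID3 and are assumed to match the slides' conventions. The symbols C (the set of decision categories), S_c (the rows of S with category c), and Values(A) (the distinct values of attribute A) are notation introduced for this sketch.

```latex
\begin{align}
\mathrm{Entropy}(S) &= -\sum_{c \in C} p_c \log_2 p_c,
  \qquad p_c = \frac{|S_c|}{|S|} \\
\mathrm{Gain}(S, A) &= \mathrm{Entropy}(S)
  - \sum_{v \in \mathrm{Values}(A)} \frac{|S_{A=v}|}{|S|}\,
    \mathrm{Entropy}\bigl(S_{A=v}\bigr)
\end{align}
```

A subset in which every row has the same category has entropy 0, and ID3 splits on the attribute with the largest gain.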

Weekend   Weather   Parents   Money   Decision (Category)
W1        Sunny     Yes       Rich    Cinema
W2        Sunny     No        Rich    Tennis
W3        Windy     Yes       Rich    Cinema
W4        Rainy     Yes       Poor    Cinema
W5        Rainy     No        Rich    Stay in
W6        Rainy     Yes       Poor    Cinema
W7        Windy     No        Poor    Cinema
W8        Windy     No        Rich    Shopping
W9        Windy     Yes       Rich    Cinema
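A manual calculation can be cross-checked with a short script. The following is a minimal Python sketch, not the method from the slides: it assumes the standard base-2 entropy and information-gain definitions, and the names `ROWS`, `ATTRS`, `entropy`, `subset`, and `gain` are hypothetical helpers introduced here for illustration.

```python
from collections import Counter
from math import log2

# The nine-row data set from the table: (weekend, weather, parents, money, decision).
ROWS = [
    ("W1", "Sunny", "Yes", "Rich", "Cinema"),
    ("W2", "Sunny", "No",  "Rich", "Tennis"),
    ("W3", "Windy", "Yes", "Rich", "Cinema"),
    ("W4", "Rainy", "Yes", "Poor", "Cinema"),
    ("W5", "Rainy", "No",  "Rich", "Stay in"),
    ("W6", "Rainy", "Yes", "Poor", "Cinema"),
    ("W7", "Windy", "No",  "Poor", "Cinema"),
    ("W8", "Windy", "No",  "Rich", "Shopping"),
    ("W9", "Windy", "Yes", "Rich", "Cinema"),
]

# Column index of each candidate attribute (hypothetical lookup table).
ATTRS = {"weather": 1, "parents": 2, "money": 3}


def entropy(rows):
    """Base-2 entropy of the decision column (last field) over the given rows."""
    counts = Counter(row[-1] for row in rows)
    total = len(rows)
    return sum(-(n / total) * log2(n / total) for n in counts.values())


def subset(rows, attr, value):
    """Rows whose attribute `attr` equals `value`."""
    col = ATTRS[attr]
    return [row for row in rows if row[col] == value]


def gain(rows, attr):
    """Information gain from splitting `rows` on `attr`."""
    col = ATTRS[attr]
    values = sorted({row[col] for row in rows})
    # Weighted average entropy of the subsets produced by the split.
    remainder = sum(
        len(part) / len(rows) * entropy(part)
        for part in (subset(rows, attr, v) for v in values)
    )
    return entropy(rows) - remainder


print(f"Entropy(S) = {entropy(ROWS):.4f}")
for weather in ("Sunny", "Windy", "Rainy"):
    part = subset(ROWS, "weather", weather)
    print(f"Entropy(S_weather={weather.lower()}) = {entropy(part):.4f}")
for attr in ATTRS:
    print(f"Gain(S, {attr}) = {gain(ROWS, attr):.4f}")
```

Comparing the three printed gains indicates which attribute ID3 would split on. If two gains come out equal, a tie-breaking rule is needed, and the source does not say which one the slides use.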
