Compute the support for item sets

Assignment Help Theory of Computation
Reference no: EM132242002

Question: 1. Consider the data set shown in Table 5.1.

Table 5.1. Example of market basket transactions.

Customer ID

Transaction ID

Items Bought

1

0001

{a, d, e}

2

0024

{a, b, c, e}

2

0012

{a, b, d, e}

2

0031

{a, c, d, e}

3

0015

{b, c, e}

3

0022

{b, d, e}

3

0029

{c, d}

4

0040

{a, b, c}

5

0033

{a, d, e}

(a) Compute the support for item sets {e}, {b, d}, and {b, d, e} by treating each transaction ID as a market basket.

S({c}) =

S({e, d}) =

S({a, b, d}) =

S({c, d}) =

(b) Use the results in part (a) to compute the confidence for the association rules {b, d} → {e} and {e} → {b, d}. Is confidence a symmetric measure?

c(bd → e) =

c(e → bd) =

2. Consider the market basket transactions shown in Table 5.2.

Table 5.2. Market basket transactions

Transaction ID

Items Bought

1

{Milk, Beer, Diapers}

2

{Bread, Butter, Milk}

3

{Milk, Diapers, Cookies}

4

{Bread, Butter, Cookies}

5

{Beer, Cookies, Diapers}

6

{Milk, Diapers, Bread, Butter, Cheese}

7

{Bread, Butter, Diapers}

8

{Beer, Diapers}

9

{Milk, Diapers, Bread, Butter}

10

{Beer, Cookies}

(a) What is the maximum number of association rules that can be extractedfrom this data (including rules that have zero support)?

(b) What is the maximum size of frequent item sets that can be extracted(assuming minusup>0)?

(c) Write an expression for the maximum number of size-3 itemsets thatcan be derived from this data set.

(d) Find an itemset (of size 2 or larger) that has the largest support.

(e) Find a pair of items, a and b, such that the rules {a} → {b} and{b} → {a} have the same confidence.

3. The Apriori algorithm uses a generate-and-count strategy for deriving frequentitemsets. Candidate itemsets of size k+1 are created by joining a pairof frequent itemsets of size k (this is known as the candidate generation step).A candidate is discarded if any one of its subsets is found to be infrequentduring the candidate pruning step. Suppose the Apriori algorithm is appliedto the data set shown in Table

5.3 with minsup = 30%, i.e., any itemsetoccurring in less than 3 transactions is considered to be infrequent.

Table 5.3. Example of market basket transactions.

Transaction ID

Items Bought

1

{a, b, c}

2

{b, c, d}

3

{a, b, d}

4

{a, c, d}

5

{b, c}

6

{c, d}

7

{c}

8

{a, b, c}

9

{a, d}

10

{b, d}

(a) Draw an itemset lattice representing the data set given in Table 5.3.

Label each node in the lattice with the following letter(s):

• N: If the itemset is not considered to be a candidate itemset by the Apriori algorithm. There are two reasons for an itemset not to be considered as a candidate itemset: (1) it is not generated at all during the candidate generation step, or (2) it is generated during the candidate generation step but is subsequently removed during the candidate pruning step because one of its subsets is found to be infrequent.

• F: If the candidate itemset is found to be frequent by the Apriori algorithm.

• I: If the candidate itemset is found to be infrequent after support counting.

(b) What is the percentage of frequent itemsets (with respect to all itemsetsin the lattice)?

(c) What is the pruning ratio of the Apriori algorithm on this data set?(Pruning ratio is defined as the percentage of itemsets not consideredto be a candidate because (1) they are not generated during candidategeneration or (2) they are pruned during the candidate pruning step.)

(d) What is the false alarm rate (i.e, percentage of candidate itemsets thatare found to be infrequent after performing support counting)?

Reference no: EM132242002

Questions Cloud

Identify all the stakeholders of the computer system : KII5002 Client Support Assignment, Kingsford International Institute, Australia. Identify all the stakeholders of the computer system
Explain how quantum cryptography works : In this essay, you will explain how quantum cryptography works and what role you think it will play in the future of cryptography. You will also provide.
Compute the return on equity percentage : Indicate how each of Apix's ratios differ, and indicate whether the two other companies' ratios or Apix's ratios are indicative of better performance.
What do you feel is more important job satisfaction or pay : Do you think that employees have an expectation of privacy in the workplace? Why or why not?
Compute the support for item sets : Consider the data set shown in Table 5.1. Compute the support for item sets {e}, {b, d}, and {b, d, e} by treating each transaction ID as a market basket.
Ow the company aligns its benefits with its corporate values : The internal and external strengths and weaknesses identified and how the company responded to these factors from a total rewards perspective;
How did globalization affect ciscos structure : Discuss the organizational structure at Cisco Systems. What design changes were needed? How did globalization affect Cisco's structure?
Determine the total value of your investment : Provide your final opinion / assessment of your investments. Did you make money or lose money? Discuss your results and, based on hindsight.
Write a brief report to be delivered to your sponsor : Develop a response that includes examples and evidence to support your ideas, and which clearly communicates the required message to your audience.

Reviews

Write a Review

Theory of Computation Questions & Answers

  Draw a turing machine that decides the language of all words

Draw two different parse trees for the word cacbc and the grammar - Give the set of nullable nonterminals for the grammar

  Write algorithm for finding useless-productive nonterminals

Write down the algorithm for finding useless/productive nonterminals. Describe how this gives you the algorithm for whether language generated by grammar is empty.

  Develop state table of deterministic finite-state automaton

Given the state table of a deterministic finite-state automaton and a string, decide whether this string is recognized by the automaton.

  Single-row and group functions

Lab #6 will introduce the various aspects of the Single-Row and Group Functions available in the Oracle Database. Most functions can be used in either the SELECT statement or the WHERE clause, but more commonly are used in the SELECT.

  Show polynomial-time algorithm for gdp

Goal is to find expedition of maximum profit. Either show that there exists polynomial-time algorithm for GDP, or show that corresponding decision problem is NP-complete.

  Find a nondeterministic finite-state automaton

Find a nondeterministic finite-state automaton that recognizes each of the languages in Exercise, and has fewer states, if possible, than the deterministic.

  Construct a deterministic finite-state automaton

Construct a deterministic finite-state automaton that is equivalent to the nondeterministic automaton with the state diagram shown here.

  Iterated operations and bounded quantifiers

CSC 720 Theory of Computation - Primitive Recursive Predicates and Iterated Operations and Bounded Quantifiers

  Show that there is no finite-state automaton with two states

Show that there is no finite-state automaton with two states that recognizes the set of all bit strings that have one or more 1 bits and end with a 0.

  Find language of nondeterministic finite-state automaton

Find a deterministic finite-state automaton that recognizes the same language as the nondeterministic finitestate automaton in Exercise.

  Design a binary finite state automaton to accept all strings

Design a binary finite state automaton (FSA) to accept all strings that represent valid messages (for your particular codes and parity property) and reject all

  Construct a dfa for the two simpler languages

Construct a DFA for the two simpler languages, then combine them using the construction discussed in footnote 3 to give the state diagram of a DFA for the language given.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd