Generate numerous conditional pattern bases

Assignment Help Basic Computer Science
Reference no: EM131221913

(Implementation project) Many techniques have been proposed to further improve the performance of frequent itemset mining algorithms. Taking FP-tree- based frequent pattern growth algorithms (e.g., FP-growth) as an example, implement one of the following optimization techniques. Compare the performance of your new implementation with the unoptimized version.

(a) The frequent pattern mining method of Section 6.2.4 uses an FP-tree to generate conditional pattern bases using a bottom-up projection technique (i.e., project onto the prefix path of an item p). However, one can develop a top down projection technique, that is, project onto the suffix path of an item p in the generation of a conditional pattern base. Design and implement such a top down FP-tree mining method. Compare its performance with the bottom-up projection method.

(b) Nodes and pointers are used uniformly in an FP-tree in the FP-growth algorithm design. However, such a structure may consume a lot of space when the data are sparse. One possible alternative design is to explore array- and pointer-based hybrid implementation, where a node may store multiple items when it contains no splitting point to multiple sub-branches. Develop such an implementation and compare it with the original one.

(c) It is time and space consuming to generate numerous conditional pattern bases during pattern-growth mining. An interesting alternative is to push right the branches that have been mined for a particular item p, that is, to push them to the remaining branch (es) of the FP-tree. This is done so that fewer conditional pattern bases have to be generated and additional sharing can be explored when mining the remaining FP-tree branches. Design and implement such a method and conduct a performance study on it.

Reference no: EM131221913

Questions Cloud

Determine the applicable rule number from the code : The following situation involves a possible violation of the AICPA's code of professional conduct. For this situation, determine the applicable rule number from the code, and decide whether or not the code has been violated.
Identify major challenge facing leaders : Leaders in the Purchasing and Material Management field today and make two recommendations on how this challenge might be addressed to ensure organizational success?
How many potted roses plants must flores roses sell : Flores owned by Livia, provides a variety of rare roses bushes to local nurseries that sell livias roses to the end consumer ( landscapers and retail customers ). Livia grows the roses from cutting that she has specifically cultuvated for their unusa..
By how much should it devalue or revalue its currency : If Boversia wants to return the real exchange rate to its 2020 level, by how much should it devalue or revalue its currency in 2025?
Generate numerous conditional pattern bases : One possible alternative design is to explore array- and pointer-based hybrid implementation, where a node may store multiple items when it contains no splitting point to multiple sub-branches. Develop such an implementation and compare it with th..
Different problems from a cultural perspective : What role does culture play on managers in global situations. For example, a US firm in Saudi Arabia, Indonesia, Singapore or Germany would all face different problems from a cultural perspective. What about a foreign venture into North America.
Difference between an appreciation and a revaluation : What is the difference between a depreciation of a currency and a devaluation of a currency? - What is the difference between an appreciation and a revaluation?
Predetermined overhead rate annually : Harris Fabrics computes its predetermined overhead rate annually on the basis of direct labor-hours. At the beginning of the year, it estimated that 34,000 direct labor-hours would be required for the period’s estimated level of production. Compute t..
What you are going to do needs to be defined : This is a pretty big question. Consider all the work that first needs to be completed BEFORE a schedule can be constructed. What you are going to do needs to be defined.

Reviews

Write a Review

Basic Computer Science Questions & Answers

  Identifies the cost of computer

identifies the cost of computer components to configure a computer system (including all peripheral devices where needed) for use in one of the following four situations:

  Input devices

Compare how the gestures data is generated and represented for interpretation in each of the following input devices. In your comparison, consider the data formats (radio waves, electrical signal, sound, etc.), device drivers, operating systems suppo..

  Cores on computer systems

Assignment : Cores on Computer Systems:  Differentiate between multiprocessor systems and many-core systems in terms of power efficiency, cost benefit analysis, instructions processing efficiency, and packaging form factors.

  Prepare an annual budget in an excel spreadsheet

Prepare working solutions in Excel that will manage the annual budget

  Write a research paper in relation to a software design

Research paper in relation to a Software Design related topic

  Describe the forest, domain, ou, and trust configuration

Describe the forest, domain, OU, and trust configuration for Bluesky. Include a chart or diagram of the current configuration. Currently Bluesky has a single domain and default OU structure.

  Construct a truth table for the boolean expression

Construct a truth table for the Boolean expressions ABC + A'B'C' ABC + AB'C' + A'B'C' A(BC' + B'C)

  Evaluate the cost of materials

Evaluate the cost of materials

  The marie simulator

Depending on how comfortable you are with using the MARIE simulator after reading

  What is the main advantage of using master pages

What is the main advantage of using master pages. Explain the purpose and advantage of using styles.

  Describe the three fundamental models of distributed systems

Explain the two approaches to packet delivery by the network layer in Distributed Systems. Describe the three fundamental models of Distributed Systems

  Distinguish between caching and buffering

Distinguish between caching and buffering The failure model defines the ways in which failure may occur in order to provide an understanding of the effects of failure. Give one type of failure with a brief description of the failure

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd