Explains the approach that you used to mine the dataset

Reference no: EM131027492

Assignment:

Applied Frequent Itemset or Association Rule Mining: Choose a dataset that is well suited for frequent itemset or association rule mining. A good number of datasets can be found in the UCI Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets.html), but feel free to use any dataset that you want. You will want to stick with datasets that are categorical in nature. Categorical datasets can be found in the UCI Machine Learning Repository by selecting the Categorical link in the Attribute Type box in the left-side navigation menu. Numerical datasets will have to be discretized so that itemsets can be created.
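To illustrate the discretization step mentioned above, here is a minimal sketch in Python. It is not part of the assignment, which asks for R or RapidMiner. It assumes equal-width binning into three labels. The column names, the sample rows, and the helper names `equal_width_bins` and `to_transactions` are all hypothetical.

```python
from typing import Dict, List, Sequence, Set

# Hypothetical labels for three equal-width bins.
LABELS = ["low", "mid", "high"]


def equal_width_bins(values: Sequence[float], n_bins: int) -> List[int]:
    """Assign each value to one of n_bins equal-width intervals (0-based)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / n_bins
    # min(...) keeps the maximum value inside the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def to_transactions(rows: List[Dict[str, object]], numeric: Set[str]) -> List[Set[str]]:
    """Turn each row into a set of "column=value" items, binning numeric columns."""
    bins = {
        col: equal_width_bins([float(r[col]) for r in rows], len(LABELS))
        for col in numeric
    }
    transactions = []
    for i, row in enumerate(rows):
        items = set()
        for col, value in row.items():
            if col in numeric:
                items.add(f"{col}={LABELS[bins[col][i]]}")
            else:
                items.add(f"{col}={value}")
        transactions.append(items)
    return transactions


# Made-up sample data: one numeric column and one categorical column.
rows = [
    {"age": 20, "outlook": "sunny"},
    {"age": 35, "outlook": "rainy"},
    {"age": 50, "outlook": "sunny"},
]
transactions = to_transactions(rows, {"age"})
```

Equal-width binning is only one option. Equal-frequency bins or domain-specific cut points may suit a given dataset better, and the choice belongs in the preprocessing part of the report.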

Once you have selected a dataset, you can use a tool such as the arules package in R or RapidMiner to mine frequent itemsets or association rules in the dataset.
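For readers who want to see what such a tool does internally, the following is a minimal sketch of Apriori-style frequent itemset mining in plain Python. It is an illustration only and does not reproduce the arules or RapidMiner interfaces. The function name `apriori`, the sample baskets, and the 0.5 support threshold are assumptions made for the example.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List, Set


def apriori(transactions: List[Set[str]], min_support: float) -> Dict[FrozenSet[str], float]:
    """Return every itemset whose support is at least min_support."""
    n = len(transactions)

    def support(itemset: FrozenSet[str]) -> float:
        # Fraction of transactions that contain every item in the itemset.
        return sum(1 for t in transactions if itemset <= t) / n

    # Start with all single-item candidates.
    current = {frozenset([item]) for t in transactions for item in t}
    frequent: Dict[FrozenSet[str], float] = {}
    k = 1
    while current:
        # Keep only the candidates that meet the support threshold.
        level = {}
        for candidate in current:
            s = support(candidate)
            if s >= min_support:
                level[candidate] = s
        frequent.update(level)
        k += 1
        # Join step: merge pairs of frequent (k-1)-itemsets into k-itemsets.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        current = {
            c for c in candidates
            if all(frozenset(sub) in level for sub in combinations(c, k - 1))
        }
    return frequent


# Made-up market-basket data for illustration.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
frequent = apriori(baskets, min_support=0.5)
```

With these four baskets and a threshold of 0.5, the pair milk and butter appears in only one basket and is dropped, so the three-item candidate is pruned without being counted.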

The deliverable for this project will be:

1- The data that you used

2- The code in R or RapidMiner

3- The report that details your experiment. The report should be in either ACM or IEEE conference paper format and should include four sections. The introductory section details the dataset and the objectives of the analysis. The methodology section explains the approach that you used to mine the dataset, including the algorithms and parameters (e.g. confidence and support), as well as any steps that you had to take to preprocess the data. The results section shows the results of your analysis and any interesting patterns that you found. The conclusion section summarizes your results and discusses the limitations of your approach and any difficulties that you had with your experiment.
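The support and confidence parameters named in the methodology requirement can be made concrete with a short sketch. The Python below computes both measures for a single rule under the standard definitions. The function name `rule_metrics`, the sample baskets, and the threshold values are hypothetical and are not taken from any particular tool.

```python
from typing import List, Set, Tuple


def rule_metrics(
    transactions: List[Set[str]],
    antecedent: Set[str],
    consequent: Set[str],
) -> Tuple[float, float]:
    """Return (support, confidence) for the rule antecedent -> consequent."""
    n = len(transactions)
    count_antecedent = sum(1 for t in transactions if antecedent <= t)
    count_both = sum(1 for t in transactions if (antecedent | consequent) <= t)
    # Support: share of all transactions that contain both sides of the rule.
    support = count_both / n
    # Confidence: share of antecedent transactions that also contain the consequent.
    confidence = count_both / count_antecedent if count_antecedent else 0.0
    return support, confidence


# Made-up market-basket data for illustration.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

# Hypothetical thresholds; a real experiment would justify these in the report.
MIN_SUPPORT = 0.5
MIN_CONFIDENCE = 0.8

support, confidence = rule_metrics(baskets, {"butter"}, {"bread"})
keep_rule = support >= MIN_SUPPORT and confidence >= MIN_CONFIDENCE
```

Here the rule butter -> bread holds in both baskets that contain butter, so it passes both thresholds. The reverse rule bread -> butter has the same support but lower confidence, which is why the report should state both values for each rule.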

Links to format templates:
https://www.ieee.org/conferences_events/conferences/publishing/templates.html
https://www.acm.org/sigs/publications/proceedings-templates

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd