Analyze the wine quality data set

Assignment Help Other Subject
Reference no: EM132290605 , Length: word count:4000

Assignment Requirement

In this assignment, you work as a professional to write a report to your client. The report is on a decision tree analysis of a data set from your client. The assignment is quite open and the quality depends on what you do, how you do it, how you interpret the results, and whether the results are convincing.

You are asked to analyze the Wine Quality data set which you have used in a practical. The aim of the analysis is to show how wine quality is affected by other factors of the data set. You are asked to write a report of your analysis. Your report should cover the following areas. Length of each part should be within 5 pages.

1. Initial inspection of the dataset and discussion on how the observation may affect the result of the analysis.
In this part, you would include information about column types, value distributions, skewness, etc., and discuss possible impact of the observed properties on the classifiers to be leant.
You would also investigate the correlation of columns to see if some attributes are dependent, and indicate how the dependence affects the learnt classifier, and whether it is necessary to apply dimension reduction/feature selection.

2. Building a decision tree model using the SAS decision tree algorithm.

In this part, you need to try many major parameters in the properties on the left-hand side of the decision tree node. You then choose the top three trees to present their important parameter settings and their performance results including precision, recall, F-score and other performance indicators that you want to include. You then present the best tree, describe it, and interpret it. Interpretation includes major (affecting many tuples) split attributes and major decision groups, etc. Your interpretation must be in the language understandable by people who are not from the technical area.

Before presenting the results, you should also summarize how data is processed for this model, what features are used, and how data is partitioned.

3. Use data in a different way and then build another decision tree.
"Different way" may mean binning columns differently or using different features, or another aspect/consideration.
The building of the model and the presentation of the results are the same as Task 2.

4. Model comparison.

You compare the two models you obtained above. You draw conclusions from them. At the end, you describe what you learnt from this analysis.

Your report must have a cover, TOC, and an executive summary (1/2-3/4 of a page) in addition to the formal contents. The font needs to be Ebrima or Calibri of size 10, single spaced.

Attachment:- Assignment--wine Quality.zip

Reference no: EM132290605

Questions Cloud

Solve the question using the Green Theorem : Use Green's Theorem to compute. Where C is the unit circle, oriented counterclockwise. Note - Please explain how you get the answer
Prepare a report based on content and submit through lms : MGT315 - Project Management - Prepare a project report considering a product of your choice by explaining effective project management techniques
Does the infinite sum coverage : Question - Does the infinite sum coverage?
Discuss the concept of contract consideration. would bart : How will the Court rule ? Why ? Please discuss and explain what analysis the court will employ to decide if MVCC has substantially performed the contract.
Analyze the wine quality data set : Analyze the Wine Quality data set which you have used in a practical. The aim of the analysis is to show how wine quality is affected by other factors
The determination of risks related to the six challenges : The archetypes contribute to the private sector in the determination of risks related to the six challenges associated with Homeland Security.
Discuss your reaction to the public service announcement : Discuss your reaction to the public service announcement (PSA) video below in relation to the marketing and advertising of junk food to children.
Describe how intelligence analysis may be used by law : Discuss analysis and crime investigative methods that may be used identifying gangs and/or gang activities.
Analyze the american correctional system : Analyze the American correctional system and its use of alternative programs when administering justice.

Reviews

len2290605

4/22/2019 4:39:15 AM

Your report must have a cover, TOC, and an executive summary (1/2-3/4 of a page) in addition to the formal contents. The font needs to be Ebrima or Calibri of size 10, single spaced. The assignment is quite open and the quality depends on what you do, how you do it, how you interpret the results, and whether the results are convincing.

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd