Data mining using unsupervised and supervised learning

Assignment Help Database Management System
Reference no: EM13779816

Objectives: Data Mining using Unsupervised and Supervised Learning Approaches

Assume that a local company has collected a data set from their ecommerce website and ask you to analyze it. However, the company didn't provide much of background information about the data itself, e.g., the nature of attributes for the data set. However, based on the discussion with the people who collected the data and your observation on the data set, you felt that the first or second column, X1 or X2 may be decision column.

The basic strategy you will use is first to determine the decision column (or class attribute) using K-means clustering algorithm (unsupervised learning approach) to verify if the result of clustering is consistent with either attribute X1, X2, or both X1 and X2. Once the decision column(s) is determined, you build a model (or concepts) using supervised learning approach hoping that you will be able to offer an advice to the company for their business. To successfully complete the data analysis using this strategy, perform the following tasks:

(a) Use K-means algorithm (unsupervised learning) to cluster the data set and to verify the class field(s).

(b) Using the class field(s) determined in step (a), perform a supervised learning using any of those learning algorithms discussed in class such as Version Space, Decision Tree, and Neural Network, and build a model.

To perform above tasks, you are allowed to use either an existing system or program you implemented. However, in order to receive the maximum bonus points your program should work properly and must be powerful enough for effective data analysis. Otherwise, only a partial bonus point may be given. Therefore, it is more important to complete the above tasks (a) and (b) than implementing your own program.

Write a brief report that summarizes your data analysis activities and results including (1) your name(s) and contact email addresses; the percentage contribution to this assignment if the assignment was completed by a team. If a team cannot reach a consensus on the individual contribution, include the individual's claimed percent contribution with a brief description on specific tasks performed, (2) the language used for K-means algorithm implementation or the source of the software used, parameter settings such as K specifying how you determined the best K, clustering results, verified class field(s), and other relevant information to the task, (3) the name of the supervised learning algorithm used, the source of the implementation or software, parameter settings if any, the result of learning including the learned model and other relevant information, (4) the results of your data analysis, useful advice to the company's business, etc., and (5) other relevant discussion about your experience and data analysis results.

Reference no: EM13779816

Questions Cloud

Awareness of oppression and arousing sympathy of supporters : By creating awareness of oppression and arousing sympathy of supporters, the arts can be a form of protest. Identify and describe an example of how either black slaves or white abolitionists used the arts as a form of protest against slavery. Be s..
Intellectual disability, autism, and multiple disabilities : Identify areas of curriculum necessary for students with mild to moderate disabilities and explain why they are needed.
Research design and data collection : Identify the variables in this study. What are some extraneous variables that might impact your research? How would you control for extraneous variables?
Merits of the liquidators arguments : The merits of the liquidator's arguments, in British company law, that Mr Lay cannot recover his loan from the company and that he should instead be made to contribute to the company's debt on the ground that there is no difference between him and..
Data mining using unsupervised and supervised learning : Data Mining using Unsupervised and Supervised Learning Approaches, Use K-means algorithm (unsupervised learning) to cluster the data set and to verify the class field(s).
Write a paper about competence based education : Write a paper about Competence Based Education.
Internal and external stakeholders : Identify the company's goals and identify the following, specifically:
Find the optimal solution using the simplex method : Find the optimal solution using the simplex method based on the equation z= 2A+3B subject to the following constraints 2.1A+1B less than and equal to 6
Evidence-based psychological interventions : According to the text, the imbalance in the diversity of clinical psychologists

Reviews

Write a Review

Database Management System Questions & Answers

  Analyze the data in the database and in application exercise

To complete this assignment, you will need to do data calculations. Remember to follow good database practice here by not saving your calculations as part of the data table itself (they should appear only in your queries).

  Question 1 for each of the following tasks youll use the

question 1 for each of the following tasks youll use the xxxxxxxx database. you need only provide the query or command

  Explain user activity monitoring

In this lab, you will save user activity data in a database. A record of each user's IP address and the current date and time will be created whenever a user visits the Personnel form.

  You are the trainer for a major technology firm one of the

you are the trainer for a major technology firm. one of the problems your firm has is hiring new technologists who have

  Identify the data analytics tasks

Provide a clear statement of the aims and objectives of the data analytics study and the possible outcomes in terms of discovered knowledge and its potential application towards solution of the problem. In this section you need to discuss the busi..

  What is the functionality of the tool

What is the functionality of the tool and what is the actual running environment (software and hardware) of the tool - how will you evaluate the tool based on your own experience?

  Case study requirement and analysis disciplines through

case study requirement and analysis disciplines through analysis of a simple case study and to express the results

  Create three use case diagrams for the new billing

Using the Hillside School Case Study and your stage 1, 2, and 3 projects, develop a decision paper that serves as a system "sign-over" document for system deployment and transfer of responsibility for the newly designed and implemented system to t..

  The current database section found through the office button

In the Current Database section found through the Office Button, add Add Messages-Operator's Version as the application title. Make the start-up form Unread Messages open automatically when the application starts.

  Prove-leaves of binary search tree are located in bottom

Examples for small n are given bellow, where a small square box represents an unsuccessful search. Prove that leaves of any binary search tree are located in the bottom two levels.

  Task 1 create 3 rows of data for each table ensuring that

task 1 create 3 rows of data for each table ensuring that the referential integrity is valid.task 2 add the 30 rows of

  You have been approached by the owner of custom auto body

you have been approached by the owner of custom auto body to help set up an application that will automate the customer

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd