Reference no: EM132357470
Assignment - CLUSTER ANALYSIS
Instructions - Perform cluster analysis using several methods for different features.
How did you determine a suitable number of clusters for each method?
Use internal and external validation measures to describe and compare the clustering models and the clusters (some visual methods would be good).
Describe your results. What findings are the most interesting?
Required the codes used step by step Data file attached.
Some information about the data file:
Input variables:
1) S: employee's satisfaction level (on a scale of 0 to 1, 0 - the least satisfied, 1 - the most satisfied)
2) LPE: employee's last project evaluation level (on a scale of 0 to 1, 0 - the lowest, 1 - the highest)
3) NP: the number of projects worked on by the employee in the last 12 months.
4) ANH: the average numbers of hours worked weekly by the employee over the last 12 months
5) TIC: the time spent in the company in years (1 - one year, 2 - 2 years and so on)
6) newborn: whether or not the employee had a newborn within the last 12 months (0 - no newborn, 1 - had a newborn)
Outcome variable:
1) left: if the employee indeed left or not (0 - stay, 1 - left).
Attachment:- Statistics Assignment Files.rar