Compare the cluster centroids to characterize clusters

Reference no: EM132296289

Part 1: Frequent Flyers and Marketing

Overview - The dataset below, EastWestAirlinesCluster.csv, contains information on 3,999 passengers who belong to an airline's frequent flyer program.

For each passenger, the data include information on their mileage history and on different ways they accrued or spent miles in the last year. The goal is to try to identify clusters of passengers that have similar characteristics for the purpose of targeting different segments for different types of mileage offers.

In R, your job is to:

  • Apply hierarchical clustering with Euclidean distance and Ward's method. Make sure to normalize the data first. How many clusters appear?
  • Tell me: What would happen if the data were not normalized?
  • Compare the cluster centroids to characterize the different clusters, and try to give each cluster a label.
  • Check the stability of the clusters by removing a random 5% of the data (that is, by taking a random sample of 95% of the records) and repeating the analysis. Does the same picture emerge?
  • Use k-means clustering with the number of clusters that you found above. Does the same picture emerge?
  • Tell me: Which clusters would you target for offers, and what types of offers would you make to customers in each of those clusters?
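
The clustering steps listed above could be sketched roughly as follows. This is only an illustration under stated assumptions: it uses simulated data in place of EastWestAirlinesCluster.csv, the column names (balance, flight_miles, bonus_trans) are hypothetical stand-ins, and k = 3 is assumed rather than read from a real dendrogram. Without the scale() step, the variable with the largest numeric range (here balance) would likely dominate the Euclidean distances.

```r
# Simulated stand-in for EastWestAirlinesCluster.csv (the real file is not read).
# Column names and group structure are illustrative assumptions.
set.seed(1)
n <- 300
grp <- rep(1:3, each = n / 3)
flyers <- data.frame(
  balance      = rnorm(n, mean = c(20000, 60000, 150000)[grp], sd = 8000),
  flight_miles = rnorm(n, mean = c(200, 3000, 500)[grp], sd = 150),
  bonus_trans  = rnorm(n, mean = c(5, 12, 25)[grp], sd = 2)
)

# Normalize so that no variable dominates the distance by scale alone.
flyers_norm <- scale(flyers)

# Hierarchical clustering: Euclidean distance with Ward's method.
# "ward.D2" applies Ward's criterion to unsquared Euclidean distances.
hc <- hclust(dist(flyers_norm, method = "euclidean"), method = "ward.D2")
# plot(hc, labels = FALSE)  # inspect the dendrogram to choose k

k <- 3  # assumed here; in practice, read it off the dendrogram
hc_labels <- cutree(hc, k = k)

# Cluster centroids in original units, used to characterize and label clusters.
centroids <- aggregate(flyers, by = list(cluster = hc_labels), FUN = mean)
print(centroids)

# Stability check: recluster a random 95% sample and cross-tabulate the labels.
keep <- sort(sample(n, size = round(0.95 * n)))
hc_sub <- hclust(dist(scale(flyers[keep, ])), method = "ward.D2")
sub_labels <- cutree(hc_sub, k = k)
stability <- table(full = hc_labels[keep], subsample = sub_labels)
print(stability)

# k-means with the same k, compared against the hierarchical solution.
km <- kmeans(flyers_norm, centers = k, nstart = 25)
agreement <- table(hierarchical = hc_labels, kmeans = km$cluster)
print(agreement)
```

In the two cross-tabulations, cluster numbers are arbitrary, so agreement shows up as one large count per row rather than as a filled diagonal.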

Part 2: Classifying Internet Discussion Posts

Overview - In this problem, you will use the data from the chapter assigned for this week, specifically Problem 20.6, "Online Discussions on Autos and Electronics," in which the task is to develop a model to classify documents as either auto-related or electronics-related.

In R, your job is to:

  • Load the above file into R and create a label vector.
  • Preprocess the documents. Explain what would be different if you did not perform the "stemming" step.
  • Use the lsa package in R to create 10 concepts. Explain how the concept matrix differs from the TF-IDF matrix.
  • Using this matrix, fit a predictive model (different from the model presented in the chapter illustration) to classify documents as autos or electronics. Compare its performance to that of the model presented in the chapter illustration.
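
To make the contrast between a TF-IDF matrix and a concept matrix concrete, here is a minimal sketch. It is not the lsa package workflow: it swaps in base R's svd() as a stand-in so that it runs without extra packages, the toy term counts are invented, and it keeps 2 concepts rather than the 10 the assignment asks for. The TF-IDF weighting shown is one common form, and packages may weight differently.

```r
# Toy term-document count matrix (rows = terms, columns = documents).
# The terms and counts are invented for illustration only.
tdm <- matrix(
  c(3, 2, 0, 0,
    2, 3, 1, 0,
    0, 0, 3, 2,
    0, 1, 2, 3,
    1, 1, 1, 1),
  nrow = 5, byrow = TRUE,
  dimnames = list(
    term = c("engine", "brake", "battery", "screen", "price"),
    doc  = c("auto1", "auto2", "elec1", "elec2")
  )
)

# TF-IDF: term frequency weighted by log inverse document frequency.
idf   <- log(ncol(tdm) / rowSums(tdm > 0))
tfidf <- tdm * idf  # recycling scales each row (term) by its idf

# Truncated SVD: keep only the first n_concepts singular vectors.
n_concepts <- 2
sv <- svd(tfidf)
concepts <- sv$v[, 1:n_concepts] %*% diag(sv$d[1:n_concepts])
rownames(concepts) <- colnames(tfidf)
colnames(concepts) <- paste0("concept", 1:n_concepts)

print(dim(t(tfidf)))  # documents x terms
print(dim(concepts))  # documents x concepts
print(round(concepts, 3))
```

Each document is described by one column per term in the TF-IDF matrix, but by only a few dense columns in the concept matrix, each a weighted combination of terms that can take negative values. A classifier would then be fitted on the concept columns together with the label vector.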

Attachment: Assignment Files.rar

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd