Advertising campaign to target specific clusters

Reference no: EM132804715

Finance Data Analysis Assignment

Assignment 3 takes the same form as the other two assessment items: you are required to perform some data analysis and write up your results. Again, marks will be allocated to both the technical side and the quality of the interpretations. Please write your responses below the questions in a Word document.

Question 1 - Infant Mortality

The file infant.xls contains World Bank data on country-level infant mortality rates across the developed and developing world. The data set contains 394 observations over a 25-year period (from 1990-2014) and includes a number of potential covariates that might plausibly affect health outcomes. The variables are defined as follows:

Infant Mortality - The number of deaths per 1000 live births.
GDP - The real GDP per capita (in US dollars) for each country.
Year - The year the observation was recorded.
Contraceptive Prevalence - The proportion of individuals with access to contraceptives.
Physicians - The number of doctors per 1000 people.
Sanitation - The proportion of individuals with access to modern sanitation facilities.
Education - A measure of educational attainment in terms of years of post-primary attendance.
Your task is to produce an econometric model that will yield useful information for understanding the factors that drive infant mortality. The idea is ultimately to inform policies that could alleviate problems associated with infant mortality in developing countries. You should analyse the data and produce a small report providing expert advice based upon your results.
When interpreting your model you should also consider at least some of the following:
(1) The quality of the fit
(2) The sign and significance of the coefficients
(3) Whether the specification of your model is appropriate for the data
(4) What the model ultimately implies about infant mortality rates, and what can be done about them.
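
For readers working outside Excel, the sketch below shows one way an ordinary least squares fit and its R-squared could be computed. It is a minimal illustration under stated assumptions, not the required method: the six data rows are made up for demonstration (they are not taken from infant.xls), and the choice of log GDP and sanitation as regressors is only one possible specification.

```python
import math

def ols(X, y):
    """Return OLS coefficients by solving the normal equations (X'X) b = X'y."""
    n, k = len(X), len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(X[r][i] * X[r][j] for r in range(n)) for j in range(k)]
         + [sum(X[r][i] * y[r] for r in range(n))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                f = A[r][col] / A[col][col]
                A[r] = [a - f * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]

def r_squared(X, y, beta):
    """Share of the variation in y accounted for by the fitted values."""
    fitted = [sum(b * x for b, x in zip(beta, row)) for row in X]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1 - ss_res / ss_tot

# Made-up illustrative values, NOT taken from infant.xls.
gdp = [500, 1200, 3000, 8000, 20000, 45000]
sanitation = [0.20, 0.35, 0.50, 0.70, 0.90, 0.99]
mortality = [95, 70, 48, 25, 9, 4]

# Assumed specification: constant, log of GDP per capita, sanitation share.
X = [[1.0, math.log(g), s] for g, s in zip(gdp, sanitation)]
beta = ols(X, mortality)
r2 = r_squared(X, mortality, beta)
print("intercept, log(GDP), sanitation:", [round(b, 3) for b in beta])
print("R-squared:", round(r2, 3))
```

Taking the log of GDP is a common choice when a variable spans several orders of magnitude, but whether it suits this data set is something to check against the actual observations.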

Question 2 - Psychological Traits and Violent Crime

Regression models can be used in an incredibly wide array of contexts, including areas such as psychology and criminology. In this (fictitious) example you are to take some data on the incidence of violent crime, and use it to produce a model that can identify suspects in a murder case. The file crime.xls has data on a set of individual-level personality traits obtained from the Big Five and Dark Triad constructs. You also have data on whether the individual has been convicted of a violent crime. The variables are given in the file crime.xls and appear as follows:

Extroversion {1-10} - 10 indicates more extroverted.
Openness to Experience {1-10} - 10 indicates more openness.
Sensitivity {1-10} - 10 indicates more sensitivity to negative emotion.
Agreeableness {1-10} - 10 indicates more agreeable.
Conscientiousness {1-10} - 10 indicates more conscientious.
Narcissism {1-7} - 7 indicates higher levels.
Psychopathy {1-7} - 7 indicates higher levels.
Machiavellianism {1-7} - 7 indicates higher levels.
Age - measured in years.
Female {0-1} where 1 indicates female.
Offence {0-1} where 1 indicates has been convicted of a violent offence.
Your task is to analyse the data set in order to extract some practical insights into the psychology of violent crime.
• Calculate the average values of the eight psychometric variables for individuals who have, and have not, been convicted of violent offences. Report your results. Which variables have the biggest differences between the two groups? Discuss.
• Estimate a linear probability model explaining violent crime convictions as a function of age, gender, and the suite of personality characteristics given above {hint: this is a straightforward regression model in Excel using convictions as the dependent variable}. You should interpret the psychometric variables as continuous while gender is a dummy variable.
• Interpret your model with the aim of providing useful information for non-statisticians. Which variables are the most important in this multiple regression model?
• In the first dot-point, you identified key variables by looking for large discrepancies in average values between our violent and non-violent groups. In the second, you identified key variables using a regression model. Briefly discuss the difference between these two approaches. Which one is more likely to identify variables that cause violent crime? Why?
• Suppose you are part of an investigation into a murder case. The police have identified four suspects and a personality evaluation is given to each. Using your model, obtain a prediction for y (i.e. $\hat{y} = \hat{\beta}_0 + \hat{\beta}_1 x_1 + \dots + \hat{\beta}_k x_k$) for each suspect using the data below. Which one(s) would you recommend the police focus their attention on? Why?

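| Variable | John Smith | Jane Doe | Adam Jones | Brian Greene |
|---|---|---|---|---|
| Extraversion | 7 | 4 | 5 | 2 |
| Openness | 3 | 3 | 8 | 4 |
| Sensitivity | 3 | 7 | 2 | 6 |
| Agreeableness | 7 | 7 | 3 | 3 |
| Conscientiousness | 8 | 8 | 7 | 8 |
| Narcissism | 3 | 5 | 7 | 2 |
| Psychopathy | 2 | 3 | 6 | 3 |
| Machiavellianism | 2 | 2 | 7 | 6 |
| Age | 36 | 41 | 34 | 32 |
| Female | 0 | 1 | 0 | 0 |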
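
The prediction step for the four suspects can be sketched as follows. The suspect values are those given above; the intercept and coefficients are hypothetical placeholders invented purely for illustration and must be replaced with the estimates from your own regression output. Note that a linear probability model can produce predictions below 0 or above 1, so the fitted values are best read as a ranking aid rather than as exact probabilities.

```python
# Order of the explanatory variables, matching the suspect data above.
TRAITS = ["Extraversion", "Openness", "Sensitivity", "Agreeableness",
          "Conscientiousness", "Narcissism", "Psychopathy",
          "Machiavellianism", "Age", "Female"]

# HYPOTHETICAL placeholder estimates, not results from crime.xls.
INTERCEPT = 0.20
COEF = {
    "Extraversion": 0.01, "Openness": 0.00, "Sensitivity": 0.01,
    "Agreeableness": -0.02, "Conscientiousness": -0.02,
    "Narcissism": 0.02, "Psychopathy": 0.05, "Machiavellianism": 0.03,
    "Age": -0.002, "Female": -0.05,
}

# Suspect values as given in the assignment.
SUSPECTS = {
    "John Smith":   [7, 3, 3, 7, 8, 3, 2, 2, 36, 0],
    "Jane Doe":     [4, 3, 7, 7, 8, 5, 3, 2, 41, 1],
    "Adam Jones":   [5, 8, 2, 3, 7, 7, 6, 7, 34, 0],
    "Brian Greene": [2, 4, 6, 3, 8, 2, 3, 6, 32, 0],
}

def predict(values):
    """Fitted value: intercept plus the sum of coefficient * variable."""
    return INTERCEPT + sum(COEF[t] * v for t, v in zip(TRAITS, values))

predictions = {name: predict(vals) for name, vals in SUSPECTS.items()}

# List suspects from highest to lowest predicted value.
for name, p in sorted(predictions.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:13s} {p:.3f}")
```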

Question 3 - Non-linear Functional Forms

Most companies face a dilemma when setting prices for their products. If they set the price of a good too low, then a high volume of sales will not be enough to make up for a small margin on each item. On the other hand, if they set the price too high, the large margin per sale will not make up for a reduced quantity. Maximising profit therefore requires a balancing act that trades off prices and quantities.
The file textiles.xls has data on textile profits, prices per unit of output, and production costs {a quality index, labour costs, price of materials}. You are to analyse the data with a statistical model and produce a brief statement advising a textile company on pricing policy.

• Since firms are trying to maximise profits, estimate a model for profit of the form below using Excel, and report your results.

$\text{Profit} = \beta_0 + \beta_1\,\text{Price} + \beta_2\,\text{Price}^2 + \beta_3\,\text{Qual} + \beta_4\,\text{Labour} + \beta_5\,\text{Materials} + e$

• Briefly interpret your model. Do your control variables (quality, labour, materials) have the expected signs? Why or why not?
• What percentage of the variation in profit is accounted for by the model? Explain.
• The parameter $\beta_2$ (and the quadratic transform $\text{Price}^2$) produce the non-linearity in the model. What sign (positive or negative) do we expect for $\beta_2$? Why?
• The optimal price in this model is given by $-\beta_1 / (2\beta_2)$. What price would you recommend a textile manufacturer to charge per unit of output?
• Statistical models can often be improved by including more variables in the estimation. Suggest two variables that you could include in your model that would improve its predictive capacity.
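
The optimal-price expression follows from the first-order condition of the profit model above. As a short sketch, holding the control variables fixed and differentiating with respect to price gives:

```latex
\begin{align*}
\frac{\partial\,\text{Profit}}{\partial\,\text{Price}}
  &= \beta_1 + 2\beta_2\,\text{Price} = 0 \\
\text{Price}^{*} &= -\frac{\beta_1}{2\beta_2} \\
\frac{\partial^{2}\,\text{Profit}}{\partial\,\text{Price}^{2}}
  &= 2\beta_2
\end{align*}
```

The stationary point is a maximum only when the second derivative $2\beta_2$ is negative, so the sign of the estimated $\beta_2$ is worth checking before the formula is applied.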

Question 4 - Clustering

Political scientists are aware that individuals respond in differing ways to various policy platforms and campaign messages. For example, some political messages will resonate strongly in certain segments of the community, and yet be very unpopular in other segments. For this reason, campaigns often like to split voters into groups and target each set of voters in different ways.
Suppose you work for an election campaign that wishes to target voters in this way. The file voting.xls has data on the ages, income levels, and population densities of the home addresses of 20 voters (note that the z-transformed versions are also available). You are to use a k-means clustering algorithm to show your campaign colleagues how such a breakdown may be performed.
• Clustering works by taking some randomly chosen "seed" points and allocating each observation to the nearest available seed. The process then "iterates" until the allocations are stable. Explain what is meant by the term "iterates" in this context.
• Take the z-transformed variables z_age, z_income and z_popdensity and perform a k-means clustering procedure to sort your data into three groups (hint: use the Excel file k-means for this). Approximately how many iterations did you need to use before the process was complete?
• Present three scatter plots showing your allocation of observations into clusters. Which cluster has the most observations? Which has the highest income? Which one is situated in the most densely packed urban area? Answer these questions by reporting and interpreting the centroid means for each cluster.
• Briefly explain how you could use your results to tailor your advertising campaign to target specific clusters within the broader population.
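
The assign-then-update loop described in the dot-points above can be sketched in a few lines of Python. This is a minimal illustration, not the Excel workbook the assignment refers to: the nine three-dimensional points are made up (they are not taken from voting.xls), and the seeds are chosen by hand rather than at random so that the run is reproducible.

```python
import math

def kmeans(points, seeds, max_iter=100):
    """Plain k-means: assign each point to its nearest centroid, recompute
    the centroids as cluster means, and repeat until nothing changes."""
    centroids = [list(s) for s in seeds]
    assignment = None
    for iteration in range(1, max_iter + 1):
        # Assignment step: index of the nearest centroid for every point.
        new_assignment = [
            min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Stable allocations mean the process has converged.
        if new_assignment == assignment:
            return centroids, assignment, iteration
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # leave an empty cluster's centroid where it is
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, assignment, max_iter

# Made-up z-scores in the order (z_age, z_income, z_popdensity).
points = [
    (-1.2, -1.0, 1.1), (-1.0, -1.1, 0.9), (-0.9, -0.8, 1.2),
    (0.1, 0.0, -0.1), (0.0, 0.2, 0.1), (-0.1, -0.1, 0.0),
    (1.1, 1.3, -1.0), (1.2, 0.9, -1.2), (0.9, 1.1, -0.9),
]
seeds = [points[0], points[3], points[6]]  # hand-picked, not random

centroids, assignment, iterations = kmeans(points, seeds)
print("cluster of each point:", assignment)
print("passes until stable:", iterations)
for c, centre in enumerate(centroids):
    print(f"cluster {c} centroid:", [round(v, 2) for v in centre])
```

The returned count includes the final pass that confirms the allocations no longer change. The centroid means printed at the end are the quantities the third dot-point asks you to report and interpret.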

Attachment:- Finance Data Analysis.rar

Verified Expert

In this Excel project, the analysis is carried out with the Data Analysis ToolPak. Correlation and regression analysis are used to analyse infant mortality. Linear regression and averages of the variables are used to compare individuals who have and have not been convicted of violent offences. Regression analysis is used to find the profit-maximising price. A clustering algorithm is used to analyse the voters.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd