Estimate a multiple linear regression

Assignment Help Econometrics
Reference no: EM132136265

Basic Econometrics Research Report Group Assignment -

This assignment uses data from the BUPA health insurance call centre. Each observation includes data from one call to the call centre. The variables describe several characteristics of the call (eg the length of the call, the amount of silence in the call), characteristics of the customer (eg state of residence, family type, number of adults and children), and measures of performance (eg net promoter score, sentiment score of the customer). In this assignment we are interested in predicting the net promoter score and the length of the call.

Please use the dataset CallCentre.dta and associated information file CC_DEFINITIONS_.XLSX to answer these questions. Use the software program STATA 15 available through RMIT MyDesktop for all data analysis. This is a group assignment where you can work alone or with up to three other students (a maximum group size of four). All group members will receive the same marks for the assignment. You must submit an electronic copy of your assignment in Canvas in pdf, doc or docx format. Hard copies will not be accepted. Show your tables and calculations as well as answering the questions in full sentences. Please make sure your tables of results are neatly formatted, not just copied and pasted from STATA, and that you write your answers in clear sentences. You should write no more than 1000 words (not including tables/calculations) in total for this assignment. The number of words, tables, graphs, calculations given in parentheses after each question are a guide.

1. Calculate descriptive statistics using the 'summarize' command for the variables net_promoter_score, total_silence, total_silence_weighted, agent_to_cust_index and agent_crosstalk_weighted and present the results in a table. Comment on what we learn about these variables from the descriptives. Graph a scatter plot of net_promoter_score against agent_crosstalk_weighted and describe the relationship between these two variables. (100 words, 1 table, 1 graph)

2. Estimate a multiple linear regression with net_promoter_score as the dependent variable and total_silence_weighted, agent_to_cust_index and agent_crosstalk_weighted as the explanatory (independent) variables. Predict the change in net_promoter_score associated with a 0.1 increase in total_silence_weighted and a 0.01 increase in agent_crosstalk_weighted. Assuming this is the correct model specification, are we sure that total_silence_weighted has a negative effect? [Hint: consider the t-statistic and p-value] (50 words, 1 table, 2 calculations)

3. Add dummy variables to the regression to control for all of the potential effects of State and Package. Make sure the base category is customers with the "HOSPITAL AND EXTRAS" package in NSW. Carefully interpret the estimated coefficient on the package1 dummy variable you have included. Why is this NOT a very important result? [Hint: Use the variable labels to include and interpret the correct variables, consider the descriptive statistics of the dummy variables to interpret their importance] (50 words, 1 table)

4. Include a quadratic specification of the variable "sentiment_score_cust" in the model along with the existing explanatory variables. Calculate and interpret the marginal effect of a 1 point change in "sentiment_score_cust" when sentiment_score_cust = 1 and when sentiment_score_cust=4. (50 words, 1 table, 2 calculations)

5. Explain the conditional mean independence assumption and assess its relevance with respect to the explanatory variable "sentiment_score_cust". [Hint: Think about factors that may be included in the error term of the regression: the customer's experience with the company (positive or negative), the general attitude of the customer towards call centre conversations (positive or negative) and whether these may be correlated with sentiment_score_cust] (100 words)

6. As agent time is a cost to their business, BUPA may also be interested in predicting lcall_duration (the natural log of call_duration). Design a regression model to predict lcall_duration. Choose the explanatory variables to include, and whether to include them as dummies/ logs/ polynomials/ interactions as you feel appropriate. Present the results of the descriptive statistics and your final regression model in tables. Discuss the statistical significance of the explanatory variables in your model. Discuss how you have designed your model with reference to the "Gauss Markov" assumptions and whether these assumptions are likely to be met. Interpret the results of THREE of your explanatory variables, which you consider to be the key drivers of lcall_duration (ie the length of the call). Do NOT include the variables net_promoter_score, nps_group3, sentiment_score_cust, call_duration or call_durationsq in your model. (400 words, 2 tables, 3 calculations).

Reference no: EM132136265

Questions Cloud

Compensation and benefits package they want : What do millennials need to consider to get the compensation and benefits package they want?
Reporting of quality performance : Discuss the organizations involved in public reporting of quality performance data for healthcare organizations.
Why do we tend to blame others : How much do you know about the social world? There are 10 statements. Two of the 10 statements are false, the rest are true. Which two are false?
Decentralized methods of control : Compare and contrast the hierarchical and decentralized methods of control.
Estimate a multiple linear regression : Basic Econometrics Research Report Group Assignment - Estimate a multiple linear regression with net_promoter_score as the dependent variable
Explain the contributions that teams : Explain the contributions that teams make and how managers can help teams be more effective.
Leadership and management development : Individual differences in leadership and management development: why not clone managers?
Enterprise systems for the organization : Explain why integrating organizational functions using enterprise systems for the organization is preferable/necessary.
Examples of employment or employee laws : Please assist with giving two examples of employment or employee laws that you believe were vital in changing or creating today's workplace

Reviews

len2136265

10/9/2018 10:05:16 PM

Requires STATA software. Use the software program STATA 15 available through RMIT MyDesktop for all data analysis. This is a group assignment where you can work alone or with up to three other students (a maximum group size of four). All group members will receive the same marks for the assignment.

len2136265

10/9/2018 10:05:09 PM

You must submit an electronic copy of your assignment in Canvas in pdf, doc or docx format. Hard copies will not be accepted. Show your tables and calculations as well as answering the questions in full sentences. Please make sure your tables of results are neatly formatted, not just copied and pasted from STATA, and that you write your answers in clear sentences. You should write no more than 1000 words (not including tables/calculations) in total for this assignment. The number of words, tables, graphs, calculations given in parentheses after each question are a guide.

len2136265

10/9/2018 10:05:03 PM

Rubric for marking - 1. Descriptive statistics A) Present descriptive statistics table, B) comment on descriptives, C) present and comment on graph. 2. Multiple linear regression A) Estimate regression model, B) present table, C) two predictions, D) comment on total_silence_weighted effect 3. Dummy variables A) Include dummy variables correctly, B) Comment on package1 coefficient C) Why not an important result 4. Quadratic Specification A) Include quadratic specification correctly and present results in table. B) Calculate marginal effect when sentiment_score_cust=1 C) Calculate marginal effect when sentiment_score_cust=4

len2136265

10/9/2018 10:04:56 PM

5. Conditional mean independence A) Explain conditional mean independence assumption. B) Discuss with reference to the variable "sentiment_score_cust". 6. Design model 1 A) Present tables of preliminary regressions/descriptive statistics B) Present tables of final regression results C) Discuss appropriate specification (logs/polynomials) D) Discuss appropriate specification (dummies) E) Discuss statistical significance of coefficients in model. 6. Design model 2 A) Discuss Gauss_Markov assumptions 1-3 B) Discuss Gauss_Markov assumptions 4-5 C) Prediction 1 D) Prediction 2 E) Prediction 3. 7. Neat formatting of tables. 8. Clear expression of answers in full sentences. There will be up to 5 additional marks awarded for presentation of your answers (neat formatting of tables and clear expression of answers in full sentences).

Write a Review

Econometrics Questions & Answers

  Calculate monthly returns and average monthly returns

Choose two firms from different industrial sectors, e.g. high tech computers, health, customer non durables, cyclical etc.

  What is the change in total surplus

If the quantity demanded decreases by 100 sandwiches an hour at each price, what is the equilibrium price and what is the change in total surplus?

  Natural rate of capacity utilization

The natural rate of capacity utilization is defined as the rate at which Y is zero. What is this rate for the period under the study?

  Calculate the efficient level of production

Calculate the efficient (i.e. socially optimal) level of production.

  How does a currency depreciation affect balance of payments

How does a currency depreciation affect the balance of payments? When might a currency depreciation intended to correct a current account deficit fail?

  Find the quantity of bread produced in paris

To achieve the socially optimal output, government can use a price-based intervention. Determine the ideal measure for government to use to achieve this goal. Specify both the type of policy and its magnitude.

  Calculate each project''s payback period

Calculate each project's payback period, net present value (NPV), and internal rate of return (IRR).

  Rewrite the auditor''s report in acceptable format

Consider all facts given and rewrite the auditor's report in acceptable and complete format incorporating any necessary departures form the standard unqualified report.

  How much can approximate sell of products

The price elasticity of demand for a firm's product is equal to -1.8. the firm currently sells 4,000 units per day at a price of $2. if the firm increases its product price by 10%, then how much can it approximately sell

  How will the turtle firm respond to the threat of entry

Draw a game tree like the one shown in Figure 27.10 on page 588 and predict the outcome of the game. How will the turtle firm respond to the threat of entry? Will the frog firm enter the market? c. How would your response to part (a) change if the..

  What is the marginal product of the sixth farm worker

What is the marginal product of the sixth farm worker? If, when the price of asparagus rises to $3 a bunch, the farm hires eight workers, what is the marginal product of the eighth worker?

  Which country had the smallest proportional increase

Carry out the exercise in part (c) for all the countries for which you have data. Which country has had the highest proportional increase in GDP per capita since 1970? Which country had the smallest proportional increase? What fraction of countrie..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd