Estimate a multiple linear regression

Assignment Help Econometrics
Reference no: EM132136265

Basic Econometrics Research Report Group Assignment -

This assignment uses data from the BUPA health insurance call centre. Each observation includes data from one call to the call centre. The variables describe several characteristics of the call (eg the length of the call, the amount of silence in the call), characteristics of the customer (eg state of residence, family type, number of adults and children), and measures of performance (eg net promoter score, sentiment score of the customer). In this assignment we are interested in predicting the net promoter score and the length of the call.

Please use the dataset CallCentre.dta and associated information file CC_DEFINITIONS_.XLSX to answer these questions. Use the software program STATA 15 available through RMIT MyDesktop for all data analysis. This is a group assignment where you can work alone or with up to three other students (a maximum group size of four). All group members will receive the same marks for the assignment. You must submit an electronic copy of your assignment in Canvas in pdf, doc or docx format. Hard copies will not be accepted. Show your tables and calculations as well as answering the questions in full sentences. Please make sure your tables of results are neatly formatted, not just copied and pasted from STATA, and that you write your answers in clear sentences. You should write no more than 1000 words (not including tables/calculations) in total for this assignment. The number of words, tables, graphs, calculations given in parentheses after each question are a guide.

1. Calculate descriptive statistics using the 'summarize' command for the variables net_promoter_score, total_silence, total_silence_weighted, agent_to_cust_index and agent_crosstalk_weighted and present the results in a table. Comment on what we learn about these variables from the descriptives. Graph a scatter plot of net_promoter_score against agent_crosstalk_weighted and describe the relationship between these two variables. (100 words, 1 table, 1 graph)

2. Estimate a multiple linear regression with net_promoter_score as the dependent variable and total_silence_weighted, agent_to_cust_index and agent_crosstalk_weighted as the explanatory (independent) variables. Predict the change in net_promoter_score associated with a 0.1 increase in total_silence_weighted and a 0.01 increase in agent_crosstalk_weighted. Assuming this is the correct model specification, are we sure that total_silence_weighted has a negative effect? [Hint: consider the t-statistic and p-value] (50 words, 1 table, 2 calculations)

3. Add dummy variables to the regression to control for all of the potential effects of State and Package. Make sure the base category is customers with the "HOSPITAL AND EXTRAS" package in NSW. Carefully interpret the estimated coefficient on the package1 dummy variable you have included. Why is this NOT a very important result? [Hint: Use the variable labels to include and interpret the correct variables, consider the descriptive statistics of the dummy variables to interpret their importance] (50 words, 1 table)

4. Include a quadratic specification of the variable "sentiment_score_cust" in the model along with the existing explanatory variables. Calculate and interpret the marginal effect of a 1 point change in "sentiment_score_cust" when sentiment_score_cust = 1 and when sentiment_score_cust=4. (50 words, 1 table, 2 calculations)

5. Explain the conditional mean independence assumption and assess its relevance with respect to the explanatory variable "sentiment_score_cust". [Hint: Think about factors that may be included in the error term of the regression: the customer's experience with the company (positive or negative), the general attitude of the customer towards call centre conversations (positive or negative) and whether these may be correlated with sentiment_score_cust] (100 words)

6. As agent time is a cost to their business, BUPA may also be interested in predicting lcall_duration (the natural log of call_duration). Design a regression model to predict lcall_duration. Choose the explanatory variables to include, and whether to include them as dummies/ logs/ polynomials/ interactions as you feel appropriate. Present the results of the descriptive statistics and your final regression model in tables. Discuss the statistical significance of the explanatory variables in your model. Discuss how you have designed your model with reference to the "Gauss Markov" assumptions and whether these assumptions are likely to be met. Interpret the results of THREE of your explanatory variables, which you consider to be the key drivers of lcall_duration (ie the length of the call). Do NOT include the variables net_promoter_score, nps_group3, sentiment_score_cust, call_duration or call_durationsq in your model. (400 words, 2 tables, 3 calculations).

Reference no: EM132136265

Questions Cloud

Compensation and benefits package they want : What do millennials need to consider to get the compensation and benefits package they want?
Reporting of quality performance : Discuss the organizations involved in public reporting of quality performance data for healthcare organizations.
Why do we tend to blame others : How much do you know about the social world? There are 10 statements. Two of the 10 statements are false, the rest are true. Which two are false?
Decentralized methods of control : Compare and contrast the hierarchical and decentralized methods of control.
Estimate a multiple linear regression : Basic Econometrics Research Report Group Assignment - Estimate a multiple linear regression with net_promoter_score as the dependent variable
Explain the contributions that teams : Explain the contributions that teams make and how managers can help teams be more effective.
Leadership and management development : Individual differences in leadership and management development: why not clone managers?
Enterprise systems for the organization : Explain why integrating organizational functions using enterprise systems for the organization is preferable/necessary.
Examples of employment or employee laws : Please assist with giving two examples of employment or employee laws that you believe were vital in changing or creating today's workplace

Reviews

len2136265

10/9/2018 10:05:16 PM

Requires STATA software. Use the software program STATA 15 available through RMIT MyDesktop for all data analysis. This is a group assignment where you can work alone or with up to three other students (a maximum group size of four). All group members will receive the same marks for the assignment.

len2136265

10/9/2018 10:05:09 PM

You must submit an electronic copy of your assignment in Canvas in pdf, doc or docx format. Hard copies will not be accepted. Show your tables and calculations as well as answering the questions in full sentences. Please make sure your tables of results are neatly formatted, not just copied and pasted from STATA, and that you write your answers in clear sentences. You should write no more than 1000 words (not including tables/calculations) in total for this assignment. The number of words, tables, graphs, calculations given in parentheses after each question are a guide.

len2136265

10/9/2018 10:05:03 PM

Rubric for marking - 1. Descriptive statistics A) Present descriptive statistics table, B) comment on descriptives, C) present and comment on graph. 2. Multiple linear regression A) Estimate regression model, B) present table, C) two predictions, D) comment on total_silence_weighted effect 3. Dummy variables A) Include dummy variables correctly, B) Comment on package1 coefficient C) Why not an important result 4. Quadratic Specification A) Include quadratic specification correctly and present results in table. B) Calculate marginal effect when sentiment_score_cust=1 C) Calculate marginal effect when sentiment_score_cust=4

len2136265

10/9/2018 10:04:56 PM

5. Conditional mean independence A) Explain conditional mean independence assumption. B) Discuss with reference to the variable "sentiment_score_cust". 6. Design model 1 A) Present tables of preliminary regressions/descriptive statistics B) Present tables of final regression results C) Discuss appropriate specification (logs/polynomials) D) Discuss appropriate specification (dummies) E) Discuss statistical significance of coefficients in model. 6. Design model 2 A) Discuss Gauss_Markov assumptions 1-3 B) Discuss Gauss_Markov assumptions 4-5 C) Prediction 1 D) Prediction 2 E) Prediction 3. 7. Neat formatting of tables. 8. Clear expression of answers in full sentences. There will be up to 5 additional marks awarded for presentation of your answers (neat formatting of tables and clear expression of answers in full sentences).

Write a Review

Econometrics Questions & Answers

  How price controls present a problem for measuring gdp

Carefully explain how price controls present a problem for measuring GDP and for measuring the price level and inflation.

  Empirical results on the heckscher-ohlin model

Explain how this would affect the concept of factor-price equalization.

  Where does opportunity cost enter the picture

The house can be purchased for $200,000, and the tenant has this much money in a bank account that pays 4 percent interest per year. Is buying the house a good deal for the tenant? Where does opportunity cost enter the picture?

  How to imply is the minimum elasticity of demand

We defined the Lerner Index LI = 1/-e where e is the elasticity of demand. We also showed that LI can be alternatively expressed as (P-MC)/P . Use these relationships to show that LI can never exceed 1. What does this imply is the minimum elasticity ..

  Determine how many unit will t produce

consider a monopolist facing the market demand p=100-2q. Marginal cost equals to 10; How many unit will t produce At what price Compute and identify in a graph: monoplist profit, consumer surplus and deadweith loss.

  Explain this tendency of industrial clusters to break up

Explain this tendency of industrial clusters to break up in terms of the theory of external economies.

  1 consider a cobb-douglas utiltiy function of the form u

1. consider a cobb-douglas utiltiy function of the form u xyz xyz with three consumer goods x y and zset up the

  Why does it usually first experience increasing returns

why does it usually first experience increasing returns to scale?

  Whether you are witnessing a j-curve effect

What other macroeconomic change might bring about a currency depreciation coupled with a deterioration of the current account, even if there is no J-curve?

  How to collect data on the customers shopping behavior

Do you believe it is proper for businesses such as internet service providers to collect data on their customers shopping behavior and personal profiles Under what situations, if any, is it proper or improper  Should all companies collecting custo..

  Which set of data illustrates aggregate supply

GDP Price Level Real GDP Price Level Real GDP 110 275 100 200 110 225 100 250 100 225 100 225 95 225 100 250 95 225 90 200 100 275 90 225 a. Which set of data illustrates aggregate supply in the immediate short-run in North Vaudeville

  Determine predicted quantity demanded

A multiple regression analysis based on a information set that consists of thrity observations yielded the following estimated demand equation:

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd