Explain a substantial amount of the variation in salaries

Reference no: EM131202746

Linear Regression and Multiple Regression

Question 1

Baseball is a sport that generates a lot of data, which fans use to try to identify the factors that lead to successful teams. One fan compiled the team batting average and the team percentage of games won for the 14 American League teams at the end of a recent season. The presumption is that a team with a higher batting average should win more games. Supposing that these data represent a random collection of observations of these two measures, let's explore whether batting average can predict winning percentage. The data are stored in the file Baseball.xls.

a) Plot the data, and comment on what you observe.

b) Find the correlation coefficient.

c) Find the coefficient of determination, R², for these data, and interpret its meaning.

d) Find the sample regression line, and interpret the meaning of the coefficients of your equation.

e) Is there evidence, at the 5% level of significance, that batting average can be used to predict winning percentage? Explain.
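Parts b) through e) all come from the same least-squares quantities, so they can be computed together. The sketch below is one possible way to do that in plain Python; the function name `simple_ols` and the sample values are assumptions for illustration only and are not taken from Baseball.xls. For the real data (14 teams, so 12 degrees of freedom) the two-sided 5% critical value of t is about 2.179.

```python
import math

def simple_ols(x, y):
    """Fit y = b0 + b1*x by least squares and return summary statistics."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)              # correlation coefficient
    sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    se_slope = math.sqrt(sse / (n - 2) / sxx)   # standard error of the slope
    return {
        "intercept": intercept,
        "slope": slope,
        "r": r,
        "r_squared": r * r,                     # share of variation explained
        "se_slope": se_slope,
        "t_slope": slope / se_slope,            # test statistic for H0: slope = 0
        "df": n - 2,
    }

# Placeholder values for illustration only, NOT the contents of Baseball.xls.
batting_avg = [0.250, 0.255, 0.260, 0.265, 0.270, 0.275]
win_pct = [0.440, 0.480, 0.470, 0.520, 0.530, 0.560]

fit = simple_ols(batting_avg, win_pct)
T_CRIT_DF4 = 2.776  # two-sided 5% critical value of t for 4 degrees of freedom
print(fit)
print("slope significant at 5%:", abs(fit["t_slope"]) > T_CRIT_DF4)
```

The slope is the estimated change in winning percentage per unit change in batting average, and the slope is judged significant when the absolute t statistic exceeds the critical value for n - 2 degrees of freedom.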

Question 2

Physicians are recommending more exercise for patients, especially those who are overweight. One benefit of regular exercise is thought to be a reduction of bad cholesterol. To study the relationship, a doctor selected a sample of patients who did not do regular exercise, and measured their cholesterol level. She then started the patients on a program of exercise, and asked them to record the number of minutes per week that they exercised. After 4 months, she re-measured their cholesterol levels. The data are contained in the file Cholesterol.xls.

a) Plot the data. Does it appear that the amount of exercise and the change in cholesterol level are related?

b) Determine the regression equation relating cholesterol reduction to amount of exercise, and find a 95% confidence interval for the coefficient of the independent variable (exercise). Provide a brief and meaningful written interpretation of the coefficients and the confidence interval.

c) Can we conclude that exercise affects the change in cholesterol level of the exerciser?

d) How well does the linear model fit this data? Justify your response using the regression output.
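For parts b) and c), the confidence interval for the exercise coefficient is the slope plus or minus a t critical value times the slope's standard error. The following is a rough sketch under stated assumptions: the column layout (before, after, minutes) and every number shown are hypothetical and not taken from Cholesterol.xls, and the critical value must be looked up for n - 2 degrees of freedom.

```python
import math

def slope_confidence_interval(x, y, t_crit):
    """Return (slope, (lower, upper)) for the slope of y on x.

    t_crit is the two-sided critical value of t for len(x) - 2
    degrees of freedom at the chosen confidence level.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    se_slope = math.sqrt(sse / (n - 2) / sxx)
    margin = t_crit * se_slope
    return slope, (slope - margin, slope + margin)

# Hypothetical measurements for illustration only, NOT from Cholesterol.xls.
before = [240, 225, 260, 230, 250, 245]
after = [235, 210, 238, 222, 228, 215]
minutes = [60, 150, 210, 90, 240, 300]

# Cholesterol reduction is the drop from the first to the second measurement.
reduction = [b - a for b, a in zip(before, after)]

T_CRIT_DF4 = 2.776  # two-sided 95% critical value of t for 4 degrees of freedom
slope, (low, high) = slope_confidence_interval(minutes, reduction, T_CRIT_DF4)
print("slope:", slope, "95% CI:", (low, high))
print("interval excludes zero:", low > 0 or high < 0)
```

If the interval excludes zero, the data support a linear association between exercise and cholesterol reduction at the 5% level; whether that association is causal depends on the study design, not on the regression output.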

Question 3

Hardwood trees are harvested in a selective manner for the manufacture of fine furniture. Environmental groups want as few trees as possible to be selected for cutting, while companies feel that they need a certain amount of wood for manufacturing. To help each group predict the volume of lumber in a selected tree, various measurements are made before the tree is cut. Unfortunately, volume is not easily determined before harvesting.

Two common measurements made before cutting down the tree are DBH (the diameter of the tree at breast height, 4.5 feet off the ground) and the height of the tree measured with sighting instruments. After the tree is harvested the volume of lumber may be measured.

Both groups believe that a regression model relating volume to diameter and/or height will be helpful. The data file below gives the diameters, heights, and volumes of 31 trees harvested in the Allegheny National Forest in Pennsylvania.

The data are contained in the file Wood.xlsx.

a) Estimate the two simple regression models and the multiple regression model that are appropriate for these data based upon the description above. This will require you to conduct three separate regressions. DO NOT try to use any other model for the volume estimate. There are other models available but for the purpose of this assignment you are limited to the models described above.

b) Which of the three models would you recommend that the two groups use? Why? HINT: you should analyze the correlation matrix to answer this question.

c) A specific tree with a height of 62 feet and a diameter of 17.9 inches has just arrived at the mill. Do you believe your model is appropriate for making a point estimate for this tree? What volume can be expected? Build a 95% confidence interval for the volume of this tree.
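The multiple regression in part a) and the appropriateness question in part c) can be sketched as follows. This is a minimal illustration that solves the normal equations directly and checks whether a new tree lies inside the range of the observed predictors; the helper names (`fit_ols`, `predict`, `within_observed_range`) and all tree measurements are assumptions and do not come from Wood.xlsx. The confidence interval itself is left out here because it also needs the residual standard error and the inverse of the cross-product matrix.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def fit_ols(rows, y):
    """Least-squares fit; returns [intercept, b1, b2, ...]."""
    design = [[1.0] + list(r) for r in rows]
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)

def predict(coef, point):
    """Point estimate for one observation."""
    return coef[0] + sum(b * v for b, v in zip(coef[1:], point))

def within_observed_range(rows, point):
    """True if every predictor value lies inside the range seen in the data."""
    return all(
        min(r[j] for r in rows) <= point[j] <= max(r[j] for r in rows)
        for j in range(len(point))
    )

# Made-up trees for illustration only, NOT from Wood.xlsx: [DBH in inches, height in feet].
trees = [[9.0, 70], [11.0, 74], [13.0, 83], [14.5, 78], [17.0, 81], [20.0, 88]]
volumes = [11.0, 18.0, 32.0, 35.0, 52.0, 75.0]

coef = fit_ols(trees, volumes)
new_tree = [17.9, 62]
print("coefficients:", coef)
print("point estimate:", predict(coef, new_tree))
print("inside observed range:", within_observed_range(trees, new_tree))
```

A prediction for a tree whose height or diameter falls outside the observed range is an extrapolation, which is the main thing to check before trusting the point estimate.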

Question 4

Lotteries are important sources of revenue for governments and charities. Many people have criticized lotteries, however, as taxes on the poor and uneducated. To explore the issue, a sample of 100 adults was asked how much they spend on lottery tickets, and a number of socio-economic variables were also recorded. The study was meant to test the following beliefs:

I. Relatively uneducated people spend more on lotteries than do educated people.

II. Older people spend more on lotteries than do younger people.

III. People with more children spend more than people with fewer children.

IV. Relatively poor people spend a greater proportion of their income on lotteries than the better off.

The file Lottery.xls contains data for the 100 respondents on the amount spent on lottery tickets as a percentage of household income, number of years of education, age, number of children and personal income (in thousands of dollars).

a) Ignoring any collinearity issues, develop a single multiple regression model relating lottery expenditures to all of the independent variables described above and test the four beliefs listed above at the 95% confidence level. What would be the appropriate conclusions from this analysis?

b) Now check the correlation matrix for this problem. Describe any concerns this matrix raises regarding the usefulness of the independent variables and the relationships between them. Without actually creating a new model, describe which independent variables you would use to improve the quality of the model compared to the full model used in part a).
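The correlation-matrix check in part b) amounts to computing pairwise Pearson correlations and looking for pairs of independent variables that are strongly related to each other. The sketch below is one way to do this; the column names, the values, and the 0.7 cutoff are all assumptions (the cutoff is only a common rule of thumb), and nothing here is taken from Lottery.xls.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy / math.sqrt(sxx * syy)

def correlation_matrix(columns):
    """columns maps a variable name to its list of values."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}

def flag_collinear_pairs(matrix, threshold=0.7):
    """List (name1, name2, r) for distinct pairs with |r| at or above threshold."""
    names = list(matrix)
    return [
        (a, b, matrix[a][b])
        for i, a in enumerate(names)
        for b in names[i + 1:]
        if abs(matrix[a][b]) >= threshold
    ]

# Hypothetical respondents for illustration only, NOT from Lottery.xls.
predictors = {
    "education": [10, 12, 12, 14, 16, 18],
    "age": [55, 48, 60, 35, 41, 30],
    "children": [3, 2, 4, 1, 2, 0],
    "income": [22, 30, 28, 45, 60, 75],
}

matrix = correlation_matrix(predictors)
for a, b, r in flag_collinear_pairs(matrix):
    print(f"{a} vs {b}: r = {r:.2f}")
```

When two independent variables are flagged, their individual coefficients and t statistics in the full model become hard to interpret, which is the usual argument for keeping only one of them.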

Question 5

A large corporation was recently accused of discriminating against female managers. A random sample of 100 managers from the firm found that the mean annual salary of the 38 female managers in the sample was $76,189, and the mean annual salary of the 62 male managers was $97,832. This looks like pretty damning evidence of discrimination. The CEO of the corporation was indignant, claiming that the firm followed a strict policy of equal pay for equal work, and that maybe some other factor or factors were responsible for the perceived differences. He has asked you to look into this, and you were able to find the number of years of education and years of experience for each member of the sample. These data are contained in the file Discrimination.xls, which records the member's gender/sex as a 0 for males and a 1 for females.

a) Do these data taken as a whole explain a substantial amount of the variation in salaries among these managers?

b) What is your best estimate of the systematic difference between male and female salaries?

c) Does it appear that gender/sex is a significant factor in the differences in salary in this sample?
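To see how the 0/1 coding is interpreted, it may help to note that regressing salary on the gender dummy alone reproduces the raw difference in group means, which from the figures above is 76,189 - 97,832 = -21,643 dollars. The sketch below illustrates that identity with invented salaries; the function name `dummy_slope` and the toy values are assumptions, not data from Discrimination.xls.

```python
def dummy_slope(dummy, y):
    """OLS slope of y on a 0/1 dummy variable (no other predictors)."""
    n = len(y)
    mean_d = sum(dummy) / n
    mean_y = sum(y) / n
    sdd = sum((d - mean_d) ** 2 for d in dummy)
    sdy = sum((d - mean_d) * (yi - mean_y) for d, yi in zip(dummy, y))
    return sdy / sdd

# Raw gap implied by the sample means quoted in the question (female minus male).
RAW_GAP = 76_189 - 97_832
print("raw difference in mean salary:", RAW_GAP)

# Invented salaries in thousands, for illustration only: 0 = male, 1 = female.
gender = [0, 0, 1, 1, 1]
salary = [100, 90, 80, 70, 75]

mean_male = sum(s for g, s in zip(gender, salary) if g == 0) / gender.count(0)
mean_female = sum(s for g, s in zip(gender, salary) if g == 1) / gender.count(1)

print("slope on dummy:", dummy_slope(gender, salary))
print("difference in group means:", mean_female - mean_male)
```

Once education and experience are added as further independent variables, the coefficient on the dummy becomes the estimated salary difference between women and men with the same education and experience, which is the quantity parts b) and c) ask about.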

Attachment:- Data.rar






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd