Evidence of multicollinearity between independent variables

Reference no: EM131201994

A college Admissions Officer is interested in determining the extent to which a prospective student's high school GPA and/or SAT score can be used as a basis for predicting his or her college freshman GPA. He believes that prospective students who have higher high school GPAs and SAT scores will have a higher college freshman GPA. He randomly selected forty students who recently completed their freshman year and collected the information reflected in the following table.

Student No.   College Freshman GPA   High School GPA   SAT Score
1             3.56                   3.95              1383
2             2.59                   2.82              1014
3             3.09                   3.29              1217
4             3.68                   3.84              1458
5             2.91                   2.97              1157
6             2.48                   3.09              1238
7             2.34                   2.85              856
8             2.38                   2.84              880
9             2.58                   3.00              959
10            2.70                   3.06              1011
11            2.54                   2.82              959
12            2.89                   3.14              1100
13            3.02                   3.21              1156
14            3.33                   3.46              1282
15            3.16                   3.23              1227
16            2.60                   3.25              1266
17            2.65                   3.23              1291
18            3.27                   3.89              1168
19            2.89                   3.36              1041
20            2.91                   3.31              1060
21            2.85                   3.16              1044
22            3.65                   3.97              1350
23            3.54                   3.77              1319
24            3.83                   3.99              1436
25            3.05                   3.11              1150
26            3.15                   3.94              1498
27            2.35                   2.86              1117
28            3.06                   3.65              1458
29            3.25                   3.78              1134
30            3.49                   3.97              1230
31            3.02                   3.36              1075
32            2.91                   3.16              1043
33            3.48                   3.70              1258
34            3.09                   3.22              1128
35            3.66                   3.73              1343
36            2.31                   2.89              1069
37            2.49                   3.04              1154
38            2.42                   2.88              1122
39            2.78                   3.23              1292
40            2.98                   3.38              1015

1. Perform a simple linear regression with a 95% confidence level using college freshman GPA as the dependent variable and high school GPA as the independent variable, and evaluate the statistical significance of the regression model.

2. Perform a simple linear regression with a 95% confidence level using college freshman GPA as the dependent variable and SAT score as the independent variable, and evaluate the statistical significance of the regression model.

3. Compare the two simple linear regression models, select your preferred simple regression model, and explain the basis for selecting your preferred model.

4. Perform a multiple linear regression with a 95% confidence level using college freshman GPA as the dependent variable and high school GPA and SAT score as the independent variables.

a. Evaluate the statistical significance of the regression model as a whole.

b. Evaluate the statistical significance of the linear relationship between the dependent variable and each independent variable.

c. Discuss the extent to which there is evidence of multicollinearity between the independent variables.

5. Compare your preferred simple linear regression model (i.e., the regression model you selected in step 3) to the multiple linear regression model. Discuss whether the simple regression model or multiple regression model would be your overall preferred regression model, including explaining the basis for selecting your preferred model.

6. Discuss the contribution of each independent variable for your overall preferred regression model (i.e., the model you selected in step 5) to predicting the value of the dependent variable. Round the coefficients to four decimal places.

7. Discuss the range of values for the independent variable(s) for your preferred regression model (i.e., the regression model you selected in step 5) for which the regression model is valid.

8. Discuss the p-value for the coefficient for the y-intercept for your overall preferred regression model, including explaining why a p-value that is not less than or equal to α = 0.05 would not be cause for rejecting the regression model. (Hint: Consider the range of values for the independent variables associated with the given data set.)

9. Identify the regression equation for your overall preferred regression model and the degree of error associated with using the model to predict a student's college freshman GPA. Round the coefficients and degree of error to four decimal places.

10. Calculate the predicted college freshman GPA for a student with a high school GPA of 3.25 and an SAT score of 1115 using your overall preferred regression model. Round your answer to four decimal places.

11. Identify the lower and upper limits of a 95% confidence interval estimate for the predicted college freshman GPA for a student with a high school GPA of 3.25 and an SAT score of 1115 using your overall preferred regression model. Round the coefficients and your final answers to four decimal places.
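The steps above can be sketched in code. The following is a minimal illustration, not a worked answer: it uses only the Python standard library, fits both simple regressions and the two-predictor regression by ordinary least squares, and reports the statistics the steps refer to (coefficients, R-squared, adjusted R-squared, F, t statistics, standard error of the estimate, and the variance inflation factor). All function and variable names (solve, ols, mean_interval, T_CRIT_37) are hypothetical. The standard library has no t or F distribution, so p-values are not computed; the t and F statistics would need to be compared with table values or passed to a statistics package. The two-predictor model is used for the prediction purely as an example and does not imply it is the preferred model. The interval shown is for the mean response; for an individual student's prediction interval, sqrt(h) would become sqrt(1 + h).

```python
import math

# (college freshman GPA, high school GPA, SAT score) for students 1-40,
# copied from the table in the problem statement.
DATA = [
    (3.56, 3.95, 1383), (2.59, 2.82, 1014), (3.09, 3.29, 1217), (3.68, 3.84, 1458),
    (2.91, 2.97, 1157), (2.48, 3.09, 1238), (2.34, 2.85, 856), (2.38, 2.84, 880),
    (2.58, 3.00, 959), (2.70, 3.06, 1011), (2.54, 2.82, 959), (2.89, 3.14, 1100),
    (3.02, 3.21, 1156), (3.33, 3.46, 1282), (3.16, 3.23, 1227), (2.60, 3.25, 1266),
    (2.65, 3.23, 1291), (3.27, 3.89, 1168), (2.89, 3.36, 1041), (2.91, 3.31, 1060),
    (2.85, 3.16, 1044), (3.65, 3.97, 1350), (3.54, 3.77, 1319), (3.83, 3.99, 1436),
    (3.05, 3.11, 1150), (3.15, 3.94, 1498), (2.35, 2.86, 1117), (3.06, 3.65, 1458),
    (3.25, 3.78, 1134), (3.49, 3.97, 1230), (3.02, 3.36, 1075), (2.91, 3.16, 1043),
    (3.48, 3.70, 1258), (3.09, 3.22, 1128), (3.66, 3.73, 1343), (2.31, 2.89, 1069),
    (2.49, 3.04, 1154), (2.42, 2.88, 1122), (2.78, 3.23, 1292), (2.98, 3.38, 1015),
]
college, hs_gpa, sat = (list(col) for col in zip(*DATA))


def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))  # partial pivoting
        m[c], m[p] = m[p], m[c]
        for r in range(n):
            if r != c:
                f = m[r][c] / m[c][c]
                m[r] = [x - f * y for x, y in zip(m[r], m[c])]
    return [m[i][n] / m[i][i] for i in range(n)]


def ols(y, *xs):
    """Ordinary least squares of y on an intercept plus the predictors xs."""
    n, p = len(y), len(xs) + 1
    rows = [[1.0] + [x[i] for x in xs] for i in range(n)]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(p)]
    beta = solve(xtx, xty)
    # Columns of (X'X)^-1, obtained by solving against each unit vector.
    inv = [solve(xtx, [float(i == j) for i in range(p)]) for j in range(p)]
    fitted = [sum(b * v for b, v in zip(beta, r)) for r in rows]
    y_bar = sum(y) / n
    sse = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    sst = sum((yi - y_bar) ** 2 for yi in y)
    df = n - p
    s = math.sqrt(sse / df)  # standard error of the estimate
    se = [s * math.sqrt(inv[j][j]) for j in range(p)]
    return {
        "beta": beta, "se": se, "t": [b / e for b, e in zip(beta, se)],
        "r2": 1 - sse / sst, "adj_r2": 1 - (sse / df) / (sst / (n - 1)),
        "f": ((sst - sse) / (p - 1)) / (sse / df), "s": s, "df": df, "inv": inv,
    }


def mean_interval(fit, x0, t_crit):
    """Point prediction and confidence interval for the mean response at x0."""
    v = [1.0] + list(x0)
    k = len(v)
    y_hat = sum(b * xi for b, xi in zip(fit["beta"], v))
    h = sum(v[i] * fit["inv"][i][j] * v[j] for i in range(k) for j in range(k))
    half = t_crit * fit["s"] * math.sqrt(h)
    return y_hat, y_hat - half, y_hat + half


simple_hs = ols(college, hs_gpa)
simple_sat = ols(college, sat)
multi = ols(college, hs_gpa, sat)
# With two predictors, VIF = 1 / (1 - r^2), r being their pairwise correlation.
vif = 1 / (1 - ols(hs_gpa, sat)["r2"])

for name, fit in [("HS GPA", simple_hs), ("SAT", simple_sat), ("both", multi)]:
    print(name, [round(b, 4) for b in fit["beta"]], "R2 =", round(fit["r2"], 4),
          "adj R2 =", round(fit["adj_r2"], 4), "F =", round(fit["f"], 2),
          "t =", [round(t, 2) for t in fit["t"]], "s =", round(fit["s"], 4))
print("VIF =", round(vif, 2))

# Assumption: 2.0262 is the two-tailed 5% critical value of t with 37 degrees
# of freedom (n - 3 for the two-predictor model), taken from a standard t-table.
T_CRIT_37 = 2.0262
pred, lo, hi = mean_interval(multi, (3.25, 1115), T_CRIT_37)
print("prediction:", round(pred, 4), "95% CI for mean:", round(lo, 4), round(hi, 4))
```

If a simple model is preferred instead, the same mean_interval call would take that fit, a one-element x0, and the critical value for 38 degrees of freedom. The normal-equations approach is chosen here only to keep the sketch dependency-free; a statistics package would normally be used in practice.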




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd