What is the value of the chi square statistic

Assignment Help Basic Statistics
Reference no: EM13996948

Question:

In the Week Two Assignment sheet, complete the problems below and submit your work in an Excel document. Be sure to show all of your work and clearly label all calculations. All statistical calculations will use the Employee Salary Data Set and Weekly Assignment Sheet.

The ongoing question that the weekly assignments will focus on is: Are males and females paid the same for equal work (under the Equal Pay Act)?

Note: to simplfy the analysis, we will assume that jobs within each grade comprise equal work.

The column labels in the table mean:

ID - Employee sample number Salary - Salary in thousands

Age - Age in years Performance Rating - Appraisal rating (employee evaluation score)

Service - Years of service (rounded) Gender - 0 = male, 1 = female

Midpoint - salary grade midpoint Raise - percent of last raise

Grade - job/pay grade Degree (0= BS\BA 1 = MS)

Gender1 (Male or Female) Compa - salary divided by midpoint

(Note: Questions 1- 4 have additional elements to respond to below the analysis results and included in the Week Two Assignment sheet are 2 one-sample t-tests comparing male and female average salaries to the overall sample mean.)

Week 1. Measurement and Description

The goal this week is to gain an understanding of our data set - what kind of data we are looking at, some descriptive measurse, and a
look at how the data is distributed (shape).

1. Measurement issues. Data, even numerically coded variables, can be one of 4 levels - nominal, ordinal, interval, or ratio. It is important to identify which level a variable is, as this impact the kind of analysis we can do with the data. For example, descriptive statistics such as means can only be done on interval or ratio level data.

Please list under each label, the variables in our data set that belong in each group.

Nominal Ordinal

b. For each variable that you did not call ratio, why did you make that decision?

2. The first step in analyzing data sets is to find some summary descriptive statistics for key variables.

For salary, compa, age, performance rating, and service; find the mean, standard deviation, and range for 3 groups: overall sample, Females, and Males.

You can use either the Data Analysis Descriptive Statistics tool or the Fx =average and =stdev functions.

(the range must be found using the difference between the =max and =min functions with Fx) functions.

Note: Place data to the right, if you use Descriptive statistics, place that to the right as well.  Some of the values are completed for you - please finish the table.



Salary Compa Age Perf. Rat. Service
Overall Mean

35.7 85.9 9.0

Standard Deviation

8.2513 11.4147 5.7177

Range

30 45 21
Female Mean

32.5 84.2 7.9

Standard Deviation

6.9 13.6 4.9

Range

26.0 45.0 18.0
Male Mean

38.9 87.6 10.0

Standard Deviation

8.4 8.7 6.4

Range

28.0 30.0 21.0

3 What is the probability for a:

a. Randomly selected person being a male in grade E?

b. Randomly selected male being in grade E?

Note part b is the same as given a male, what is probabilty of being in grade E?

c. Why are the results different?

4. A key issue in comparing data sets is to see if they are distributed/shaped the same. We can do this by looking at some measures of where some selected values are within each data set - that is how many values are above and below a comparable value.

For each group (overall, females, and males) find:

A The value that cuts off the top 1/3 salary value in each group
i The z score for this value within each group?
ii The normal curve probability of exceeding this score:
iii What is the empirical probability of being at or exceeding this salary value?

B The value that cuts off the top 1/3 compa value in each group.
i The z score for this value within each group?
ii The normal curve probability of exceeding this score:
iii What is the empirical probability of being at or exceeding this compa value?

C How do you interpret the relationship between the data sets? What do they mean about our equal pay for equal work question?

5. What conclusions can you make about the issue of male and female pay equality? Are all of the results consistent? What is the difference between the sal and compa measures of pay?

Conclusions from looking at salary results:

Conclusions from looking at compa results:

Do both salary measures show the same results?

Can we make any conclusions about equal pay for equal work yet?

Week 2. Testing means - T-tests

1 Below are 2 one-sample t-tests comparing male and female average salaries to the overall sample mean.

(Note: a one-sample t-test in Excel can be performed by selecting the 2-sample unequal variance t-test and making the second variable = Ho value - a constant.)

Note: These values are not the same as the data the assignment uses. The purpose is to analyze the results of t-tests rather than directly answer our equal pay question.

Based on these results, how do you interpret the results and what do these results suggest about the population means for male and female average salaries?

Males


Females
Ho: Mean salary = 45.00

Ho: Mean salary =
Ha: Mean salary =/= 45.00

Ha: Mean salary =/=

Note: While the results both below are actually from Excel's t-Test: Two-Sample Assuming Unequal Variances, having no variance in the Ho variable makes the calculations default to the one-sample t-test outcome - we are tricking Excel into doing a one sample test for us.

  Male Ho
Mean 52 45
Variance 316 0
Observations 25 25
Hypothesized Mean Difference 0
df 24
t Stat 1.968903827
P(T<=t) one-tail 0.03030785
t Critical one-tail 1.71088208
P(T<=t) two-tail 0.060615701
t Critical two-tail 2.063898562  

  Female Ho
Mean 38 45
Variance 334.667 0
Observations 25 25
Hypothesized Mean Difference 0
df 24
t Stat -1.91321
P(T<=t) one-tail 0.03386
t Critical one-tail 1.71088
P(T<=t) two-tail 0.06772
t Critical two-tail 2.0639  

Note: the Female results are done for you, please complete the male results.

Is this a 1 or 2 tail test? Is this a 1 or 2 tail test?
- why? - why?

P-value is: P-value is:

Is P-value < 0.05 (one tail test) or 0.25 (two tail test)? Is P-value < 0.05 (one tail test) or 0.25 (two tail test)?

Why do we not reject the null hypothesis? Why do we not reject the null hypothesis?

2 Based on our sample data set, perform a 2-sample t-test to see if the population male and female average salaries could be equal to each other.

(Since we have not yet covered testing for variance equality, assume the data sets have statistically equal variances.)

Ho: Male salary mean = Female salary mean

Ha: Male salary mean =/= Female salary mean

Test to use: t-Test: Two-Sample Assuming Equal Variances

b. Is the one or two sample t-test the proper/correct apporach to comparing salary equality? Why?

3 Based on our sample data set, can the male and female compas in the population be equal to each other? (Another 2-sample t-test.)
Again, please assume equal variances for these groups.
Ho:
Ha:
Statistical test to use:

What is the p-value:
Is P-value < 0.05 (one tail test) or 0.25 (two tail test)?
Reject or do not reject Ho:
If the null hypothesis was rejected, calculate the effect size value:
If calculated, what is the meaning of effect size measure:

Interpretation:

4 Since performance is often a factor in pay levels, is the average Performance Rating the same for both genders?

NOTE: do NOT assume variances are equal in this situation.

5 If the salary and compa mean tests in questions 2 and 3 provide different results about male and female salary equality, which would be more appropriate to use in answering the question about salary equity? Why?

What are your conclusions about equal pay at this point?

Week 3 Paired T-test and ANOVA

1 Many companies consider the grade midpoint to be the "market rate" - the salary needed to hire a new employee.

Does the company, on average, pay its existing employees at or above the market rate?

Use the data columns at the right to set up the paired data set for the analysis.

Null Hypothesis:
Alt. Hypothesis:

Statistical test to use:

What is the p-value:

Is P-value < 0.05 (one tail test) or 0.25 (two tail test)?

What else needs to be checked on a 1-tail test in order to reject the null?

Do we REJ or Not reject the null?

If the null hypothesis was rejected, what is the effect size value:

If calculated, what is the meaning of effect size measure:

Interpretation of test results:

Let's look at some other factors that might influence pay - education(degree) and performance ratings.

2 Last week, we found that average performance ratings do not differ between males and females in the population.

Now we need to see if they differ among the grades. Is the average performace rating the same for all grades?

(Assume variances are equal across the grades for this ANOVA.)

The rating values sorted by grade have been placed in columns I - N for you.

Null Hypothesis: Ho: means equal for all grades

Alt. Hypothesis: Ha: at least one mean is unequal

Place B17 in Outcome range box.

Here are the data values sorted by grade level.
A B C D E F
90 80 100 90 85 70
80 75 100 65 100 100
100 80 90 75 95 95
90 70 80 90 55 95
80 95 80 95 90 95
85 80

95 95
65 90

90
70


75
95


95
60


90
90


95
75


80
95




90




100




Interpretation of test results:

What is the p-value:

Is P-value < 0.05?

Do we REJ or Not reject the null?

If the null hypothesis was rejected, what is the effect size value (eta squared):

Meaning of effect size measure:

What does that decision mean in terms of our equal pay question:

3. While it appears that average salaries per each grade differ, we need to test this assumption.

Is the average salary the same for each of the grade levels?

Use the input table to the right to list salaries under each grade level.

(Assume equal variance, and use the analysis toolpak function ANOVA.)

Null Hypothesis:

Alt. Hypothesis:

4. The table and analysis below demonstrate a 2-way ANOVA with replication. Please interpret the results.


BA MA
Male 1.017 1.157

0.870 0.979

1.052 1.134

1.175 1.149

1.043 1.043

1.074 1.134

1.020 1.000

0.903 1.122

0.982 0.903

1.086 1.052

1.075 1.140

1.052 1.087
Female 1.096 1.050

1.025 1.161

1.000 1.096

0.956 1.000

1.000 1.041

1.043 1.043

1.043 1.119

1.210 1.043

1.187 1.000

1.043 0.956

1.043 1.129

1.145 1.149

Note: These values are not the same as the data the assignment uses. The purpose of this question is to analyze the result of a 2-way ANOVA test rather than directly answer our equal pay question.

Interpretation:

For Ho: Average compas by gender are equal Ha: Average compas by gender are not equal

What is the p-value:

Is P-value < 0.05?

Do you reject or not reject the null hypothesis:

If the null hypothesis was rejected, what is the effect size value (eta squared):

Meaning of effect size measure:

For Ho: Average compas are equal for all degrees Ha: Average compas are not equal for all grades

What is the p-value:

Is P-value < 0.05?

Do you reject or not reject the null hypothesis:

If the null hypothesis was rejected, what is the effect size value (eta squared):

Meaning of effect size measure:

For: Ho: Interaction is not significant Ha: Interaction is significant

What is the p-value:

Is P-value < 0.05?

Do you reject or not reject the null hypothesis:

If the null hypothesis was rejected, what is the effect size value (eta squared):

Meaning of effect size measure:

What do these three decisions mean in terms of our equal pay question:

5. Using the results up thru this week, what are your conclusions about gender equal pay for equal work at this point?

Week 4 Confidence Intervals and Chi Square

1 Using our sample data, construct a 95% confidence interval for the population's mean salary for each gender.

Interpret the results.
Mean St error t value Low to High
Males
Females
<Reminder: standard error is the sample standard deviation divided by the square root of the sample size.>
Interpretation:

2 Using our sample data, construct a 95% confidence interval for the mean salary difference between the genders in the population.
How does this compare to the findings in week 2, question 2?
Difference St Err. T value Low to High

Yes/No

Can the means be equal? Why?
How does this compare to the week 2, question 2 result (2 sampe t-test)?

a. Why is using a two sample tool (t-test, confidence interval) a better choice than using 2 one-sample techniques when comparing two samples?

3 We found last week that the degree values within the population do not impact compa rates.

This does not mean that degrees are distributed evenly across the grades and genders.

Do males and females have athe same distribution of degrees by grade?

(Note: while technically the sample size might not be large enough to perform this test, ignore this limitation for this exercise.)
Ignore any cell size limitations.

What are the hypothesis statements:
Ho:
Ha:

Note: You can either use the Excel Chi-related functions or do the calculations manually.

Data InTables The Observed Table is completed for you.

OBSERVED B C D E F Total
M Grad 1 1 1 1 5 3 12
Fem Grad 5 3 1 1 1 2 13
Male Und 2 2 2 1 5 1 13
Female Und 7 1 1 2 1 0 12

15 7 5 5 12 6 50

Interpretation:

4 Based on our sample data, can we conclude that males and females are distributed across grades in a similar pattern
within the population?
What are the hypothesis statements:
Ho:
Ha:

What is the value of the chi square statistic:

What is the p-value associated with this value:

Is the p-value <0.05?

Do you reject or not reject the null hypothesis:

If you rejected the null, what is the Phi correlation:

If calculated, what is the meaning of effect size measure:

What does this decision mean for our equal pay question:

5. How do you interpret these results in light of our question about equal pay for equal work?

Week 5 Correlation and Regression

1. Create a correlation table for the variables in our data set. (Use analysis ToolPak or StatPlus:mac LE function Correlation.)

a. Reviewing the data levels from week 1, what variables can be used in a Pearson's Correlation table (which is what Excel produces)?

b. Place table here (C8):

c. Using r = approximately .28 as the signicant r value (at p = 0.05) for a correlation between 50 values, what variables are significantly related to Salary?

To compa?

d. Looking at the above correlations - both significant or not - are there any surprises -by that I mean any relationships you expected to be meaningful and are not and vice-versa?

e. Does this help us answer our equal pay for equal work question?

2. Below is a regression analysis for salary being predicted/explained by the other variables in our sample (Midpoint, age, performance rating, service, gender, and degree variables. (Note: since salary and compa are different ways of expressing an employee's salary, we do not want to have both used in the same regression.)

Plase interpret the findings.

Note: These values are not the same as the data the assignment uses. The purpose is to analyze the result of a regression test rather than directly answer our equal pay question.
Ho: The regression equation is not significant.
Ha: The regression equation is significant.
Ho: The regression coefficient for each variable is not significant
Ha: The regression coefficient for each variable is significant

SUMMARY OUTPUT


Regression Statistics
Multiple R 0.9915591
R Square 0.9831894
Adjusted R Square 0.9808437
Standard Error 2.6575926
Observations 50

ANOVA




  df SS MS F Significance F
Regression 6 17762.3 2960.38 419.1516 1.812E-36
Residual 43 303.7003 7.0628

Total 49 18066      

  Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept -1.749621 3.618368 -0.4835 0.631166 -9.046755 5.5475126 -9.04675504 5.54751262
Midpoint 1.2167011 0.031902 38.1383 8.66E-35 1.1523638 1.2810383 1.152363828 1.28103827
Age -0.004628 0.065197 -0.071 0.943739 -0.136111 0.1268547 -0.13611072 0.1268547
Performace Rating -0.056596 0.034495 -1.6407 0.108153 -0.126162 0.0129695 -0.12616237 0.01296949
Service -0.0425 0.084337 -0.5039 0.616879 -0.212582 0.1275814 -0.21258209 0.12758138
Gender 2.4203372 0.860844 2.81159 0.007397 0.6842792 4.1563952 0.684279192 4.15639523
Degree 0.2755334 0.799802 0.3445 0.732148 -1.337422 1.8884885 -1.33742165 1.88848848

Interpretation:
For the Regression as a whole:
What is the value of the F statistic:
What is the p-value associated with this value:
Is the p-value <0.05?
Do you reject or not reject the null hypothesis:
What does this decision mean for our equal pay question:

For each of the coefficients:
What is the coefficient's p-value for each of the variables:
Is the p-value < 0.05?
Do you reject or not reject each null hypothesis:
What are the coefficients for the significant variables?
Using the intercept coefficient and only the significant variables, what is the equation?
Is gender a significant factor in salary:
If so, who gets paid more with all other things being equal?
How do we know?

3 Perform a regression analysis using compa as the dependent variable and the same independent variables as used in question 2. Show the result, and interpret your findings by answering the same questions.

Note: be sure to include the appropriate hypothesis statements.
Regression hypotheses
Ho:
Ha:
Coefficient hyhpotheses (one to stand for all the separate variables)
Ho:
Ha:

Place c94 in output box.

Interpretation:
For the Regression as a whole:
What is the value of the F statistic:
What is the p-value associated with this value:
Is the p-value < 0.05?
Do you reject or not reject the null hypothesis:
What does this decision mean for our equal pay question:

For each of the coefficients:
What is the coefficient's p-value for each of the variables:
Is the p-value < 0.05?
Do you reject or not reject each null hypothesis:
What are the coefficients for the significant variables?
Using the intercept coefficient and only the significant variables, what is the equation?
Is gender a significant factor in compa:
Regardless of statistical significance, who gets paid more with all other things being equal?
How do we know?

4 Based on all of your results to date,

Do we have an answer to the question of are males and females paid equally for equal work?

Does the company pay employees equally for for equal work?

How do we know?

Which is the best variable to use in analyzing pay practices - salary or compa? Why?

What is most interesting or surprising about the results we got doing the analysis during the last 5 weeks?

5 Why did the single factor tests and analysis (such as t and single factor ANOVA tests on salary equality) not provide a complete answer to our salary equality question?

What outcomes in your life or work might benefit from a multiple regression examination rather than a simpler one variable test?

Attachment:- employee salary data set.zip

Verified Expert

This assignment deals with carrying statistical analysis, by using t-test in order to determine the mean performance of salary, compa and performance rating of male and female employees and coming up with evidence to see if there is a substantial difference in various variables between employees based on their gender.

Reference no: EM13996948

Questions Cloud

What market form has a small number of firms : What market form has a small number of firms but a big market share? what form of market will do well only under inelastic demands.
Create fiscal redistributive initiatives : Most people perceive international public administration to be a set of state structures, institutions, and processes. Opponents of globalization argue that it weakens states, making it difficult for them to sustain social welfare and environmental p..
What are the pros and cons of john rickles five-point : Question: What are the pros and cons of john rickles five-point
Determine the entropy change of the block : A 5.5-kg block slides on a level surface and stops because of friction. Its initial speed is 9.0 m/s and the temperature of the surface is 20oC. Determine the entropy change of the block, which is the system in this process.
What is the value of the chi square statistic : What are the coefficients for the significant variables and using the intercept coefficient and only the significant variables, what is the equation?
Work together to effectively address the issue : Post your perspective of how administrators need to work collaboratively to address the issue. Describe a potential outcome if effective collaboration was not utilized in this scenario. Provide any suggestions to improve the outcome of this scenar..
Residential real estate market after the fed stopped : What is the impact on residential real estate market after the Fed stopped QE3 (buying bonds monthly from banks) in 2014? Please state the shift on either demand curve or supply curve. If a price ceiling on real estate agency commission is set at 1% ..
Structure for a presentation about emotional intelligence : In this project you will be working on the structure for a presentation about emotional intelligence. You will rehearse the presentation and practice navigating through the sl ide show in Presenter view. You will change the handout master and prin..
Compare and contrast holism versus reductionism : This research paper will compare and contrast Holism (Systems Thinking) versus Reductionism. The research paper must be submitted using Safe Assign. Compare and contrast Holism versus Reductionism

Reviews

Write a Review

Basic Statistics Questions & Answers

  Auto glass companies now have a machine that can repair

auto glass companies now have a machine that can repair broken windshields with a success rate of 85. if a random

  Testing of hypothesis and confidence interval

Body mass index is calculated by dividing a persons weight by the square of his or her height; it is a measure of the extent to which the individual is overweight.

  What score would sandra need to beat

What score would Sandra need to beat 91 percent of her mathematics class? The mean score was 73, and the standard deviation was 12.

  For how many years should the company guarantee its work

the foundation corporation specializes in constructing the concrete foundations for new houses in the south. the

  Normal probability distribution for application

Suppose the time necessary to process an application for a license plate for a newly purchased vehicle is normally distributed with mean of 6 minutes and standard deviation of 1.5 minutes.

  What is the random variable involved in this study describe

if you could stop time and live forever in good health what age would you pick? answers to this question were reported

  Probability that mean weight of sixteen parts is more

Quality control inspectors randomly select 16 parts, weigh each, and then compute the sample average weight for the 16 parts. The probability that the mean weight of these 16 parts is more than 41 grams or less than 39 grams is?

  Water displacement method

Each egg was also measured for volume using a water displacement method. We wanted to know if the two methods agreed or not

  Interpretation of subjetive hypothesis without numeric data

Pollsters just report the results of hypothesis tests that show statistical significance but not those in which there was insufficient evidence.

  Knowing the mean score for a certain quiz in a statistics

Make a List of type of possible errors. Assume that you were interested in knowing the mean score for a certain quiz in a statistics class.

  Accurate interpretation of the situation

Advertising coefficient is 200 and t stat is 1.75. Which of the following statements is an accurate interpretation of the situation when using an alpha of .10

  How many subjects are needed to find number of books

How many subjects are needed to estimate the number of books read the previous year within one book with 99% confidence?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd