Contingency table by gender and department

Assignment Help Basic Statistics
Reference no: EM13925540

1. Table 1 shows the number of applicants to graduate school at Berkeley for the six largest departments in 1973 by gender and department. Table 2 shows the number of rejected applicants by gender and department. Recall the notation GM, GF , DA, DB, DC , DD, DE and DF

Table 1: The contingency table by gender and department (counting the number of applicants)

Table 2: The contingency table by gender and department (counting the number of rejected applicants)

a. By dividing the number of rejected applicants by the number of applicants, complete Table 3. Note that this table is not a contingency table. The table summarizes the rejection rate by each subgroup. For instance, 0.379 represents the rejection rate for male applicants who applied to Department A. It can be translated to P(Rejected | GM ∩ DA) = 0.379 . In addition, you should interpret P(Rejected | DA) = 0.356 , as the rejection rate in Department A (regardless of gender). Table 3: The proportions of rejected applicants by gender and department

b.  From Table 3, report the conditional probability that an applicant was rejected among male applicants, namely P(Rejected | GM).

c.  From Table 3, report the conditional probability that an applicant was rejected among female applicants, namely P(Rejected | GF ).

d.  Based on P(Rejected | GM) and P(Rejected | GF ) only, which gender has a greater rejection rate?

e.  From Table 1, report the six ratios of conditional probabilities P(GM | DA) P(GF | DA) , . . . , P(GM | DF ) P(GF | DF ) . Round to two decimal places.

f.  From Table 3, report the six conditional probabilities P(Rejected | DA), . . . , P(Rejected | DF ).

g.  Find the six ratios of conditional probabilities P(Rejected | GM ∩ DA) P(Rejected | GF ∩ DA) , . . . , P(Rejected | GM ∩ DF ) P(Rejected | GF ∩ DF ) .

h.  In part g, what does a ratio greater than one imply? What does a ratio close to one imply?

i.  Do you still believe that there was gender discrimination in the freshmen recruitment? Provide your reason based on part g.

j.  Using Law of Total Probability, write P(Rejected | GF ) = 0.696 as a weighted average of six probabilities.

k.  Using Law of Total Probability, write P(Rejected | GM) = 0.555 as a weighted average of six probabilities.

l.  In one sentence, explain why P(Rejected | GF ) > P(Rejected | GM) happened

2. Suppose a company which produces fire alarms has claimed that the fire alarms make only one false alarm per year, on average. Let X denote the number of false alarms per year. Assume X ∼ Poisson(λ). Under the company's claim, the probability of observing x fire alarms per year is P(X = x) = e -λ λ x x! = e -1 x! , x = 0, 1, . . . . A customer had a bad experience with the fire alarm he purchased before. He wants to conduct hypothesis testing H0: λ = 1 versus H1: λ > 1, and he purchased another fire alarm from the same company. He allows 1% chance for falsely rejecting H0. After a year, he observed three false alarms.

a.  Find the p-value based on the single observation (three false alarms for the year).

b.  Draw a conclusion based on the p-value in part a (in the context of this problem without using any symbols).

c.  He gathered one hundred people who observed three or more false alarms and observed (X1, X2, . . . , X100) = (4, 6, . . . , 3) with X¯ 100 = 1 100 X 100 i=1 Xi = 3.32 . Ignoring any flaw of data collection, calculate the test statistic (which is compared to the standard normal distribution) and the approximate p-value for testing H0: λ = 1 versus H1: λ > 1. (Hint: If we observe Poisson random variables, the population mean and the population variance are equal to λ.)

d.  In two sentences, argue why the sample of size n = 100 is not useful for the hypothesis testing.

3. In lecture we discussed the association between gestational age X and birth weight Y . Here is a portion of the R output. Intercept - 1410.7 155.8 - 9.055 < 2e - 16 x 124.1 4.0 31.026 < 2e - 16 We estimate the slope β1 as βˆ 1 = Pn i=1(Xi - X¯ n)(Yi - Y¯ n) Pn i=1(Xi - X¯ n) 2 . and the intercept β0 as βˆ 0 = Y¯ n - βˆ 1 X¯ n . If we transform a random sample (X1, Y1), . . . ,(Xn, Yn) as T = βˆ 1 - β1 SE , SE = s 1 n-2 Pn i=1(Yi - βˆ 0 - βˆ 1Xi) 2 Pn i=1(Xi - X¯ n) 2 , the transformed random variable T follows the T distribution with n - 2 degrees of freedom, where β1 is the true slope under the linear model. In this exercise, our goal is to derive a 95% confidence interval (CI) for the unknown slope β1. In the dataset, we observed 2500 babies.

a.  Find the constant t ∗ such that P -t ∗ ≤ βˆ 1 - β1 SE ≤ t ∗ ! = 0.95 . (1) You need to use R. Round t ∗ to three decimals.

b.  Using algebra inside the probability statement, we are able to rewrite Equation (1) as P (L ≤ β1 ≤ U) = 0.95 for some L and U. Since the true value of β1 is in (L, U) with probability 0.95 (if we take a random sample of size 2500 many times), the random interval (L, U) becomes a 95% CI for β1. Using algebra inside the probability statement of Equation (1), derive L and U in terms of t ∗ , SE, and βˆ 1. Do not insert any numeric value yet.

c.  In the R output, the estimated β1 is 124.1 and the calculated SE is 4.0. (SE quantifies the uncertainty associated with our estimate βˆ 1 which is called the standard error.) Report the observed 95% CI for β1. Round to two decimal places.

d.  We have 103 students in Stats 67. Suppose all students collect a random sample of size 2500 from the same population, and each student constructs a 95% CI for β1 from his/her own data. What is the expectation for number of students who will miss the true value of β1? (Hint: The number of students who miss the true value of β1 follows a binomial distribution.)

4. Suppose we observe a random sample (X1, . . . , Xn) with n = 10, where Xi ∼ Bernoulli(p) and p is the proportion of black cars at UCI. Suppose we observed (x1, . . . , x10) = (1, 0, 0, 1, 0, 1, 0, 0, 1, 0).

a. Find the likelihood function L(p) given the ten binary observations.
b. Find the log-likelihood function l(p) = log L(p).
c. Take the first derivative of l(p) with respect to p.
d. Report the estimate of the population proportion of black cars based on the method of maximum likelihood estimation.

Reference no: EM13925540

Questions Cloud

What is the portfolios residual risk and active risk : what is the portfolio's residual risk? What is its active risk? How does this compare to the difference between the portfolio risk and the benchmark risk?
Classification systems: a team approach : In this unit, you are reviewing the DSM-IV classification system as part of the Mental, Behavioral and Neurodevelopment Disorder chapters of ICD-10-CM. For this discussion, you will continue your review of other clinical classifications systems utili..
Companies in the global environment : What do you think the opportunities are for companies in the global environment? Do you think that they are only for large multinationals or do you think that small companies can be involved also?
How has the internet created and impacted global perspective : How has the Internet created and impacted global perspectives? What are examples of personal global perceptions changing after the use of the Internet?
Contingency table by gender and department : Table 1 shows the number of applicants to graduate school at Berkeley for the six largest departments in 1973 by gender and department. Table 2 shows the number of rejected applicants by gender and department. Recall the notation GM, GF , DA, DB, ..
Disadvantages and challenges of global expansion : Write a short paper on the advantages, disadvantages and challenges of global expansion; Develop a suggested strategy for global expansion for your organization, another existing company, or an imaginary business (include a brief summary of whatev..
Malware paper - how effective it is at evading detection : Malware Paper, Note three kinds of malware that are active threats today. Note the following for each type: How common it is, A brief explanation of how it works and How effective it is at evading detection
What is your certainty equivalent return : If a portfolio has an expected excess return of 6 percent and risk of 20 percent, what is your certainty equivalent return, the certain expected excess return that you would fairly trade for this portfolio?
Colter company prepares monthly cash budgets : Colter Company prepares monthly cash budgets. Relevant data from operating budgets for 2017

Reviews

Write a Review

Basic Statistics Questions & Answers

  The quality-control manager at a light bulb factory needs

the quality-control manager at a light bulb factory needs to determine whether the mean life of a large shipment of

  The average age of doctors in a certain hospital is 490

the average age of doctors in a certain hospital is 49.0 years old. suppose the distribution of ages is normal and has

  They are now thermally connected by a reversible heat

a constant pressure pistoncylinder has 1 kg of saturated liquid water at 100kpa. a rigid tank contains air at 1200k

  A random sample of 500 shoppers at sharpstown mall found

a random sample of 500 shoppers at sharpstown mall found that 120 favored longer shopping hours. is this sufficient

  Determine which has higher coefficient of variation

Stock 2 closing price over last month has the mean of 59.2 and the standard deviation of 3.3. Determine which has higher coefficient of variation?

  Computing f-statistic for linear construct

Compute an F-statistic for each linear construct and determine its significance. Also, state what questions each of these contrasts address.

  Confidence interval for population mean of waiting times

The following data represents the amount of time ( in minutes ) that a person had to wait for a bus to work on a random sample of 5 working days.

  A bright idea a company produces lightbulbs whose life

a bright idea. a company produces lightbulbs whose life follows the n1500 300 distribution i.e. the lifetimes of

  What conclusion can be drawn based on level of confidence

If university officials say that at least 70% of the voting student population supporting the fee increase, what conclusion can be drawn based on a 95% level of confidence?

  The standard normal table shows an area value of 08 for a

the standard normal table shows an area value of 0.8 for a z-score of 0.84. what percentage of the observations of a

  Calculating the test statistic of a printing plant

The superintendent of a printing plant has selected a random sample of 100 rolls of paper from a large shipment. The average length of the sample rolls is 416 feet, with a variance of 2704 feet.

  Normal distribution and z-value

What is a normal distribution? What is a Z-Value? Can you provide and example of how they relate?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd