Assumptions in regression, Applied Statistics

Assumptions in Regression

To understand the properties underlying the regression line, let us go back to the example of the model exam and the main exam. We can estimate a student's main exam points if we also know his or her points on the model exam. As stated earlier, a student who scores 85 on the model exam should receive main exam points in the vicinity of 75 to 95.

If we knew the model exam scores of all students along with their main exam scores, we would have the population of values. The mean and the variance of the population of model exam scores would be μ_x and σ_x², respectively. The corresponding parameters for the main exam points are μ_y and σ_y².

The assumptions in regression are:

  1. The relationship between X and Y is linear, which implies that E(Y|X = x) = A + Bx at any given value X = x.

  2. At each X = x, the distribution of Y_x is normal, and the variances of these distributions are equal. This implies that the error terms (the E's) all have the same variance, σ².

  3. The Y-values are independent of each other.

  4. No assumption is made regarding the distribution of X.
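The four assumptions can be made concrete with a small simulation. The sketch below is illustrative only: the values of A, B and sigma are hypothetical, chosen so that a model exam score of 85 gives main exam points mostly between 75 and 95, and the function name `simulate_main_exam` is not from the original text.

```python
import random
import statistics

def simulate_main_exam(model_scores, A=8.5, B=0.9, sigma=5.0, seed=0):
    """Draw one main exam score per model exam score under the assumptions.

    A, B and sigma are hypothetical values chosen for illustration only.
    """
    rng = random.Random(seed)
    # Assumption 1: the mean of Y at X = x is A + B*x.
    # Assumption 2: errors are normal with the same sigma at every x.
    # Assumption 3: each draw is independent of the others.
    return [A + B * x + rng.gauss(0.0, sigma) for x in model_scores]

# Assumption 4: nothing is assumed about X; here every student scored 85.
model_scores = [85] * 5000
main_scores = simulate_main_exam(model_scores)

print(round(statistics.mean(main_scores), 1))   # should be close to 8.5 + 0.9*85 = 85
print(round(statistics.stdev(main_scores), 1))  # should be close to sigma = 5
```

With these assumed values, roughly 95% of the simulated main exam points fall within two standard deviations of the mean, that is, between about 75 and 95.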

Since we do not have all of the students' model exam points and main exam points, we must estimate the regression line E(Y|X = x) = A + Bx from a sample.
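One common way to carry out this estimation is ordinary least squares. The passage does not name the method, so treat the formulas below as an assumed, standard choice. The symbols a and b (sample estimates of A and B), the sample size n, the observations (x_i, y_i) and the sample means are notation introduced here for illustration.

```latex
\hat{Y} = a + b x, \qquad
b = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}, \qquad
a = \bar{y} - b\,\bar{x}
```

Because a is defined as the mean of y minus b times the mean of x, the estimated line always passes through the point of sample means.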

The figure shows a line that has been constructed on the scatter diagram. Note that the line appears to pass through the collective mid-point of the plotted points. The fitted value on this line, Ŷ, is the estimate of the true mean of Y at any particular X = x.

Figure 8: Assumptions in regression
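The observation about the collective mid-point can be checked numerically. The sketch below fits a line by least squares (an assumed fitting method) to five hypothetical pairs of model exam and main exam points, then confirms that the line passes through the point of means and reads off the estimated mean of Y at x = 85. The data and the name `fitted_mean` are made up for illustration.

```python
# Hypothetical (model exam, main exam) points for five students.
model = [60, 70, 75, 85, 90]
main = [62, 71, 73, 86, 88]

n = len(model)
x_bar = sum(model) / n
y_bar = sum(main) / n

# Least-squares slope and intercept (an assumed fitting method).
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(model, main))
sxx = sum((x - x_bar) ** 2 for x in model)
b = sxy / sxx
a = y_bar - b * x_bar

def fitted_mean(x):
    """Estimated mean of Y at X = x, read off the fitted line."""
    return a + b * x

# The fitted line passes through the point of means (x_bar, y_bar).
print(abs(fitted_mean(x_bar) - y_bar) < 1e-9)

# Estimated mean main exam score for a model exam score of 85.
print(round(fitted_mean(85), 1))
```

The value printed last is an estimate of the mean of Y at x = 85, not a prediction of any single student's score. Individual scores scatter around it with the common variance described in assumption 2.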
Posted Date: 9/15/2012 5:05:34 AM | Location : United States






