Simple linear regression, Applied Statistics

Simple Linear Regression

 

While correlation analysis determines the degree to which the variables are related, regression analysis develops a model of the relationship between them.

Thus the coefficient of correlation indicates the strength of a linear relationship, whereas regression computes the linear model that best fits that relationship. Once again, we reiterate the importance of using qualitative analysis to establish a cause-and-effect relationship before computing the model.
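To make the idea of "strength of a linear relationship" concrete, here is a minimal sketch that computes the sample correlation coefficient by hand. The function name pearson_r and the data (hours studied against test scores) are hypothetical and chosen only for illustration.

```python
import math

def pearson_r(xs, ys):
    """Sample correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations about the means
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical sample: hours studied (X) and test score (Y)
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 64, 68]

r = pearson_r(hours, scores)
print(f"r = {r:.4f}")
```

A value of r close to +1 or -1 suggests that a straight line describes the data well. It says nothing about which variable is the cause, which is why the qualitative analysis comes first.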

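Regression analysis is based on the relationship between two or more variables. The known variable is the independent variable and the variable we are trying to predict is the dependent variable. The relationship between the variables may be direct (the dependent variable rises as the independent variable rises) or inverse (the dependent variable falls as the independent variable rises).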

If X represents the cause and Y, the effect, we are searching for

                    E(Y | X = x) = A + Bx,

i.e., if X takes on the value x, we would expect Y to assume A + Bx.

Since it is (usually) impossible to obtain all possible pairs (X, Y), we need to estimate the model using a sample. The estimated model is given by

                   ŷ = a + bx,

where ŷ is the estimate of E(Y | X = x). In this case, a is an estimate of A and b is an estimate of B.
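The text does not say how a and b are obtained from the sample. One common choice is ordinary least squares, which picks the line that minimises the sum of squared vertical deviations. The sketch below assumes that method; the function name fit_line and the data are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares estimates (a, b) for the sample line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx            # slope: estimate of B
    a = mean_y - b * mean_x  # intercept: estimate of A
    return a, b

# Hypothetical sample of (x, y) pairs
xs = [1, 2, 3, 4, 5]
ys = [52, 55, 61, 64, 68]

a, b = fit_line(xs, ys)
predicted_at_6 = a + b * 6   # estimated mean of Y when X = 6
print(f"a = {a:.2f}, b = {b:.2f}, prediction at x = 6: {predicted_at_6:.2f}")
```

A different sample from the same population would generally give different values of a and b, which is why they are only estimates of A and B.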

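We may rewrite the population regression line and the sample regression line as

                   y = A + Bx + ε_x

and

                   y = a + bx + e_x

where ε_x and e_x are random variables with mean 0. Here ε_x is the deviation of an observation from the population line and e_x is its deviation from the sample line.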
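The error term of the sample line can be checked numerically. The sketch below assumes the line is fitted by ordinary least squares with an intercept, a method the text does not name; under that assumption the observed deviations average to zero. The function name and the data are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares estimates (a, b) for the sample line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx
    return mean_y - b * mean_x, b

# Hypothetical sample of (x, y) pairs
xs = [1, 2, 3, 4, 5]
ys = [52, 55, 61, 64, 68]

a, b = fit_line(xs, ys)

# Deviation of each observation from the fitted line: y - (a + b*x)
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
mean_residual = sum(residuals) / len(residuals)

print("residuals:", [round(e, 2) for e in residuals])
print("mean residual (zero up to rounding error):", round(mean_residual, 10))
```

Individual deviations are positive for points above the line and negative for points below it; only their average is zero.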

Posted Date: 9/15/2012 5:02:06 AM | Location : United States






