Simple linear regression, Applied Statistics

Simple Linear Regression


While correlation analysis determines the degree to which the variables are related, regression analysis develops the relationship between the variables.

Thus coefficient of correlation indicates the strength of a linear relationship. And here we compute the linear model that best fits the relationship. Once again, we reiterate the importance of using qualitative analysis to arrive at a cause and effect relationship before computing the model. 

Regression analysis is based on the relationship between two or more variables. The known variable is the independent variable and the variable we are trying to predict is the dependent variable. An inverse relationship exists between the variables.

If X represents the cause and Y, the effect, we are searching for

                    1885_simple linear regression.png  = E(Y|X = x) = A + Bx,

i.e., if X takes on the value x, we would expect Y to assume A + Bx.

Since it is (usually) impossible to obtain all possible pairs (X, Y), we need to estimate the model using a sample. The approximate model is given by

                   E (Y|X = x) = a + bx

In this case, a is an estimate of A and b is an estimate of B.

We may rewrite the population regression line and the sample regression lines as,

                   y = A + Bx + ex


                   y = a + bx + ex

Where ex and ex are random variables with mean 0.

Posted Date: 9/15/2012 5:02:06 AM | Location : United States

Related Discussions:- Simple linear regression, Assignment Help, Ask Question on Simple linear regression, Get Answer, Expert's Help, Simple linear regression Discussions

Write discussion on Simple linear regression
Your posts are moderated
Related Questions
For each of the following scenarios, explain how graph theory could be used to model the problem described and what a solution to the problem corresponds to in your graph model.

Your employer, Quick Hit Agency (QHA), is a debt collections agency. The company specializes in collecting small accounts. QHA does not deal in large accounts and does not take on

Jocko's Garage has been accused of insurance fraud. Data on estimates made by Jocko and another garage were obtained for 10 damaged vehicles (available in 'jockogarage.txt'). Here

There are n seats on an airplane and n passengers have bought tickets. Unfortunately, the first passenger to enter the plane has lost his ticket and, so he just chooses a seat at r

The box plot displays the diversity of data for the income; the data ranges from 20 being the minimum value and 1110 being the maximum value. The box plot is positively skewed at 4

The Neatee Eatee Hamburger Joint specializes in soyabean burgers. Customers arrive according to the following inter - arrival times between 11.00 am and 2.00 pm: Interval-arriva

Muti linear regression model problem An investigator is studying the relationship between weight (in pounds) and height (in inches) using data from a sample of 126 high school

In the context of multivariate data analysis, one might be faced with a large number of v&iables that are correlated with each other, eventually acting as proxy of each other. This

Agreement The degree to which different observers, raters or diagnostic the tests agree on the binary classification. Measures of agreement like that of the kappa coefficient qu

The cost of living index number on a some data was 200. From the base period, the percentage enhances in prices were-Rent Rs 60, clothing Rs 250, Fuel and Light Rs 150 and Miscella