Non-sampling errors, Applied Statistics

Statistics Can Lead to Errors

The use of statistics can often lead to wrong conclusions or wrong estimates. For example, we may want to find out the average savings by individual investors in 1994-95. Hence, we would have to question every investor in our population or some sample of such investors. In either case, the average savings calculated may not be the true average savings for the population. This is basically due to the occurrence of errors. Errors may be classified into two categories.

NON-SAMPLING ERRORS

Such errors are caused by deficiencies in the collection and editing of data. Three reasons for such errors include Procedural Bias, Biased Observations and Non-Response Bias. Such errors may occur in a sample or in a census.

Procedural Bias

Procedural bias is the distortion of the representativeness of the data due to the procedure adopted in collecting the data.

For instance in our retailers example, Procedural Bias may creep in, if the retailer excludes all customers making purchases under Rs.2,000. In effect she will then study only high value customers, not all customers.

Suppose data is being collected about the rent levels in a city. The question “How much rent do you pay for accommodation?” can introduce a Procedural Bias because the rent may be for accommodation without furniture, etc. or for accommodation with furniture. In some cases the rent may include charges of the co-operative housing society for maintenance, etc. In other cases it may be a composite rent including even electricity and water charges. Hence, the above question must necessarily be supplemented by questions about what is included and excluded from the rent.

Unless questions are correctly framed, a procedural bias can creep into the investigation.

Non-Response Bias

Absence of response can lead to Non-Response Bias.

In the retailers case, she may ask the customers for their suggestions for better products and services. The customers would require some time for thinking about this and may not be able to give an immediate answer. But, once they leave the shop they may forget all about responding. It is not possible to conclude that the customers do not have any suggestion for improving the service.

For another example, investors may be asked “How much do you expect to invest in shares in 1998 if the Sensex rises to 6000 by the end of the year?” If a significant proportion, say 60% of the investors, do not reply then there will be significant Non-Response Bias. It cannot be assumed that those who did not respond will not invest in shares in 1998. Nor can it be assumed that those who did not respond will invest in the same way as those who responded.

Biased Observation

Here the observations do not correctly reflect the characteristics of the population being studied. The retailer may exclude important information like the quantity and type of equipment purchased, etc. and only concentrate on the bill amount. Hence, a purchaser of a number of low value items would be treated on the same footing as a purchaser of a single high value item. This may be unjustified as the two purchasers are likely to have distinctly different needs.

For another example, a study may be conducted to find out the annual earnings of various types of finance executives as compared with their qualifications. In such a case if all CFAs and CAs are grouped together as professionals then there is Biased Observation. This is because each of these qualifications is distinct and many executives may have more than one of the above qualifications. 

Posted Date: 9/15/2012 3:03:41 AM | Location : United States







Related Discussions:- Non-sampling errors, Assignment Help, Ask Question on Non-sampling errors, Get Answer, Expert's Help, Non-sampling errors Discussions

Write discussion on Non-sampling errors
Your posts are moderated
Related Questions
Large Sample Test for Proportion A random sample of size n (n > 30) has a sample proportion p of members possessing a certain attribute (success). To test the hypothesis that t

These techniques are applied when the rows and the columns of the data table represent the same units and when the measure is a disiance or a similarity. The goal of the analysis i

Histogram: It is generally used for charting continuous frequency   distribution. In histogram, data are plotted as a series  of rectangle one over the other. Class intervals

Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ...

Your company has developed a new product .Your company is a reputed company with 50% market share of same range of products. Your competitors also come with their new products equa

Correspondence Analysis (CA) is a generalization of PCA to contingency tables. The factors of correspondence analysis give an orthogonal decomposi:ion of the Chi- square associated

1. For each of the following variables: major, graduate GPA, and height: a. Determine whether the variable is categorical or numerical. b. If the variable is numerical, deter

it is said that management is equivalent to decision making? do you agree? explain

As we stated above, we start factor analysis with principal component analysis, but we quickly diverge as we apply the a priori knowledge we brought to the problem. This knowled

There are two diagnostic tests for a disease. Among those who have the disease, 10% give negative results on the first test, and independently of this, 5% give negative results on