Which average is to be used to describe statistical data?, Applied Statistics

There are situations where none of the three averages (mean, median and mode) is fully satisfactory. For example, if the number of items in a series is very small, none of these averages has much meaning as a measure of central tendency. Moreover, if the items are what has been called "freakishly deployed" (as when they are concentrated at one end), an average for the series can be found, but it is not descriptive of the series.

The data available may exclude the use of certain averages. For instance, in open-end distributions the mean cannot be found accurately, because an open-ended class has no definite midpoint. The median or mode can still be used, unless the median or modal class itself happens to be open-ended. In a bimodal distribution, the mode makes no sense as a single measure of central tendency.
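The open-end case can be illustrated with a minimal sketch. It assumes the usual interpolation formula for the median of grouped data; the function name grouped_median and the income figures are hypothetical and are not taken from the text above.

```python
from typing import List, Optional, Tuple

# Each class is (lower limit, upper limit, frequency); None marks an open end.
GroupedClass = Tuple[Optional[float], Optional[float], int]


def grouped_median(classes: List[GroupedClass]) -> float:
    """Interpolated median of a grouped frequency distribution."""
    total = sum(freq for _, _, freq in classes)
    if total == 0:
        raise ValueError("distribution has no observations")
    half = total / 2
    cumulative = 0
    for lower, upper, freq in classes:
        if cumulative + freq >= half:
            # This is the median class; interpolation needs both limits.
            if lower is None or upper is None:
                raise ValueError("median class is open-ended")
            return lower + (half - cumulative) / freq * (upper - lower)
        cumulative += freq
    raise ValueError("unreachable for a non-empty distribution")


# Hypothetical data: the last class is "80 and over", so a mean would
# require guessing a midpoint for it, while the median does not.
incomes = [(0, 20, 5), (20, 40, 12), (40, 60, 20), (60, 80, 9), (80, None, 4)]
median_income = grouped_median(incomes)
```

Here the median falls in the 40 to 60 class, so the open-ended top class never enters the calculation. If most of the observations sat in the open class instead, the function would raise an error, which mirrors the exception noted in the paragraph.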

The characteristics of the averages and the demands of the problem to be solved usually determine which one is to be used. The data may call for a certain average.

In problems such as the average score on personnel tests or the average productivity rate of workers, where the values in the series are really ranks, the median is the average to use. These scores and rates are not additive; that is, they indicate not unit quantities but the position of an individual relative to other individuals. Therefore a positional average, namely the median, is appropriate.

The arithmetic mean is the most commonly used and best known of the averages. It is preferred unless precluding circumstances are present, such as extreme values at either end of the series, open-end classes or varying class intervals, or unless we definitely wish to establish the most frequent value (the mode) or a positional average (the median). Where further computational techniques are involved in the investigation, the mean is the average to be used.
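The effect of an extreme value on the mean can be seen in a small sketch. The wage figures are hypothetical and chosen only for illustration.

```python
from statistics import mean, median

# Hypothetical weekly wages for five workers.
wages = [310, 320, 330, 340, 350]
# The same series with one extreme value added at the upper end.
wages_with_outlier = wages + [2500]

mean_before, median_before = mean(wages), median(wages)
mean_after, median_after = mean(wages_with_outlier), median(wages_with_outlier)

print(f"without outlier: mean={mean_before}, median={median_before}")
print(f"with outlier:    mean={mean_after:.2f}, median={median_after}")
```

The single extreme value roughly doubles the mean, while the median moves by only a few units. This is the kind of circumstance in which the mean stops being a typical value for the series.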

Great care should be taken in choosing an average. On occasion, it may even be advisable to work out all three averages and present them. The added burden is preferable to the use of a single average that may give an incomplete description.
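Working out all three averages takes little effort with Python's standard statistics module. The following is a minimal sketch; the helper name describe and the scores are hypothetical, and multimode requires Python 3.8 or later.

```python
from statistics import mean, median, multimode


def describe(values):
    """Return the mean, the median and every mode of a series."""
    return {
        "mean": mean(values),
        "median": median(values),
        # multimode returns all most-frequent values, so a bimodal
        # series shows up as a list with two entries.
        "modes": multimode(values),
    }


# Hypothetical scores in which 3 and 4 each occur twice.
scores = [2, 3, 3, 4, 4, 5, 9]
summary = describe(scores)
print(summary)
```

Reporting the modes as a list makes bimodality visible instead of hiding it behind a single number, so the reader can judge whether the mode is a useful average for the series.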

After weighing all these points, the statistician, guided by the desire to present an accurate picture of the data and to command respect, is the final judge of which average is the most appropriate.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd