Calculate cutoff values and analyzing histograms, Advanced Statistics

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality


above or below median loinc










average ACT score 1994

below the median for low inc % 1993







above the median for low inc % 1993








According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."



f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?

Posted Date: 3/11/2013 3:28:24 AM | Location : United States

Related Discussions:- Calculate cutoff values and analyzing histograms, Assignment Help, Ask Question on Calculate cutoff values and analyzing histograms, Get Answer, Expert's Help, Calculate cutoff values and analyzing histograms Discussions

Write discussion on Calculate cutoff values and analyzing histograms
Your posts are moderated
Related Questions
A term commonly encountered in the application of the agglomerative hierarchical clustering techniques, where it refers to the 'tree-like' diagram illustrating the series of steps

The phrase first spoken by one of the witches in Macbeth. Now this is used to describe the exponential rise in the number of possible locations in the multivariate space as dimensi

Regression discontinuity design is the quasi-experimental design in which participants in, for instance, an intervention study, are assigned to the treatment and control groups on

1. define statistical algorithms 2. write the flow charts for statistical algorithms for sums, squares and products. 3. write flow charts for statistical algorithms to generates ra

Models which make use of the smoothing techniques such as locally weighted regression to identify and represent the possible non-linear relationships between the explanatory and th

5. Packages from a machine a normally distributed with a mean 200g and its standard deviation 2grams. Find the probability that a package from the machine weighs a) Less than

Classification matrix: A term many times used in discriminant analysis for the matrix summarizing the results and outputs obtained from the derived classi?cation rule, and obtaine

The measure of the degree to which the particular model differs from the saturated model for the data set. Explicitly in terms of the likelihoods of the two models can be defined a

Mauchly test is a test which a variance-covariance matrix of pair wise differences of responses in the set of longitudinal data is the scalar multiple of identity matrix, a proper