Calculate cutoff values and analyzing histograms, Advanced Statistics

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality


above or below median loinc










average ACT score 1994

below the median for low inc % 1993







above the median for low inc % 1993








According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."



f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?

Posted Date: 3/11/2013 3:28:24 AM | Location : United States

Related Discussions:- Calculate cutoff values and analyzing histograms, Assignment Help, Ask Question on Calculate cutoff values and analyzing histograms, Get Answer, Expert's Help, Calculate cutoff values and analyzing histograms Discussions

Write discussion on Calculate cutoff values and analyzing histograms
Your posts are moderated
Related Questions
The GRE has a combined verbal and quantitative mean of 1000 and a standard deviation of 200.

Models for the analysis of the survival times, or the time to event, data in which it is expected that a fraction of the subjects will not experience the event of interest. In a cl

The particular projection which an investigator believes is most likely to give an accurate prediction of the future value of some process. Commonly used in the context of the anal

ain why the simulated result doesn''t have to be exact as the theoretical calculation

Multiple imputation : The Monte Carlo technique in which missing values in the data set are replaced by m> 1 simulated versions, where m is usually small (say 3-10). Each of simula

Mixture experiment is an experiment in which the two or more ingredients are blended together to form an end product. The measurements are taken on the several blends of the ingre

Component bar chart : A bar chart which shows the component parts of the aggregate represented by the whole length of the bar. The component parts are shown as the sectors of bar w

Non linear mapping (NLM ) is a technique for obtaining a low-dimensional representation of the set of multivariate data, which operates by minimizing a function of the differences

Mean squarederror is the expected value of square of the difference between an estimator and the true value of the parameter. If the estimator is unbiased then the mean of the squ

Probability distribution : For the discrete random variable, a mathematical formula which provides the probability of each value of variable. See, for instance, binomial distributi