Calculate cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?


Related Discussions:- Calculate cutoff values and analyzing histograms

Calibration, Calibration : A procedure which enables a series of simply obt...

Calibration : A procedure which enables a series of simply obtainable but inaccurate measurements of some quantity of interest to be used to provide more precise estimates of the r

Bayesian network, Bayesian network : It is essentially an expert system in ...

Bayesian network : It is essentially an expert system in which the uncertainty is dealt with using the conditional probabilities and Bayes' Theorem. Formally such type of network c

Historigram, difference between histogram and historigram

difference between histogram and historigram

Disease surveillance, The procedure which targets to use the health and hea...

The procedure which targets to use the health and health-related data which precede diagnosis and/or confirmation to identify possible outbreaks of the disease, mobilize a rapid re

Curvature measures, The diagnostic tools or devices used to approach the cl...

The diagnostic tools or devices used to approach the closeness to the linearity of the non-linear model. They calculate the deviation of so-called expectation surface from the plan

Fibonacci distribution, The probability distribution of the various observa...

The probability distribution of the various observations is required to obtain the run of two successes in the series of Bernoulli trials with the probability of success equal to a

Evaluate the maximum flow, In the network shown below, the rst of the two ...

In the network shown below, the rst of the two numbers on each arc indicates the arc capacity and the second (in parentheses) of the two numbers indicates the current  flow. Use t

Design matrix, It is used generally for the matrix which specifies a statis...

It is used generally for the matrix which specifies a statistical model for a set of observations. For instance, in a one-way design with the three observations in one group, tw

Percentage, Looking for the correct answer.Y=50+.079(149)-.261(214)=

Looking for the correct answer.Y=50+.079(149)-.261(214)=

Population averaged models, Population averaged models are the models for ...

Population averaged models are the models for kind of clustered data in which the marginal expectation of response variable is the main focus of interest. An alternative approach

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd