Calculating cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medloinc and act94?

b. Is there a severe split in frequencies between groups?

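According to the descriptive analysis, no severe split is detected; the Tests of Normality table reports df = 32 for each group, so the two groups are the same size. This is also reflected in the skewness statistic, which is lower than .5.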
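Parts (a) and (b) come down to counting missing values and comparing group sizes. The following is a minimal sketch in Python, assuming the data are held as a list of records keyed by `medloinc` and `act94`. The records are made up for illustration, the helper names are hypothetical, and the 90/10 threshold for a "severe" split is a common rule of thumb rather than something stated in the assignment.

```python
from collections import Counter

# Hypothetical records (not the assignment's data); None marks a missing value.
records = [
    {"medloinc": 0, "act94": 15.2},
    {"medloinc": 1, "act94": 18.9},
    {"medloinc": 1, "act94": None},
    {"medloinc": None, "act94": 16.7},
    {"medloinc": 0, "act94": 14.3},
    {"medloinc": 1, "act94": 20.6},
]

def count_missing(rows, variable):
    """Number of rows in which the variable is missing."""
    return sum(1 for row in rows if row[variable] is None)

def group_proportions(rows, variable):
    """Share of non-missing cases falling in each category of the variable."""
    counts = Counter(row[variable] for row in rows if row[variable] is not None)
    total = sum(counts.values())
    return {group: n / total for group, n in counts.items()}

def has_severe_split(proportions, threshold=0.90):
    """True when one category holds at least `threshold` of the cases (assumed rule)."""
    return max(proportions.values()) >= threshold

missing = {name: count_missing(records, name) for name in ("medloinc", "act94")}
proportions = group_proportions(records, "medloinc")
print(missing)
print(proportions, has_severe_split(proportions))
```

With the real data set, two groups of equal size would give proportions of .5 each, well short of any reasonable threshold for a severe split.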

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for
medloinc= below the median for low inc % 1993

 Frequency    Stem &  Leaf

     7.00       14 .  1223789
     9.00       15 .  234478888
     5.00       16 .  12788
     4.00       17 .  1378
     2.00       18 .  09
     1.00       19 .  6
     3.00       20 .  069
     1.00 Extremes    (>=22.5)

 Stem width:  1.0
 Each leaf:   1 case(s)
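For parts (c) and (d), one way to arrive at cutoff values is Tukey's rule: place fences 1.5 interquartile ranges beyond the hinges. The sketch below applies that rule to the values read off the stem-and-leaf plot for the below-median group. It rests on several assumptions: the 1.5 multiplier, the use of Tukey's hinges as quartiles, the one-decimal values shown in the plot, and 22.5 as a stand-in for the single extreme case, whose exact value the plot does not give. The function name is hypothetical.

```python
from statistics import median

# ACT means read off the stem-and-leaf plot for the below-median group.
# The plot only says the one extreme case is >= 22.5, so 22.5 is a stand-in.
below_median_act = [
    14.1, 14.2, 14.2, 14.3, 14.7, 14.8, 14.9,
    15.2, 15.3, 15.4, 15.4, 15.7, 15.8, 15.8, 15.8, 15.8,
    16.1, 16.2, 16.7, 16.8, 16.8,
    17.1, 17.3, 17.7, 17.8,
    18.0, 18.9,
    19.6,
    20.0, 20.6, 20.9,
    22.5,
]

def tukey_fences(values, k=1.5):
    """Return (lower, upper) cutoffs placed k * IQR beyond Tukey's hinges."""
    data = sorted(values)
    n = len(data)
    lower_hinge = median(data[: (n + 1) // 2])  # median of the lower half
    upper_hinge = median(data[n // 2:])         # median of the upper half
    spread = upper_hinge - lower_hinge
    return lower_hinge - k * spread, upper_hinge + k * spread

low_cut, high_cut = tukey_fences(below_median_act)
outliers = [x for x in below_median_act if x < low_cut or x > high_cut]
print(low_cut, high_cut, outliers)
```

Under these assumptions the cutoffs come out near 11.5 and 21.5, so only the case at or above 22.5 falls outside them, which agrees with the single "Extremes" entry in the plot. The same steps would be repeated for the above-median group, whose plot is not reproduced here.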

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

average ACT score 1994

                                       Kolmogorov-Smirnov(a)       Shapiro-Wilk
above or below median loinc            Statistic   df   Sig.       Statistic   df   Sig.
below the median for low inc % 1993    .162        32   .032       .903        32   .007
above the median for low inc % 1993    .166        32   .025       .921        32   .023

 

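According to the tests of normality, both the Kolmogorov-Smirnov and the Shapiro-Wilk tests are significant for both groups (every Sig. value is below .05), so the ACT means depart from a normal distribution in each group. The stem-and-leaf plot for the below-median group shows a positive skew: scores bunch at 14 to 15 and the tail extends past 20. Therefore, for the transformation, we would select "Square Root," a common choice for moderate positive skew.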
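The Sig. values in the Tests of Normality table can be checked mechanically against a significance level. This short sketch assumes the conventional .05 level, which the assignment does not state; the dictionary layout and function name are hypothetical.

```python
ALPHA = 0.05  # conventional significance level (an assumption)

# Sig. values copied from the Tests of Normality table (df = 32 per group).
normality_sig = {
    "below the median": {"Kolmogorov-Smirnov": 0.032, "Shapiro-Wilk": 0.007},
    "above the median": {"Kolmogorov-Smirnov": 0.025, "Shapiro-Wilk": 0.023},
}

def rejects_normality(sig_by_test, alpha=ALPHA):
    """Map each test to True when its p-value falls below alpha."""
    return {test: p < alpha for test, p in sig_by_test.items()}

decisions = {group: rejects_normality(sig) for group, sig in normality_sig.items()}
for group, result in decisions.items():
    print(group, result)
```

Every entry comes back True, meaning each test rejects the null hypothesis of normality for each group at the .05 level.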

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

Levene's test showed no significant difference in variance between the two groups. Therefore, we can assume homogeneity of variance.
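The decision rule for Levene's test can be sketched with SciPy's `scipy.stats.levene`. The two samples below are hypothetical and are not the assignment's data, and the .05 level is assumed. Note that SciPy centers on the median by default, so `center="mean"` is passed here to get the original mean-based Levene statistic.

```python
from scipy.stats import levene

# Hypothetical ACT means for two groups; not the assignment's data.
group_below = [14.1, 14.8, 15.3, 15.8, 16.2, 16.8, 17.7, 18.9, 20.0, 22.5]
group_above = [13.9, 14.5, 15.1, 15.6, 16.0, 16.9, 17.4, 18.2, 19.5, 21.0]

# center="mean" gives the original Levene statistic; SciPy's default,
# center="median", is the Brown-Forsythe variant.
result = levene(group_below, group_above, center="mean")

ALPHA = 0.05  # assumed significance level
equal_variances_plausible = result.pvalue > ALPHA
print(result.statistic, result.pvalue, equal_variances_plausible)
```

A p-value above the chosen level means the null hypothesis of equal variances is not rejected, which is the situation the answer above describes.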

2. Examination of the variable scienc93 indicates a substantial to severe positively skewed distribution. Transform this variable using the two most appropriate methods. After examining the distributions of the transformed variables, which transformation produced the best alteration?
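A common textbook convention pairs substantial positive skew with a logarithmic transformation and severe positive skew with a reciprocal (inverse) transformation; treating these as the "two most appropriate methods" is an assumption here. The sketch below compares skewness before and after each transformation on made-up scores, since the scienc93 values are not given. The `skewness` helper is hypothetical and computes the simple moment-based coefficient, which differs slightly from the bias-adjusted value that statistics packages usually report.

```python
import math

def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no small-sample adjustment)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

# Hypothetical positively skewed scores; not the scienc93 data.
scores = [1, 1, 2, 2, 2, 3, 3, 4, 5, 7, 12, 30]

log_scores = [math.log10(x) for x in scores]  # requires every value > 0
inverse_scores = [1 / x for x in scores]      # reverses the ordering of cases

for label, data in [("raw", scores), ("log10", log_scores), ("inverse", inverse_scores)]:
    print(f"{label:8s} skewness = {skewness(data):.2f}")
```

Whichever transformed variable ends up with skewness closest to zero, and with the most nearly normal histogram and Q-Q plot, would be the one that "produced the best alteration." If the variable contains zeros, a constant is normally added before taking the log or the reciprocal.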



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd