Calculating cutoff values and analyzing histograms, Advanced Statistics

Assignment Help:

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medloinc and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected: the Tests of Normality output reports df = 32 for each group, so the two groups are the same size. This is also reflected in the skewness statistic, which is below .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for
medloinc= below the median for low inc % 1993

 Frequency    Stem &  Leaf

     7.00       14 .  1223789
     9.00       15 .  234478888
     5.00       16 .  12788
     4.00       17 .  1378
     2.00       18 .  09
     1.00       19 .  6
     3.00       20 .  069
     1.00 Extremes    (>=22.5)

 Stem width:  1.0
 Each leaf:   1 case(s)
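
One way to reason about the cutoff in part c is sketched below. It rebuilds the below-median group from the stem-and-leaf plot and applies Tukey's 1.5 x (hinge spread) fences, a common convention for flagging outliers. This is an assumption about method, not necessarily the rule the assignment or SPSS expects. The leaves are only given to one decimal, and the single "Extremes" case is only known to be at least 22.5, so 22.5 is used as a stand-in. The names act_below and tukey_fences are hypothetical.

```python
from statistics import median

# Values read off the stem-and-leaf plot (stem.leaf, one decimal only).
# The last value is a stand-in for the "Extremes (>=22.5)" case (assumption).
act_below = [
    14.1, 14.2, 14.2, 14.3, 14.7, 14.8, 14.9,
    15.2, 15.3, 15.4, 15.4, 15.7, 15.8, 15.8, 15.8, 15.8,
    16.1, 16.2, 16.7, 16.8, 16.8,
    17.1, 17.3, 17.7, 17.8,
    18.0, 18.9,
    19.6,
    20.0, 20.6, 20.9,
    22.5,
]


def tukey_fences(values, k=1.5):
    """Return (lower, upper) fences based on Tukey's hinges."""
    data = sorted(values)
    n = len(data)
    half = (n + 1) // 2  # the median is shared by both halves when n is odd
    lower_hinge = median(data[:half])
    upper_hinge = median(data[n - half:])
    spread = upper_hinge - lower_hinge
    return lower_hinge - k * spread, upper_hinge + k * spread


lower, upper = tukey_fences(act_below)
outliers = [v for v in act_below if v < lower or v > upper]
print(f"fences: {lower:.2f} to {upper:.2f}; flagged cases: {outliers}")
```

Under these assumptions only the case listed under "Extremes" falls outside the fences, which agrees with the plot; with the raw data the exact fence values may differ slightly.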

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality: average ACT score 1994, by above or below median loinc

                                       Kolmogorov-Smirnov       Shapiro-Wilk
Group                                  Statistic  df   Sig.     Statistic  df   Sig.
below the median for low inc % 1993    .162       32   .032     .903       32   .007
above the median for low inc % 1993    .166       32   .025     .921       32   .023


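According to the tests of normality, every Sig. value is below .05 (Kolmogorov-Smirnov: .032 and .025; Shapiro-Wilk: .007 and .023), so the hypothesis of a normal distribution is rejected for both groups. The stem-and-leaf plot for the below-median group also shows a tail toward higher scores, that is, positive skew. Therefore, for the transformation, we would select "Square Root," which is typically recommended for moderate positive skew.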
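
The usual decision rule for these tests can be sketched as follows: a Sig. value below the chosen alpha level rejects the hypothesis that the scores are normally distributed. The Sig. values are copied from the Tests of Normality output; the alpha of .05 and the name normality_rejected are assumptions made for illustration.

```python
ALPHA = 0.05  # conventional significance level (assumption)

# Sig. values copied from the Tests of Normality output
sig_values = {
    ("below median", "Kolmogorov-Smirnov"): 0.032,
    ("below median", "Shapiro-Wilk"): 0.007,
    ("above median", "Kolmogorov-Smirnov"): 0.025,
    ("above median", "Shapiro-Wilk"): 0.023,
}


def normality_rejected(p_value, alpha=ALPHA):
    """A significant test (p < alpha) counts as evidence against normality."""
    return p_value < alpha


decisions = {key: normality_rejected(p) for key, p in sig_values.items()}
for (group, test), rejected in decisions.items():
    verdict = "reject normality" if rejected else "no evidence against normality"
    print(f"{group:13s} {test:19s} -> {verdict}")
```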

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

Levene's test was not significant: the variances of the two groups did not differ significantly. Therefore, we can assume homogeneity of variance.

2. Examination of the variable scienc93 indicates a substantial to severe positively skewed distribution. Transform this variable using the two most appropriate methods. After examining the distributions of the transformed variables, which transformation produced the best alteration?
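
A minimal sketch of how two candidate transformations could be compared is given below. It assumes the logarithm (often suggested for substantial positive skew) and the inverse (often suggested for severe positive skew) are the two methods in question; the question itself does not name them. The scores are made-up illustrative values, NOT the scienc93 data, and skewness is a hypothetical helper that computes the simple moment-based coefficient, which may differ slightly from the adjusted value SPSS prints.

```python
import math


def skewness(values):
    """Moment-based skewness: m3 / m2**1.5 (no small-sample adjustment)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical positively skewed scores (NOT the real scienc93 values)
scores = [1, 1, 2, 2, 2, 3, 3, 4, 5, 6, 8, 12, 20, 35, 60]

candidates = {
    "original": scores,
    "log10": [math.log10(v) for v in scores],
    # The inverse reverses the ordering of cases, so the sign of the
    # skewness may flip; compare absolute values.
    "inverse": [1 / v for v in scores],
}

for name, values in candidates.items():
    print(f"{name:8s} skewness = {skewness(values):+.2f}")
```

The transformation whose skewness lies closest to zero, and whose histogram and normality tests look best, would be the one to report. Both transformations require strictly positive scores; a constant is usually added first if zeros are present.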



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd