Calculate cutoff values and analyzing histograms, Advanced Statistics

1. You are interested in investigating if being above or below the median income (medloinc) impacts ACT means (act94) for schools. Complete the necessary steps to examine univariate grouped data in order to respond to the questions below. Although deletions and/or transformations may be implied from your examination, all steps will examine original variables.

a. How many subjects have missing values for medlonic and act94?

b. Is there a severe split in frequencies between groups?

According to the descriptive analysis, no severe split is detected. This is also reflected in the skewness number which is lower than .5.

c. What are the cutoff values for outliers in each group?

d. Which outlying cases should be deleted for each group?

Average ACT score 1994 Stem-and-Leaf Plot for

medloinc= below the median for low inc % 1993

 Frequency Stem & Leaf

 7.00 14 . 1223789

 9.00 15 . 234478888

 5.00 16 . 12788

 4.00 17 . 1378

 2.00 18 . 09

 1.00 19 . 6

 3.00 20 . 069

 1.00 Extremes (>=22.5)

 Stem width: 1.0

 Each leaf: 1 case(s)

e. Analyzing histograms, normal Q-Q plots, and tests of normality, what is your conclusion regarding normality? If a transformation is necessary, which one would you use?

Tests of Normality

 

above or below median loinc

Kolmogorov-Smirnova

Shapiro-Wilk

 

Statistic

df

Sig.

Statistic

df

Sig.

average ACT score 1994

below the median for low inc % 1993

.162

32

.032

.903

32

.007

above the median for low inc % 1993

.166

32

.025

.921

32

.023

 

According to the information and the test of normality, it appears that this is a normal distribution.  Therefore, for the transformation, we would select 'Square Root."

 

 

f. Do the results from Levene's Test of Equal Variances indicate homogeneity of variance? Explain.

In running the test, there were no significant differences between the categories. Therefore; we can assume that this indicates homogeneity of variance.

2. Examination of the variable of scienc93 indicates a substantial to serve positively skewed distribution. Transform this variable using the most two appropriate methods. After examining the distribution for these transformed variables, which produced the best alteration?

Posted Date: 3/11/2013 3:28:24 AM | Location : United States







Related Discussions:- Calculate cutoff values and analyzing histograms, Assignment Help, Ask Question on Calculate cutoff values and analyzing histograms, Get Answer, Expert's Help, Calculate cutoff values and analyzing histograms Discussions

Write discussion on Calculate cutoff values and analyzing histograms
Your posts are moderated
Related Questions
Write a c++ program to find the sum of 0.123 ? 10 3 and 0.456 ? 10 2 and write the result inthree significant digits.

Introduction to Generalized Linear Models (GLM) We introduce the notion of GLM as an extension of the traditional normal-theory-based linear regression models. This will be very

HOW TO OBTAIN THE LASPEYRES QUANTITY INDEX AND THE FORMULA

Regression dilution is the term which is applied when a covariate in the model cannot be measured directly and instead of that a related observed value must be used in analysis. I

Please help with following problem: : Let’s consider the logistic regression model, which we will refer to as Model 1, given by log(pi / [1-pi]) = 0.25 + 0.32*X1 + 0.70*X2 + 0.

Tracking is the term sometimes used in the discussions of data from the longitudinal study, to describe the ability to predict the subsequent observations from previous values. In

McNemar's test  is the test for comparing proportions in data involving the paired samples. The test statistic can be given by   it is most useful when the data have a symmetri

You have probably noticed by now that some of the statements of necessary and sufficient conditions sound more natural than others. For example it seems more natural to express "We

Canonical correlation analysis : A process of analysis for investigating the relationship between the two groups of variables, by ?nding the linear functions of one of the sets of

Bivariate boxplot : A bivariate analogue of boxplot in which the inner area contains 50%of the data, and a 'fence' helps to identify the potential outliers. Robust methods or techn