Missing data - reasons for screening data, Advanced Statistics

Missing Data - Reasons for screening data

In case of any missing data, the researcher needs to conduct tests to ascertain that the pattern of these missing cases is random.

Create dichotomous variable - non-missing vs missing for a specific variable. Run a simple independent samples t-test on a different variable in the collected sample to see if there are any significant differences.

Handling missing values:

1. Delete missing data (good idea if there are only a few missing cases)

2. Delete variables containing missing values (good idea if most of the missing values are concentrated to only a couple of variables. Still problematic if they are important to the ultimate goal of the research)

3. Estimate missing values

4. Prior knowledge

5. Replace missing values with the mean (main concern: lowers the calculated variance as compared to the unknown actual variance)
One variation involves using group means for missing values for cases involving group comparison analysis

6. Regression approach: use several IVs to explain the DV (that includes several missing values). Predict missing values using IV values.

7. Concerns include finding proper IVs that explain DV, estimates obtained from prediction more consistent with the scores used to predict them compared to the real values.

8. When we use any of the techniques described above, as a researcher we have to ascertain that our solution hasn't changed the results of the analysis (run the tests, with and without the treatment).

Posted Date: 3/4/2013 6:07:24 AM | Location : United States







Related Discussions:- Missing data - reasons for screening data, Assignment Help, Ask Question on Missing data - reasons for screening data, Get Answer, Expert's Help, Missing data - reasons for screening data Discussions

Write discussion on Missing data - reasons for screening data
Your posts are moderated
Related Questions
The method or technique for producing the sequence of parameter estimates that, under the mild regularity conditions, converges to maximum likelihood estimator. Of particular signi

I do have a data of real gdp for each state and from 2000 to 2010 and I also have estimated population of illigel immigrants for each state from 2000 to 2010. In my thesis I am try

This graph for Cross Correlation Function for RES1, RES1 shows that there is possibly negative autocorrelation as there are alternating spikes; also the first spike is negative whi

Probability weighting is the procedure of attaching weights equal to inverse of the probability of being selected, to each respondent's record in the sample survey. These weights

Generalized method of moments (gmm) is the estimation method popular in econometrics which generalizes the method of the moments estimator. Essentially same as what is known as the

1. define statistical algorithms 2. write the flow charts for statistical algorithms for sums, squares and products. 3. write flow charts for statistical algorithms to generates ra

Wilcoxon's ranksum test is the distribution free method or technique used as an alternative to the Student's t-test for assessing whether two populations have the same location. G

The values assigned to factors for the individual sample units in a factor analysis. The most common approach is "regression method". When the factors are seen as the random variab

Poisson regression In case of Poisson regression we use ηi = g(µi) = log(µi) and a variance V ar(Yi) = φµi. The case φ = 1 corresponds to standard Poisson model. Poisson regre