Reasons for screening data, Advanced Statistics

  • Garbage in, garbage out: an analysis is only as trustworthy as the data fed into it.
  • Missing data
    a. The amount of missing data is less crucial than its pattern.
    b. If missing values are randomly scattered, they are not a problem; nonrandom patterns limit the use of the data (the results can't be generalized).
  • Extreme values or outliers: cases with extreme values on one variable, or on a combination of variables, that can potentially distort the results of the analysis.
  • Ascertain that the data fulfill the basic assumptions of the statistical techniques:
    a. Data are normally distributed
    b. Linear relationship between variables
    c. Homoscedasticity
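Several of the screening checks listed above can be sketched in plain Python. This is a minimal illustration rather than a standard procedure: the function names, the `rows` and `scores` data, the use of `None` to mark a missing value, and the z-score cutoff of 3 are all assumptions made for the example. It covers the amount and pattern of missing data, outliers on a single variable, and skewness as a rough first look at normality.

```python
import statistics

def missing_fraction(rows, var):
    """Share of cases in which `var` is missing (recorded here as None)."""
    return sum(1 for r in rows if r[var] is None) / len(rows)

def missingness_gap(rows, var, other):
    """Mean of `other` among cases missing `var`, minus the mean among
    cases that have `var`. A gap far from zero hints that the missing
    values are not randomly scattered."""
    miss = [r[other] for r in rows if r[var] is None and r[other] is not None]
    have = [r[other] for r in rows if r[var] is not None and r[other] is not None]
    return statistics.mean(miss) - statistics.mean(have)

def z_outliers(values, cutoff=3.0):
    """Values whose absolute z-score exceeds `cutoff` (assumed threshold)."""
    m = statistics.mean(values)
    s = statistics.stdev(values)
    return [v for v in values if abs(v - m) / s > cutoff]

def skewness(values):
    """Moment-based skewness; values near 0 are consistent with symmetry,
    which a normal distribution requires (but does not by itself prove)."""
    m = statistics.mean(values)
    s = statistics.pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)

# Hypothetical survey data: income is missing only for the oldest respondents.
rows = [
    {"age": 23, "income": 31}, {"age": 25, "income": 35},
    {"age": 31, "income": 40}, {"age": 38, "income": 52},
    {"age": 44, "income": 58}, {"age": 61, "income": None},
    {"age": 67, "income": None}, {"age": 70, "income": None},
]
# Hypothetical test scores with one extreme value.
scores = [10, 11, 9, 10, 12, 10, 11, 9, 10, 11,
          10, 9, 12, 10, 11, 10, 9, 11, 10, 60]

print("fraction of income missing:", missing_fraction(rows, "income"))
print("age gap (missing vs. present):", missingness_gap(rows, "income", "age"))
print("outliers:", z_outliers(scores))
print("skewness:", skewness(scores))
```

In the hypothetical `rows`, the large age gap between cases with and without income shows a nonrandom pattern, which matters more than the raw share of missing values. Linearity and homoscedasticity are usually examined with scatterplots and residual plots, which this sketch does not attempt.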

[Figure: Reasons for screening data]

Posted Date: 3/4/2013 6:01:09 AM | Location : United States






