Dummy variables, Advanced Statistics

The variables resulting from the recoding categorical variables with more than two categories into the sequence of binary variables. Marital status, for instance, if originally labeled 1 for the married, 2 for single and 3 for divorced, widowed or separated, can be rede?ned in the terms of two variables which are given as follows




Variable 1: 1 if single, 0 otherwise;

Variable 2: 1 if the divorced, widowed or separated, 0 otherwise;


For the married person both the new variables would be zero. In common the categorical variable with k categories would be recorded in the terms of k 1 dummy variables. Such recoding is made in use before polychotomous variables are used as the explanatory variables in a regression analysis to avoid the unreasonable supposition with the original numerical codes for the categories, that is the values 1; 2; ... ; k, correspond to the interval scale. This procedure is generally known as dummy coding

 

Posted Date: 7/27/2012 6:31:52 AM | Location : United States







Related Discussions:- Dummy variables, Assignment Help, Ask Question on Dummy variables, Get Answer, Expert's Help, Dummy variables Discussions

Write discussion on Dummy variables
Your posts are moderated
Related Questions
High-dimensional data : This term used for data sets which are characterized by the very large number of variables and a much more modest number of the observations. In the 21 st

The Null Hypothesis - H0:  γ 1 = γ 2 = ...  =  0  i.e.  there is no heteroscedasticity in the model The Alternative Hypothesis - H1:  at least one of the γ i 's are not equal

Multi dimensional unfolding is the form of multidimensional scaling applicable to both the rectangular proximity matrices where the rows and columns refer to the different sets of

Graphical deception : Statistical graphics which are not as honest as they should be. It is relatively simple. To mislead the unwary with the graphical material. For instance, c

Length-biased sampling : The bias which arises in the sampling scheme based on the visits of patient, when some individuals are more likely to be chosen than others simply because

Change point problems : Problems with chronologically ordered data collected over the period during which there is known to have been a change in the underlying data generation cou

It is the diagram used to display the values graphically in a frequency distribution. The frequencies are graphed as an ordinate against the class mid-points as abscissae. The p

Catastrophe theory : A theory of how little is the continuous changes in the independent variables which can have unexpected, discontinuous effects on the dependent variables. Exam

The computer programs designed to mimic the role of the expert human consultant. This type of systems are capable to cope with the complex problems of the medical decision makin

Reasons for screening data     Garbage in-garbage out     Missing data          a. Amount of missing data is less crucial than the pattern of it. If randomly