Define the term multicollinearity, Applied Statistics


(a) (i) Define the term multicollinearity.

(ii) Explain why it is important to guard against multicollinearity.
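
As a rough illustration for the multicollinearity question: multicollinearity is the condition in which predictor variables in a regression are strongly correlated with one another, which tends to make the coefficient estimates unstable and hard to interpret. The sketch below is only a minimal example under stated assumptions: x1 and x2 are made-up predictors, and the helper names are hypothetical. It computes the Pearson correlation between the two predictors and the variance inflation factor, which in the two-predictor case reduces to VIF = 1 / (1 - r^2).

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def vif_two_predictors(x1, x2):
    """Variance inflation factor when there are exactly two predictors."""
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r * r)

# Made-up data: x2 is roughly twice x1, so the predictors carry
# almost the same information.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0]
x2 = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x1, x2)
vif = vif_two_predictors(x1, x2)
print(f"r = {r:.4f}, VIF = {vif:.1f}")
```

A commonly quoted rule of thumb treats a VIF above roughly 5 or 10 as a warning sign, although the exact threshold is a matter of judgement.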

(b) (i) Sometimes we encounter missing values in databases with a large number of fields. A common method of handling missing values is simply to omit from the analysis the records or fields with missing values. Explain why this may be dangerous.

(ii) Data analysts have turned to methods that would replace the missing value with a value substituted according to various criteria. Briefly give a choice of three possible replacement values for missing data.
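
Three replacement values often suggested for (b)(ii) are a constant chosen by the analyst, the field mean (or the mode for a categorical field), and a value drawn at random from the observed values of the field. The following is a minimal sketch of those three options, not a definitive implementation: the function name impute, the use of None to mark a missing value, and the ages data are all assumptions made for illustration.

```python
import random
import statistics

def impute(values, strategy, constant=0.0, seed=0):
    """Replace None entries using one of three simple strategies."""
    observed = [v for v in values if v is not None]
    if strategy == "constant":
        fill = lambda: constant                 # analyst-chosen constant
    elif strategy == "mean":
        field_mean = statistics.mean(observed)  # mean of the observed values
        fill = lambda: field_mean
    elif strategy == "random":
        rng = random.Random(seed)               # seeded for repeatability
        fill = lambda: rng.choice(observed)     # draw from observed values
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return [fill() if v is None else v for v in values]

# Made-up field with two missing entries.
ages = [23.0, None, 31.0, 40.0, None]

by_constant = impute(ages, "constant", constant=0.0)
by_mean = impute(ages, "mean")
by_random = impute(ages, "random")
```

Each strategy changes the data in a different way. Mean replacement, for example, shrinks the spread of the field, which is one reason the choice should be reported alongside the results.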

(c) Variables tend to have ranges that vary greatly from each other. Data miners should normalise the numerical variables to standardise the scale of effect each variable has on the results. Name two techniques for normalisation and differentiate between each one of them.
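
Two techniques usually given for (c) are min-max normalisation, which rescales a variable to the interval [0, 1] using its minimum and maximum, and z-score standardisation, which subtracts the mean and divides by the standard deviation so that the result has mean 0 and standard deviation 1. The sketch below shows both on a made-up incomes field. The function names are hypothetical, and the sample standard deviation is assumed for the z-score.

```python
import statistics

def min_max_normalise(values):
    """Rescale to [0, 1]; assumes the values are not all identical."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def z_score_standardise(values):
    """Centre on the mean and scale by the sample standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return [(v - mean) / sd for v in values]

# Made-up data for illustration.
incomes = [20000.0, 35000.0, 50000.0, 80000.0]

scaled = min_max_normalise(incomes)
standardised = z_score_standardise(incomes)
```

The main difference is that min-max output is bounded and depends entirely on the two extreme values, whereas z-scores are unbounded and express each value as a number of standard deviations from the mean.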

(d) The usual measure used to evaluate estimation and prediction models is the mean square error (MSE). Write down the expression for the MSE.
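
For (d), the form most often quoted is MSE = (1/n) * sum of (y_i - yhat_i)^2 over the n records, where y_i is the actual value and yhat_i is the predicted value. Some regression texts divide by n - m - 1 instead, where m is the number of predictors, so it is worth checking which convention the course uses. A minimal sketch of the 1/n version, with a hypothetical function name and made-up data:

```python
def mean_squared_error(actual, predicted):
    """Average of the squared differences between actual and predicted."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return sum(squared_errors) / len(actual)

# Made-up values: the errors are 1, 0, -1 and -2.
actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.0, 5.0, 8.0, 11.0]

mse = mean_squared_error(actual, predicted)
print(mse)  # squared errors 1, 0, 1, 4 sum to 6, so this prints 1.5
```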

(e) (i) Explain briefly the term measures of variability.
(ii) Give four examples of typical measures of variability.
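
Measures of variability describe how spread out the values of a variable are, as opposed to where their centre lies. Typical examples for (e)(ii) include the range, the variance, the standard deviation, the interquartile range and the mean absolute deviation. The sketch below computes them with Python's standard statistics module on made-up data. The sample (n - 1) versions of the variance and standard deviation are assumed, and the quartiles use the module's default method.

```python
import statistics

# Made-up data for illustration.
data = [2.0, 4.0, 4.0, 5.0, 7.0, 9.0, 11.0]

value_range = max(data) - min(data)           # largest minus smallest
variance = statistics.variance(data)          # sample variance (n - 1)
std_dev = statistics.stdev(data)              # square root of the variance

q1, _median, q3 = statistics.quantiles(data, n=4)
iqr = q3 - q1                                 # spread of the middle half

mean = statistics.mean(data)
mean_abs_dev = sum(abs(v - mean) for v in data) / len(data)

print(value_range, variance, std_dev, iqr, mean_abs_dev)
```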

Posted Date: 11/20/2013 5:26:30 AM | Location : United States
