Define the term multicollinearity, Applied Statistics

Question:

(a)
(i) Define the term multicollinearity.

(ii) Explain why it is important to guard against multicollinearity.

(b) (i) Sometimes we encounter missing values in databases with a large number of fields. A common method of handling missing values is simply to omit from the analysis the records or fields with missing values. Explain why this may be dangerous.

(ii) Data analysts have turned to methods that would replace the missing value with a value substituted according to various criteria. Briefly give a choice of three possible replacement values for missing data.

(c) Variables tend to have ranges that vary greatly from each other. Data miners should normalise the numerical variables to standardise the scale of effect each variable has on the results. Name two techniques for normalisation and differentiate between each one of them.

(d) The usual measure used to evaluate estimation and prediction models is the mean square error (MSE). Write down the expression for the MSE.

(e) (i) Explain briefly the term measures of variability.
(ii) Give four examples of typical measures of variability.

Posted Date: 11/20/2013 5:26:30 AM | Location : United States







Related Discussions:- Define the term multicollinearity, Assignment Help, Ask Question on Define the term multicollinearity, Get Answer, Expert's Help, Define the term multicollinearity Discussions

Write discussion on Define the term multicollinearity
Your posts are moderated
Related Questions
The investor has constant wealth 1 and is o?ered to invest in shares of a project that either gains 3=2 or loses 1 with equal probabilities. Therefore, if the investor obtains sha

give me question on mean is the aimplest average to understand and easy to compute

What is an interaction? Describe an example and identify the variables within your population (work, social, academic, etc.) for which you might expect interactions?

Assumption of extrapolation

A consumer preference study involving three different bottle designs (A, B, and C) for the jumbo size of a new liquid detergent was carried out using a randomized block experimenta

Regression Coefficient While analysing regression in two related series, we calculate their regression coefficients also. There are two regression coefficients like two regress

Exercise: (Binomial and Continuous Model.) Consider a binomial model of a risky asset with the parameters r = 0:06, u = 0:059, d =  0:0562, S0 = 100, T = 1, 4t = 1=12. Note that u

There are 15 types of ice cream: A,B,C,D,E,F,G,H,I,J,K,L,M,N, and O. How many combinations are there to sample 5 flavors if you sample 1 flavor 4 times? How many combinations are t

application of chi square test in civil engineering

Consider an MBA program as a processing network where the flow unit consists of a student in the program.  Suppose the organizations that hire and promote MBAs are considered to be