Define the term multicollinearity, Applied Statistics

Question:

(a)
(i) Define the term multicollinearity.

(ii) Explain why it is important to guard against multicollinearity.

(b) (i) Sometimes we encounter missing values in databases with a large number of fields. A common method of handling missing values is simply to omit from the analysis the records or fields with missing values. Explain why this may be dangerous.

(ii) Data analysts have turned to methods that would replace the missing value with a value substituted according to various criteria. Briefly give a choice of three possible replacement values for missing data.

(c) Variables tend to have ranges that vary greatly from each other. Data miners should normalise the numerical variables to standardise the scale of effect each variable has on the results. Name two techniques for normalisation and differentiate between each one of them.

(d) The usual measure used to evaluate estimation and prediction models is the mean square error (MSE). Write down the expression for the MSE.

(e) (i) Explain briefly the term measures of variability.
(ii) Give four examples of typical measures of variability.

Posted Date: 11/20/2013 5:26:30 AM | Location : United States







Related Discussions:- Define the term multicollinearity, Assignment Help, Ask Question on Define the term multicollinearity, Get Answer, Expert's Help, Define the term multicollinearity Discussions

Write discussion on Define the term multicollinearity
Your posts are moderated
Related Questions
Related Positional Measures Besides median, there are other measures which divide a series into equal parts. Important amongst these are quartiles, deciles and percentiles.

The file Midterm Data.xls has a tab labeled "Income Data 2009". This data is collected income data from a sample of 400 people in 2009. Use a hypothesis test to see whether the av

For the circuit shown below; Write a KCL equation for Node A, Node B, Node C and Node D. Write a KVL equation for Loop 1, Loop 2 and Loop 3.   A simple circ

.what job can you after offering that course

A researcher is interested in comparing the effectiveness of three different parts of therapy for anger problems. 8 participants are randomly assigned to 3 treatment conditions: Co

Ask 3. Precision Manufacturing has a government contract to produce stainless steel rods for use in military aircraft. Each rod is required to be 20 millimeters in diameter. Each

Chebychev inequality

Variance The term variance was used to describe the square of the standard deviation by R.A.Fisher. The concept of variance is highly important in areas where it is possible to

#In planning the teaching assignments for next semester, Mr. Hinton must have a teacher in each of the 7 grades during each of the 6 periods of the day. If he has 10 teachers to ch

Application of the chi Square Test