Define the term multicollinearity, Applied Statistics

Question:

(a) (i) Define the term multicollinearity.

(ii) Explain why it is important to guard against multicollinearity.

(b) (i) Sometimes we encounter missing values in databases with a large number of fields. A common method of handling missing values is simply to omit from the analysis the records or fields with missing values. Explain why this may be dangerous.

(ii) Data analysts have turned to methods that would replace the missing value with a value substituted according to various criteria. Briefly give a choice of three possible replacement values for missing data.

(c) Variables tend to have ranges that vary greatly from each other. Data miners should normalise the numerical variables to standardise the scale of effect each variable has on the results. Name two techniques for normalisation and differentiate between each one of them.

(d) The usual measure used to evaluate estimation and prediction models is the mean square error (MSE). Write down the expression for the MSE.

(e) (i) Explain briefly the term measures of variability.

(ii) Give four examples of typical measures of variability.
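
As a rough illustration of the quantities named in parts (c), (d) and (e), the sketch below shows two common normalisation techniques (min-max scaling and z-score standardisation), one common form of the MSE, and four typical measures of variability. The function names and the sample figures are assumptions made for this example and do not come from the question. The MSE here divides by the number of observations n; some regression texts divide by the degrees of freedom instead (for example n - m - 1 for m predictors), so check which convention your course uses.

```python
from statistics import mean, pstdev, pvariance


def min_max_normalise(values):
    """Rescale values linearly onto [0, 1] using (x - min) / (max - min)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("min-max scaling is undefined for a constant variable")
    return [(x - lo) / (hi - lo) for x in values]


def z_score_standardise(values):
    """Centre on the mean and divide by the (population) standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        raise ValueError("z-scores are undefined for a constant variable")
    return [(x - mu) / sigma for x in values]


def mean_squared_error(actual, predicted):
    """Average squared difference between actual and predicted values (divides by n)."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum((y - y_hat) ** 2 for y, y_hat in zip(actual, predicted)) / len(actual)


def spread_summary(values):
    """Four typical measures of variability for a list of numbers."""
    mu = mean(values)
    return {
        "range": max(values) - min(values),
        "variance": pvariance(values),
        "std_dev": pstdev(values),
        "mean_abs_deviation": sum(abs(x - mu) for x in values) / len(values),
    }


if __name__ == "__main__":
    sample = [20, 30, 40, 50, 60]  # hypothetical values, for illustration only
    print(min_max_normalise(sample))
    print(z_score_standardise(sample))
    print(mean_squared_error([3, 5, 7], [2, 5, 9]))
    print(spread_summary(sample))
```

Min-max scaling fixes the output range but is sensitive to extreme values, whereas z-scores have mean 0 and standard deviation 1 but no fixed bounds; that contrast is the usual basis for differentiating the two.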

Posted Date: 11/20/2013 5:26:30 AM | Location : United States






