Define the term multicollinearity, Applied Statistics

Question:

(a) (i) Define the term multicollinearity.

(ii) Explain why it is important to guard against multicollinearity.
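
As a rough illustration of part (a): multicollinearity arises when predictor variables are correlated with each other, and one simple first check is the pairwise correlation between predictors. The sketch below is only that, a sketch. The data and the names `height_cm`, `height_in` and `lottery_digit` are made up for illustration, and a pairwise correlation will not catch every form of multicollinearity.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical predictors: the same height recorded in two units,
# plus an unrelated column.
height_cm = [150, 160, 170, 180, 190]
height_in = [59.1, 63.0, 66.9, 70.9, 74.8]
lottery_digit = [3, 1, 4, 1, 5]

r_redundant = pearson(height_cm, height_in)      # very close to 1
r_unrelated = pearson(height_cm, lottery_digit)  # much weaker

print(round(r_redundant, 4), round(r_unrelated, 4))
```

Two predictors that carry nearly the same information, as the two height columns do here, make the individual regression coefficients unstable, which is the usual reason for guarding against the problem.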

(b) (i) Sometimes we encounter missing values in databases with a large number of fields. A common method of handling missing values is simply to omit from the analysis the records or fields with missing values. Explain why this may be dangerous.

(ii) Data analysts have turned to methods that would replace the missing value with a value substituted according to various criteria. Briefly give a choice of three possible replacement values for missing data.
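
To make part (b)(ii) concrete, here is a minimal sketch of some commonly used replacement values: a constant chosen by the analyst, the field mean (for numeric fields) or mode (for categorical fields), and a value drawn at random from the observed values. The function name `impute`, its parameters and the `ages` data are assumptions made for this example; they do not come from the question.

```python
import random
from statistics import mean, mode

def impute(values, strategy="mean", constant=0, rng=None):
    """Return a copy of `values` with each None replaced.

    strategy: "constant", "mean", "mode" or "random"
    (random = draw from the observed, non-missing values).
    """
    observed = [v for v in values if v is not None]
    rng = rng or random.Random()
    if strategy == "constant":
        pick = lambda: constant
    elif strategy == "mean":
        fill = mean(observed)
        pick = lambda: fill
    elif strategy == "mode":
        fill = mode(observed)
        pick = lambda: fill
    elif strategy == "random":
        pick = lambda: rng.choice(observed)
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return [pick() if v is None else v for v in values]

# Hypothetical field with two missing entries.
ages = [23, None, 31, 23, 40, None]

print(impute(ages, "mean"))
print(impute(ages, "mode"))
print(impute(ages, "constant", constant=0))
print(impute(ages, "random", rng=random.Random(0)))
```

Each choice distorts the data in a different way. Filling with the mean, for instance, shrinks the spread of the field, while random draws preserve the spread but add noise.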

(c) Variables tend to have ranges that vary greatly from each other. Data miners should normalise the numerical variables to standardise the scale of effect each variable has on the results. Name two techniques for normalisation and differentiate between each one of them.
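
Two techniques often named for part (c) are min-max normalisation and z-score standardisation. The sketch below assumes those are the two intended, which the question does not confirm. It uses the sample standard deviation, and some texts use the population version instead. The `incomes` data are invented.

```python
from statistics import mean, stdev

def min_max(values):
    """Rescale to [0, 1]: (x - min) / (max - min). Assumes max > min."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def z_score(values):
    """Centre on the mean and divide by the sample standard deviation."""
    mu, sigma = mean(values), stdev(values)
    return [(v - mu) / sigma for v in values]

# Hypothetical variable on a large scale.
incomes = [10, 20, 30]

print(min_max(incomes))
print(z_score(incomes))
```

The practical difference: min-max output is bounded between 0 and 1 and depends only on the extreme values, whereas z-scores are unbounded and express each value as a number of standard deviations from the mean.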

(d) The usual measure used to evaluate estimation and prediction models is the mean square error (MSE). Write down the expression for the MSE.
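
For part (d), one common form of the expression is the average squared difference between actual and predicted values, MSE = (1/n) * sum over i of (y_i - yhat_i)^2. The sketch below computes that form. Note the assumption: some regression texts divide by the error degrees of freedom (n minus the number of predictors minus 1) rather than by n, and the question does not say which is expected. The names `mse`, `actual` and `predicted` are illustrative.

```python
def mse(actual, predicted):
    """Mean squared error with divisor n (one common convention)."""
    if len(actual) != len(predicted):
        raise ValueError("inputs must have the same length")
    squared_errors = [(y - y_hat) ** 2 for y, y_hat in zip(actual, predicted)]
    return sum(squared_errors) / len(squared_errors)

# Hypothetical actual values and model predictions.
actual = [3, 5, 7]
predicted = [2, 5, 9]

print(mse(actual, predicted))
```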

(e) (i) Explain briefly the term measures of variability.
(ii) Give four examples of typical measures of variability.
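
As an illustration for part (e), the sketch below computes four measures that are often cited as measures of variability: the range, the variance, the standard deviation and the mean absolute deviation. The population (divide-by-n) versions are used here to keep the numbers simple, which is an assumption, and the `scores` data are invented.

```python
from statistics import mean, pstdev, pvariance

def variability(values):
    """Return four spread measures (population versions) as a dict."""
    mu = mean(values)
    return {
        "range": max(values) - min(values),
        "variance": pvariance(values),
        "std_dev": pstdev(values),
        "mean_abs_dev": sum(abs(v - mu) for v in values) / len(values),
    }

# Hypothetical data set.
scores = [2, 4, 4, 4, 5, 5, 7, 9]

summary = variability(scores)
print(summary)
```

All four describe how spread out the values are around their centre, rather than where the centre lies.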

Posted Date: 11/20/2013 5:26:30 AM | Location : United States
