Already have an account? Get multiple benefits of using own account!
Login in your account..!
Remember me
Don't have an account? Create your account in less than a minutes,
Forgot password? how can I recover my password now!
Enter right registered email to receive password!
Question:
(a) (i) Define the term multicollinearity.
(ii) Explain why it is important to guard against multicollinearity.
(b) (i) Sometimes we encounter missing values in databases with a large number of fields. A common method of handling missing values is simply to omit from the analysis the records or fields with missing values. Explain why this may be dangerous.
(ii) Data analysts have turned to methods that would replace the missing value with a value substituted according to various criteria. Briefly give a choice of three possible replacement values for missing data.
(c) Variables tend to have ranges that vary greatly from each other. Data miners should normalise the numerical variables to standardise the scale of effect each variable has on the results. Name two techniques for normalisation and differentiate between each one of them.
(d) The usual measure used to evaluate estimation and prediction models is the mean square error (MSE). Write down the expression for the MSE.
(e) (i) Explain briefly the term measures of variability. (ii) Give four examples of typical measures of variability.
Histogram: It is generally used for charting continuous frequency distribution. In histogram, data are plotted as a series of rectangle one over the other. Class intervals
(a) Average rainfall during the month of January is found to be 58 mm. A Class A pan evaporation recorded an average of 8.12 mm/day near an irrigation reservoir. The average
You want to know the thoughts of air travelers in fields such as tickets, comffort, safety, securuty, services and economic growth. You are given a database and 20 questions to ask
Suppose the money supply process is now represented by the following function: where m measures the sensitivity of money supply with respect to the interest rate. (i) Us
The following table shows the results of fitting a linear regression model of starting annual salaries on a constant, GPA (4 point scale), and a variable (Metrics =1) indicating wh
b. A paper mill produces two grades of paper viz., X and Y. Because of raw material restrictions, it cannot produce more than 400 tons of grade X paper and 300 tons of grade Y
10. If a set of scores has a sample mean of 25 and a sample variance of 4, find the following: a. the z-score for a raw score of 31 b. the z-score for a raw score of 18 c. the raw
Root Mean Square Deviation The standard deviation is also called the ROOT MEAN SQUARE DEVIATION. This is because it is the ROOT (Step 4) of the MEAN (Step 3) o
Question: (a) (i) Define the term multicollinearity. (ii) Explain why it is important to guard against multicollinearity. (b) (i) Sometimes we encounter missing values
A new weight-watching company, Weight Reducers International, advertises that those who join will lose, on the average, 10 pounds the first two weeks with a standard deviation of 2
Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!
whatsapp: +91-977-207-8620
Phone: +91-977-207-8620
Email: [email protected]
All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd