Compute decision boundary for linear discriminant analysis

Reference no: EM131296960

STA 141A Fall 2016 Homework

1. This problem is a basic illustration of using the bootstrap for inference.

(i) Generate a random sample of size n = 100 following the univariate regression model

Y_i = -5 + 2 X_i + ε_i,   i = 1, ..., n,

where the X_i are independent chi-square random variables with 6 degrees of freedom, and the ε_i are i.i.d. N(0, σ^2) with σ = 1. Fix a random seed to ensure that the results are reproducible.

(ii) Fit the least squares regression line to the data and obtain the estimates of (β_0, β_1, σ^2).

(iii) Obtain resampling-based 95% confidence intervals for β_0 and β_1 using a parametric (i.e., residual-based) bootstrap procedure with 400 bootstrap replicates.

(iv) How do the confidence intervals in (iii) compare with the theoretical confidence intervals for β_0 and β_1? [To compare the accuracy of the confidence intervals, repeat steps (i)-(iii) 10 times (using a different random seed for each simulation run) and report the average lengths of the bootstrap confidence intervals and of the corresponding theoretical confidence intervals.]
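
One possible way to organize steps (i)-(iv) in R is sketched below. It is only an illustration under stated assumptions, not the required solution: the function name one_run, the seeds 1 to 10, and the use of percentile intervals are choices made here, since the problem does not say which type of bootstrap interval to report. The sketch resamples the fitted residuals with replacement while holding the X_i fixed; a strictly parametric variant would instead draw new errors from N(0, estimated σ^2).

```r
# Hypothetical helper: one simulation run covering steps (i)-(iii).
one_run <- function(seed, n = 100, B = 400) {
  set.seed(seed)                           # reproducibility
  x <- rchisq(n, df = 6)                   # X_i ~ chi-square with 6 df
  y <- -5 + 2 * x + rnorm(n, sd = 1)       # Y_i = -5 + 2 X_i + eps_i

  fit  <- lm(y ~ x)                        # least squares fit
  res  <- resid(fit)
  yhat <- fitted(fit)

  # Residual bootstrap: keep x fixed, resample residuals, refit.
  boot_coef <- replicate(B, {
    y_star <- yhat + sample(res, n, replace = TRUE)
    coef(lm(y_star ~ x))
  })                                       # 2 x B matrix of coefficients

  # Percentile 95% intervals (an assumed choice of interval type).
  boot_ci   <- apply(boot_coef, 1, quantile, probs = c(0.025, 0.975))
  theory_ci <- t(confint(fit, level = 0.95))   # t-based intervals

  list(coef       = coef(fit),
       sigma2     = summary(fit)$sigma^2,  # RSS / (n - 2)
       boot_len   = boot_ci[2, ] - boot_ci[1, ],
       theory_len = theory_ci[2, ] - theory_ci[1, ])
}

# Step (iv): 10 runs with different seeds, then average the interval lengths.
runs <- lapply(1:10, one_run)
avg_boot_len   <- rowMeans(sapply(runs, `[[`, "boot_len"))
avg_theory_len <- rowMeans(sapply(runs, `[[`, "theory_len"))
```

Printing runs[[1]]$coef and runs[[1]]$sigma2 gives the estimates asked for in (ii) for the first run, and comparing avg_boot_len with avg_theory_len gives the comparison asked for in (iv).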

2. In this problem, compare the k-NN classification method, linear discriminant analysis and logistic regression in a two-class classification problem. For this, consider the iris data set available in R.

(i) Extract the data corresponding to flower types setosa and versicolor, numbering a total of 100 flowers. Set aside the last 10 measurements for each flower type as test data and use the remaining data consisting of 80 measurements as training data.

(ii) Fit a logistic regression model to the training data, using the variable Sepal.Length as predictor. Obtain the estimates of the model parameters. Compute the confusion matrix for the test data set.

(iii) Compute the decision boundary for linear discriminant analysis, using Sepal.Length as the predictor variable. Compute the confusion matrix for the test data set.

(iv) Use the k-nearest neighbors classification method with k = 3, 4, 5, again using Sepal.Length as the predictor variable. In each case, compute the confusion matrix for the test data set.

(v) Write a very brief summary of the comparative performance of the different classification procedures.
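
The workflow in parts (i)-(iv) could be set up roughly as follows. This is a hedged sketch rather than a definitive solution: the object names, the 0.5 probability cutoff for logistic regression, and the seed are assumptions made here. It relies on the MASS and class packages, which ship with standard R installations. For LDA with a single predictor, a pooled variance and equal class proportions in the training data (40 and 40 here), the decision boundary reduces to the midpoint of the two class means.

```r
library(MASS)    # lda()
library(class)   # knn()

# (i) Keep setosa and versicolor (rows 1-50 and 51-100 of iris).
two <- droplevels(iris[iris$Species %in% c("setosa", "versicolor"),
                       c("Sepal.Length", "Species")])
test_idx <- c(41:50, 91:100)          # last 10 rows of each species
train <- two[-test_idx, ]
test  <- two[test_idx, ]

# (ii) Logistic regression; glm models P(versicolor), the second factor level.
logit_fit  <- glm(Species ~ Sepal.Length, data = train, family = binomial)
p_versi    <- predict(logit_fit, newdata = test, type = "response")
logit_pred <- factor(ifelse(p_versi > 0.5, "versicolor", "setosa"),
                     levels = levels(test$Species))   # assumed 0.5 cutoff
cm_logit   <- table(predicted = logit_pred, actual = test$Species)

# (iii) LDA; with equal priors the boundary is the midpoint of the class means.
lda_fit      <- lda(Species ~ Sepal.Length, data = train)
lda_boundary <- mean(lda_fit$means[, "Sepal.Length"])
lda_pred     <- predict(lda_fit, newdata = test)$class
cm_lda       <- table(predicted = lda_pred, actual = test$Species)

# (iv) k-NN for k = 3, 4, 5; knn() breaks ties at random, so fix a seed.
set.seed(1)
knn_cms <- lapply(c(3, 4, 5), function(k) {
  pred <- knn(train = train["Sepal.Length"], test = test["Sepal.Length"],
              cl = train$Species, k = k)
  table(predicted = pred, actual = test$Species)
})
names(knn_cms) <- paste0("k=", 3:5)
```

Here coef(logit_fit) returns the parameter estimates requested in (ii), and cm_logit, cm_lda and the entries of knn_cms are the confusion matrices to compare in (v).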

Reference

1. James, G., Witten, D., Hastie, T. and Tibshirani, R. (2013). An Introduction to Statistical Learning with Applications in R. Springer. [Chapters 3, 4 & 5].


Submit the assignment both electronically and as a printed copy. The electronic submission must be a compressed archive (with extension .zip, .7z, etc.) containing two files: (i) a description of your analysis (as appropriate); (ii) the code used. Honor Code: “The codes and results derived by using these codes constitute my own work. I have consulted the following resources regarding this assignment:” (ADD: names of persons or web resources, if any, excluding the instructor, TAs, and materials posted on the course website).
