Create a custom chi-square function in MATLAB

Introduction and Purpose

In the lectures and tutorials you have been introduced to a number of probability distributions. You have also learned that modelling any system, such as a communication system, requires analysis of the input data fed to the model. Random variables play an important role in developing a model, as they represent the input(s) to the modelled system. Before random variables can be used in the modelling process, they must be analysed and tested to verify that they closely fit the real-world input. To do this, a 'Goodness of Fit' test is applied to a data sample in order to accept or reject a hypothesis. The hypothesis generally states that a particular data sample conforms to a certain probability distribution.
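
As a reminder of the form such a test can take, the chi-square goodness-of-fit statistic compares the observed bin counts with the counts expected under the hypothesised distribution. The notation below (k bins, observed counts O_i, expected counts E_i, and m parameters estimated from the data) is introduced here for illustration and may differ from the notation used in the lectures:

```latex
\chi^2 = \sum_{i=1}^{k} \frac{(O_i - E_i)^2}{E_i},
\qquad
\text{degrees of freedom} = k - 1 - m
```

The hypothesis is rejected at a chosen significance level when the statistic exceeds the tabulated critical value for those degrees of freedom.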

The purpose of this assignment is to determine, via statistical analysis, the probability distributions of the numerical data contained in two .csv (comma-separated values) files.

Scenario and advice

There are two data files, Data(a).csv and Data(b).csv. The files have been logged by a communications device and represent the input to a system. In order to determine the effects on the output of the system, we need to be able to determine their probability distributions. If you read the files in a text editor, it should be apparent that one may be a continuous distribution and the other possibly a discrete distribution (however, the types of distribution are unknown). To resolve this, it is suggested that you carry out the following:

Minimum objectives

1. For both files, read them into MATLAB (one at a time) using the csvread('filename') function and save them to a vector.

2. Calculate the mean and standard deviation using either custom functions or the functions built into MATLAB.

3. Create a q-q plot of the data using the qqplot(x) function (this should help you to guess the data's distribution).

4. Guess a probability distribution (create a null hypothesis) and create a dataset with approximately the same number of elements as the data provided in the data files.

5. Create a custom Chi-Square function in MATLAB.

6. Carry out a Chi-Square analysis of the data (reference tables are in chi-square-table.pdf).

7. Complete a regression analysis (if appropriate) and determine the equation of the line (please refer to lecture 4 for an example).

8. Report on all your findings.
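
Objective 5 could be approached along the following lines. This is only a minimal sketch under stated assumptions, not a model answer: the function name chiSquareStat, its argument names, and the optional nEstimated argument are all hypothetical choices made for illustration. It assumes the data have already been binned, so that observed holds the counts per bin and expected holds the counts predicted by the hypothesised distribution for the same bins.

```matlab
% chiSquareStat.m  (hypothetical file and function name)
% Minimal sketch of a chi-square goodness-of-fit statistic.
%
% Inputs:
%   observed   - vector of observed counts, one entry per bin
%   expected   - vector of expected counts under the null hypothesis
%   nEstimated - (optional) number of distribution parameters that were
%                estimated from the data; defaults to 0
% Outputs:
%   chi2 - the chi-square statistic
%   dof  - degrees of freedom, (number of bins) - 1 - nEstimated
function [chi2, dof] = chiSquareStat(observed, expected, nEstimated)
    if nargin < 3
        nEstimated = 0;
    end

    % Work with column vectors so row and column inputs behave alike
    observed = observed(:);
    expected = expected(:);

    if numel(observed) ~= numel(expected)
        error('chiSquareStat:sizeMismatch', ...
              'observed and expected must have the same number of bins.');
    end
    if any(expected <= 0)
        error('chiSquareStat:badExpected', ...
              'Every expected count must be positive.');
    end

    % Sum of squared deviations, each scaled by its expected count
    chi2 = sum((observed - expected).^2 ./ expected);
    dof  = numel(observed) - 1 - nEstimated;
end
```

For instance, [chi2, dof] = chiSquareStat([18 22 20 25 15], [20 20 20 20 20]) returns a statistic of 2.9 with 4 degrees of freedom, which can then be compared with the critical value in the reference tables. Observed counts might be obtained with histcounts, with the expected counts derived from the hypothesised distribution over the same bin edges.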

Additional objectives

1. Using the results obtained above, select a data distribution and sample it using the Student's t distribution (please refer to slide 21 in lecture 2).

2. Demonstrate graphically how, as the number of degrees of freedom increases, the Student's t distribution approximates the normal distribution and plot the variance between the curves.

3. Verify your findings from minimum objective 6 above using the Kolmogorov-Smirnov test (kstest(x)).

4. Report on all your findings.
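
The graphical comparison in additional objective 2 might be sketched as follows. The degrees of freedom in dofList and the plotting range are illustrative assumptions, and "variance between the curves" is interpreted here as the pointwise difference between each t density and the standard normal density, which may not be the reading intended in the lectures. The t density is written out with gammaln so that the sketch needs only base MATLAB.

```matlab
% Sketch: Student's t density versus the standard normal density.
t         = linspace(-5, 5, 501);      % evaluation grid (illustrative range)
dofList   = [1 2 5 10 30];             % illustrative degrees of freedom
normalPdf = exp(-t.^2 / 2) / sqrt(2*pi);
maxGap    = zeros(size(dofList));      % largest absolute gap for each dof

figure;
axPdf = subplot(2, 1, 1); hold(axPdf, 'on');
axGap = subplot(2, 1, 2); hold(axGap, 'on');
plot(axPdf, t, normalPdf, 'k--', 'LineWidth', 1.5, ...
     'DisplayName', 'standard normal');

for k = 1:numel(dofList)
    v = dofList(k);
    % Log of the normalising constant; gammaln avoids overflow for large v
    logC = gammaln((v + 1) / 2) - gammaln(v / 2) - 0.5 * log(v * pi);
    tPdf = exp(logC - ((v + 1) / 2) * log(1 + t.^2 / v));

    gap       = tPdf - normalPdf;      % pointwise difference from the normal
    maxGap(k) = max(abs(gap));

    plot(axPdf, t, tPdf, 'DisplayName', sprintf('t, dof = %d', v));
    plot(axGap, t, gap,  'DisplayName', sprintf('dof = %d', v));
end

legend(axPdf, 'show'); xlabel(axPdf, 't'); ylabel(axPdf, 'density');
title(axPdf, 'Student''s t density versus the standard normal');
legend(axGap, 'show'); xlabel(axGap, 't'); ylabel(axGap, 't density - normal density');
title(axGap, 'Difference between each t density and the normal density');
```

The maxGap vector shrinks as the degrees of freedom grow, which gives a simple numerical summary to report alongside the plots.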

Deliverables

The following must be submitted by the date outlined above:

An individual report (Times New Roman, 12 pt, single line spacing) that outlines your solution and the development of your MATLAB simulation. As a minimum, it should include an explanation of your MATLAB implementation, screen captures of the MATLAB plots, and a summary of the input analysis, including a quantile-quantile plot, a plot of the sample distribution, and any functions you have developed (in the appendices). The report should approximate the following structure: Title page, Contents page, List of figures, Introduction, MATLAB Implementation, Input Analysis, Conclusions and Appendices (make sure all MATLAB code is clearly available in the appendices).

Note about Plagiarism

- Any sentences, including definitions, that are copied word for word must be in quotation marks and cite the source(s);

- Any figures copied must include citations to their sources;

- Any code that is taken from any source (textbook, WWW, journals, etc.) must be fully acknowledged.



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd