Create a custom chi-square function in matlab, MATLAB in Statistics

Assignment Help:

Introduction and Purpose

In the lectures and tutorials you have been introduced to a number of probability distributions. You have also learned that modelling any system (such as communication systems), requires the analysis of the input data to be fed to the model. Random variables play an important role in the development of a model, as they represent the input(s) to the modelled system. In order to use random variables in the modelling process, they need to be analysed and tested to verify that they represent a close fit to the real-world input. In order to do that, a 'Goodness of Fit' test is applied to a data sample in order to accept or reject a certain hypothesis. A hypothesis, generally states that a particular data sample conforms to a certain probability distribution.

The purpose of this assignment is to determine via statistical analysis the probability distributions of the numerical data contained in two *.csv (comma separated) files.

Scenario and advice

There are two data files, Data(a).csv and Data(b).csv. The files have been logged by a communications device and represent the input to a system. In order to determine the effects on the output of the system, we need to be able to determine their probability distributions. If you read the files in a text editor, it should be apparent that one maybe a continuous distribution and the other possibly a discrete distribution (however the type of distributions are unknown. To resolve this it is suggested that you carry out the following:

Minimum objectives

1. For both files, read them into MATLAB (one at a time) using the csvread('filename') function and save them to a vector.

2. Calculate the mean and standard deviation using either custom functions or the functions built into MATLAB.

3. Create a q-q plot of the data using the qqplot(x) function (this should help you to guess the data's distribution).

4. Guess a probability distribution (create a null hypothesis) and create a dataset with approximately the same number of elements as the data provided in the data files.

5. Create a custom Chi-Square function in MATLAB

6. Carry out a Chi-Square analysis of the data (reference tables are on chi-square-table.pdf)

7. Complete a regression analysis (if appropriate) and determine the equation of the line (please refer to lecture 4 for an example)

8. Report on all your findings.

Additional objectives

1. Using the results obtained above select a data distribution and sample it using the students t distribution (please refer to slide 21 in lecture 2)

2. Demonstrate graphically how, as the number of degrees of freedom increases, the student's t distribution approximates the normal distribution and plot the variance between the curves.

3. Verify your findings in 6 above using the Kolmogorov-Smirnov test (kstest(x)).

4. Report on all your findings.

Deliverables

The following must be submitted by the date outlined above:

An individual report (times new roman pt 12, single lined spaced) that outlines your solution and the development of your MATLAB simulation. It should include as a minimum an explanation of your MATLAB implementation, screen captures of the MATLAB plots, and also a summary of the input analysis including a plot of the quantile-quantile graph, a plot of the sample distribution and any functions you have developed in appendices. The report should approximate the following structure: Title page, Contents page, List of figures, Introduction, MATLAB Implementation, Input Analysis, Conclusions and Appendices (make sure all MATLAB code is clearly available in the appendices) 

Note about Plagiarism

§  Any sentences, including any definitions that are copied word for word are in quotation marks and cite the source(s);

§  Any figures copied include citations to sources;

§  Any code that is taken from any source (text book, WWW, journals etc) is fully acknowledged.


Related Discussions:- Create a custom chi-square function in matlab

Location of a rectangle - graphics objects, Location of a rectangle - graph...

Location of a rectangle - graphics objects: The location of a rectangle is [x y w h], where x and y are the coordinates of the lower left point, here w is the width, and h is

Functions which complete a task without returning values, Functions which c...

Functions which complete a task without returning Values: Most of the functions do not compute values, but instead of accomplish a task like printing formatted output. As thes

Opening and closing a file, Opening and Closing a File: The Files are ...

Opening and Closing a File: The Files are opened with the fopen function. By the default, fopen function opens a file for reading. If the other mode is preferred, a permission

Polynomials, Polynomials: Simple curves are polynomials of various deg...

Polynomials: Simple curves are polynomials of various degrees, or orders. The degree is the integer of the highest exponent in the expression. The illustrations are as follows

Function polyval - interpolation, Function polyval - interpolation: Th...

Function polyval - interpolation: The function polyval can then be used to compute the polynomial at particular values. For illustration, we could compute at every value in th

Illustration of reading from files, Illustration of reading from files: ...

Illustration of reading from files: For illustration, suppose that there is a data file 'subjexp.dat' that has on each line a number followed by thecharacter code. The type fu

Logical errors, Logical errors: The Logical errors are more complicate...

Logical errors: The Logical errors are more complicated to locate, as they do not result in any error message. The logical error is a mistake in reasoning by the programmer, b

Steps for input output functions - lower level file, Steps for input output...

Steps for input output functions - Lower level file: The steps involved are as shown below:  Open the file.  Read the file, write to the file, or append to the file.

Example of median, Example of Median For the vector [1 4 5 9 12 33], th...

Example of Median For the vector [1 4 5 9 12 33], the median is the average of the 5 & 9 in the middle: >> median([1 4 5 9 12 33]) ans =     7

Execution of persistent variables, Execution of persistent variables: ...

Execution of persistent variables: The functions can also be called from the script or from the Command Window, as shown here. For illustration, the functions are called first

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd