What is statistical inference, Advanced Statistics

Assignment Help:

What is statistical inference?

 Statistical inference can be defined as the  method of drawing conclusions from data which are subject to random variations. This is based on the mathematical laws of probability. Probability is the branch of statistics, where we make inferences from a finite set of observations to an infinite set of new observations. The finite set of observations is called as Samples and the infinite set is called as populations. Suppose let us consider tossing a few coins. Here the total number of outcomes i.e., getting the heads and tails are called as the sample space. The new observation that is getting particularly heads or tails is the population.

Solutions to statistical inference:

There are two kinds of statistical inferences. One is the estimation, where we use the sample and sample variables to predict the population variables. The second is the hypothesis testing. Here we use the samples and sample variables to test the population and the population variables.

Estimation:

 Estimation can be divided into two types. One is point estimation and the other is interval estimation.  In the point estimation we use the sample variable to estimate the parameter ?, whereas in the interval estimation we use the samples variable to construct an interval which is equated to p where p is the confidence level adopted.

The most common method adopted for point estimation is the maximum likelihood estimation (MLE) which consists of choosing the estimate that maximizes the probability of the statistical material. MLE is the best solution if the statistical material is large. Special cases of MLE are the sample mean dented as E(X) and the relative frequency denoted by P(X=x).

Illustrations  of MLE:

A game is played with a single fair die. A player wins Rs.20 if a 2 turns up and Rs.40 if a 4 turns up, and he losses if a 6 turns up. While he neither wins nor loses if any other face turns up. Find the expected sum of money he can win.

Let X be the random variable denoting the amount he can win. The possible values are 20,40,-30,0

P[X=20] = P(getting 2) = 1/6

P[x=40] = P( getting 4 = )1/6

P[X = -30] = P(getting 6] = 1/6

The remaining probability is ½.

Hence the mean is E(X) = 20(1/6) + 40(1/6) + (-30)(1/6) +0(1/2) = 5.

Hence the expected sum of money he can win is Rs.5

Hypothetical Testing:

A statistical hypothesis is a statement of the numerical value of the population parameter.

The steps involved in solving a statistical hypothesis is

1.       State the null hypothesis Ho

2.       State the alternative hypothesis Ha

3.       Specify the level  of significance α

4.       Determine the critical regions and the appropriate test statistic.

5.       Compute the equivalent test statistic of the observed value of the parameter.

6.       Take the decision either to reject Ho or accept Ho.


Related Discussions:- What is statistical inference

Explain lie factor, Lie factor : A measure suggested by Tufte for judging t...

Lie factor : A measure suggested by Tufte for judging the honesty of the graphical presentation of data. Which can be calculated as follows   The values close to one are desir

Falsediscoveryrate (fdr), The approach of controlling the error rate in an ...

The approach of controlling the error rate in an exploratory analysis where number of hypotheses are tested, but where the strict control which is provided by multiple comparison p

Hypergeometric distribution, Hypergeometric distribution is t he probabili...

Hypergeometric distribution is t he probability distribution related with the sampling without replacement from the population of finite size. If the population comprises of r ele

Explain information theory., Information theory: This is the branch of app...

Information theory: This is the branch of applied probability theory applicable to various communication and signal processing problems in the field of engineering and biology. In

Cluster sampling, Cluster sampling : A method or technique of sampling in w...

Cluster sampling : A method or technique of sampling in which the members of the population are arranged in groups (called as 'clusters'). A number of clusters are selected at the

Nearest-neighbour methods, Nearest-neighbour methods are the methods of di...

Nearest-neighbour methods are the methods of discriminant analysis are based on studying the training set subjects much similar to the subject to be classified. Classification mig

Explain o. j. simpson paradox, O. J. Simpson paradox is a term coming from...

O. J. Simpson paradox is a term coming from the claim made by the defence lawyer in murder trial of O. J. Simpson. The lawyer acknowledged that the statistics demonstrate that onl

Explain median absolute deviation (mad), Median absolute deviation (MAD) : ...

Median absolute deviation (MAD) : It is the very robust estimator of the scale given by the following equation   or, in other words we can say that, the median of the absolute

Combine standard deviation, what is the combine standard deviation height f...

what is the combine standard deviation height from the follwing

Implementation of huffman coding, Input to the compress is a text le with a...

Input to the compress is a text le with arbitrary size, but for this assignment we will assume that the data structure of the file fits in the main memory of a computer. Output of

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd