Explain initial data analysis (ida), Advanced Statistics

Initial data analysis (IDA): The first phase in the examination of the data set which comprises  number of informal steps including the following steps

* checking the quality of the data,

* calculating the simple summary statistics and constructing the suitable graphs.

The basic aim is to clarify the structure of the data, attain a simple descriptive sum and perhaps get ideas for the more sophisticated analysis.

Posted Date: 7/28/2012 9:12:29 AM | Location : United States







Related Discussions:- Explain initial data analysis (ida), Assignment Help, Ask Question on Explain initial data analysis (ida), Get Answer, Expert's Help, Explain initial data analysis (ida) Discussions

Write discussion on Explain initial data analysis (ida)
Your posts are moderated
Related Questions
Oracle property is a name given to techniques for estimating the regression parameters in the models fitted to high-dimensional data which have the property that they can correctl

Chi-squared distribution : It is the probability distribution, f (x), of the random variable de?ned as the sum of squares of the number (v) of independent standard normal variables

Prevented fraction is a measure which can be used to attribute the protection against the disease directly to an intervention. The measure can given by the proportion of disease w

This term is sometimes used for the analysis of data from the clinical trial in which treatments A and B are to be compared under the suppositions that the patients remain on their

Randomized response technique : The procedure for collecting the information on sensitive issues by means of the survey, in which an element of chance is introduced as to what quer

Quincunx  is the device used by Galton to illustrate his lectures, which is shown in the Figure. It had a glass face and a funnel at its top. The shot was passed through funnel an

The division of a sample of observations into several classes, together with the number of observations in each of them.  It acts as a useful summary of the main features of the da

The procedures used for determining how the quality of life is affected by the environment, in particular by factors such as air and solid wastes, water pollution, hazardous substa

Confidence interval : A range of the values, calculated from the sample observations which is believed, with the particular probability, to posses the true parameter value. A 95% c

Matching coefficient is a similarity coefficient for data consisting of the number of binary variables which is often used in cluster analysis. It can be given as follows    he