Data project, Applied Statistics

Choose any published database from the internet or Bethel library (such as those from the Census Bureau or any financial sites). You may opt to use one of the data files provided by the instructor if applicable.
* Get advance approval from the instructor on your chosen database.
* If the file is large, randomly choose 200 of the observations from the data.
* Explain each variable in the file that you are analyzing. Be sure your file includes at least 3 scale variables and at least 2 nominal variables.
* Conduct a descriptive analysis on any 2 interval / ratio variables you wish using Descriptive_Statistics.xls and Frequency_Distribution.xls. Explain the output.
* Conduct 3 different hypothesis tests of your choice using appropriate variables from the file (note: you must use 3 different tests and not run one test on 3 different variables). In each case, state the variables being tested as well as the hypothesis, decision and conclusion. Use 3 of the following (1-Sample Test for Means, 1-Sample Test for Proportions, 2-Sample Test for Means - Independent Samples, 2-Sample Test for Means - Paired Samples, 2-Sample Test for Proportions, Analysis of Variance, Chi Square Goodness of Fit Test, Chi Square Test of Independence, Correlation Test).
* Develop a model to predict an interval / ratio variable using at least 2 other variables. Use Multiple_Regression.xls and state the regression model and which variables are or are not significant. Also, use the model to make a prediction by making up values for each of the independent variables.
* Write a one to two page summary of your findings. Include the data file in the appendix.
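
The assignment itself calls for the instructor's Excel templates, so the following is only a rough sketch of how the sampling and descriptive steps might be cross-checked in Python. The data, the `income` variable, the hypothesized mean of 50000, and the helper names (`sample_rows`, `describe`, `one_sample_t`) are all made up for illustration and are not part of the assignment.

```python
import random
import statistics


def sample_rows(rows, k=200, seed=0):
    """Return k rows chosen at random without replacement.

    If the file has k rows or fewer, all rows are returned unchanged.
    """
    if len(rows) <= k:
        return list(rows)
    return random.Random(seed).sample(rows, k)


def describe(values):
    """Basic descriptive statistics for one interval/ratio variable."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation (n - 1)
        "min": min(values),
        "max": max(values),
    }


def one_sample_t(values, mu0):
    """t statistic for a 1-sample test of means, H0: population mean == mu0."""
    n = len(values)
    standard_error = statistics.stdev(values) / n ** 0.5
    return (statistics.mean(values) - mu0) / standard_error


# Hypothetical data: 1,000 made-up incomes standing in for a real data file.
rng = random.Random(42)
population = [{"id": i, "income": rng.gauss(50000, 12000)} for i in range(1000)]

sample = sample_rows(population, k=200, seed=1)
incomes = [row["income"] for row in sample]

summary = describe(incomes)
t_stat = one_sample_t(incomes, mu0=50000)

print(summary)
print("t statistic:", round(t_stat, 3))
```

The t statistic would still need to be compared against a critical value (or converted to a p-value) with 199 degrees of freedom to reach a decision, and the remaining tests and the multiple regression would need a statistics package or the Excel templates named above.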

Posted Date: 8/26/2012 1:54:47 PM | Location : United States







Related Questions
The sum of the mean and variance of a binomial distribution of 5 trials is 9/5; find the binomial distribution.

Try different numbers of clusters in your program (K=2...15) and build a plot that shows the dependency between number K and value of RSS function on the last iteration. What is th

b. A paper mill produces two grades of paper viz., X and Y. Because of raw material restrictions, it cannot produce more than 400 tons of grade X paper and 300 tons of grade Y


Define sampling unit and population for selecting a random sample in every case. a) 100 voters from a constituency b) 20 stocks of National Stock Exchange c) 50 account ho

Dr. Jim Mirabella UNIT EIGHT: DATA ANALYSIS PROJECT All Excel output should be copied into a single Word document where you must enter all of your responses to the questions below.

The president of a certain firm, concerned about the safety record of the firm's employees, sets aside $50 million a year for safety education. The firm's accountant believes that more


For the following claim, find the null and alternative hypotheses, test statistic, P-value, critical value and draw a conclusion. Assume that a simple random sample has been selec