Spss-graphing-basic descriptive statistics

Assignment Help Basic Statistics
Reference no: EM131096084

Lab 1: Using SPSS, Graphing, Basic Descriptive Statistics, z-scores, & t-tests

TURN IN:

  1. A word document containing answers to the questions below only including appropriately labeled and formatted supporting tables and graphs ~ 120 points
  2. Copies of your annotated SPSS syntax and output ~10 points

a) When one annotates syntax it means to add notes and comments to oneself and others that SPSS does not read as code. In SPSS you would do this in the syntax window. Click where you want to add your comment and begin with an asterisk (*), then end your comment with a period (.). If it is correct it will turn gray. Your code should be annotated such that I will be able to easily identify what question the code is addressing.

Part I: Entering and analyzing a small data set (30 points)

Radial Growth (RGR) of Lichen R. geographicum in 17 Successive 3-month periods in North Whales in Relation to Eight climatic variables

RGR

Temp_max

Air Frosts

Rain Days

Rainfall

Temp_min

Ground Frosts

Sunshine hours

Wind Speed

Season

0.04

7.6

8

59

207.7

6.2

16

609

8.6

1

0.58

20.1

0

38

306

11.6

0

237.3

8.1

2

0.15

10.6

3

47

317.7

5.5

33

181.9

10.7

3

0.18

11.6

17

51

194.5

1.9

49

171.6

11.8

4

0.07

8.3

8

24

97.8

6.1

33

619

8.1

1

0.37

19.6

0

41

287.4

11.4

1

287.4

8.7

2

0.33

19.4

3

68

457.7

6

25

186.9

9.3

3

0.17

6.8

43

44

175.8

0.26

57

276.8

8.7

4

0.10

13.9

2

48

295.6

6.9

19

488.5

10

1

0.29

17.9

0

63

328

12

0

318.4

10

2

0.16

10.6

17

52

318.8

4.2

41

200.6

8.7

3

0.35

19.8

34

49

233.4

0.87

52

217.4

8.4

4

0.18

14.5

2

48

197.8

7

16

521.2

8.5

1

0.14

18

0

42

231.1

11.3

3

495.5

7.5

2

0.22

10.9

13

57

463.9

5

26

223

8.6

3

0.20

8.9

8

72

349.8

4

17

199.2

12.4

4

0.34

17.9

2

31

140.3

7.2

20

220.3

6.7

1

  1. Enter the data into SPSS:

Give each of the variables a label and set the type of variable in the variable view window. Set the decimal places to the correct place. For the season variable create labels such that 1 = spring, 2 = summer, 3 = fall, and 4 = winter. (Hint: You should be able to copy and paste columns of data into SPSS, remember to exclude the variable names when copying.)

2. Create a variable that indicates which observations fall above the mean RGR and which ones fall below it. Do the same for Rainfall. Be sure to give your new variables names, labels, value labels, and set the level of measurement. (10 pts)

(Hint: In SPSS use Transform à Recode into Different Variables).

  1. Use the data to answer the following questions:

a. What are the counts and percent of the categorical variables? (5 pts)

b. Based on Season and the new rainfall variable during what season does it rain more?

Explain the basis for your choice (3 pts).

c. Create a bar graph which depicts RGR by season. Add a title to the graph, copy the

graph and paste it into the document (5 pts). There are a few ways you can answer

this question. You only need to include 1 figure.

d. Create a histogram of RGR overall and another paneled by the new rainfall variable.

Add a title to the graphs, copy the graphs and paste them into the document (5 pts).

4. Paste a screen shot of the variable view into the document. (2 pts)

Part II: Computing and comparing descriptive statistics (25 points)

Using the Calcium.sav data set available on Blackboard, answer the following questions:

  1. Compute descriptive statistics for Alkaline Phosphatase, Calcium, and Inorganic Phosphorus.

a. Compare the mean, median, and mode in words for each variable. (5pts)

b. Create a histogram for calcium paneled in rows by gender and boxplot for Alkaline Phosphatase by gender. Edit each graph to have an appropriate title and labels, paste the graphs into this document. (5pts)

6. Compare characteristics between Males and females in the Calcium data set and fill in the frequency table below. For continuous variables put the mean and standard deviation. For categorical put the percent in each category. Hint: Use crosstabs for categorical variables and compare means for continuous variables. Then write a paragraph describing the differences between the men and women in this sample based on your analysis. (15 pts)

Table 1: Characteristics of the patients in Calcium.sav

Characteristic

Male (n=)

Female (n=)



Mean (sd) or N (%)

Mean (sd) or N (%)


Age (years)




Alkaline Phosphatase ((IU)/Liter)




Calcium (mmol/L)




Inorganic Phosphorus (mmol/L)




Lab




Metpath




Deyor




St. Elizabeth's




CB Rouche




YOH




Horizon










Part III: z-scores and t-tests (65 points)

Use the bodyfat2.sav file from Blackboard to answer the following questions.This data set contains body measurements body fat percent calculated using 2 different methods for a random sample of 252 college graduates.

  1. Make histograms of adiposityfatpct1, fatpct. Superimpose a normal curve on them. Do any of these distributions appear approximately normal? Describe how each of the distributions is different from the normal distribution (make sure to discuss skew & kurtosis). Include the histograms with appropriate titles and labeling in your response. (5pts)
  1. Generate a fake, normally distributed Caloric intake for each case in the file. To do this, use TransformàCompute variable and place the following expression into the dialog boxes:

Variable name: Calories

= RV.NORMAL(1500,200)

These instructions generate a random sample from a normal distribution with a mean of 1500 and a standard deviation of 200.

a. Make a histogram of Calories and superimpose a normal curve on it.

For the following "your sample" refers to the variable Calories you created in the data set. If it was exactly normally distributed then it would have the mean and standard deviation you requested. (Hint: You will NOT use SPSS to answer b, c, and d directly.)

b. If Calories is exactly normally distributed, what percentage of cases should have values between 1350 and 1600? What percentage of cases in your sample have Calories in this range?

c. What percentage of cases would you expect to have Calories 1800 or more if Calories is exactly normally distributed? What percentage of cases in your sample have Calories greater or equal to 1800?

d. What percentage of cases in your sample have Calories less than 1200? What would you expect if the distribution is exactly normal? (2 pts)

  1. Compute standard scores for the Calories variable. (Hint: Use AnalyzeàDescriptive Statisticsà Descriptives; click the "save standardized values as variables" box)and check the box to save standardized values as variables. This creates a new variable in your dataset called zcalories, what the z-score of calories would be for each subject in your dataset.)
  1.  
    1. Make a histogram with a normal curve superimposed.
  1.  
    1. What is the mean of this distribution? The standard deviation? Is this what you expected? Why?
  1.  
    1. Using the z-score table, what percentage of the cases would you expect to have standard scores between -1 and 1? Between 0 and 1.5? Greater than 2? Less than -2? (4 pts)

One-sample t-test

  1. Test the hypothesis that for all college graduates, the population value for body fat percentage is 20 when using method 1 (variable fatpct1).

a. What are the null hypothesis you want to test? The alternative hypothesis? (2 pts)

b. Test the hypotheses and write a brief summary of your conclusions. (Hint: You have ONE sample.) (5 pts)

c. What is the 95% confidence interval for the average body fat percentage using method 1? (Hint: use the Explore procedure) How does it differ from the 95% confidence interval for the difference? (2 pts)

d. Based on the 95% confidence interval for the mean can you reject the null hypothesis that the average population value for body fat percentage using method 1 is 18%? Explain. (2 pts)

Independent samples t-test

  1. We can use the body fat data to test hypotheses about differences in body fat percentages based on gender and age.
  1.  
    1. Describe the age and gender composition of those in the sample. Be sure to look at age by gender as well.

d. Test the null hypothesis that men and women have the same body fat percentage using method 1. List your null and alternate hypotheses and summarize the results. Can we assume equality of variance for the two groups? Why or why not? (8 pts)

e. Select only those 50 years old or older ). (Hint: use Dataà Select casesà "if condition is satisfied" and click if, enter the conditionage > 49.9). Test the null hypothesis that the average body fat percentage is the same for men and women over 50 years old. List your null and alternate hypotheses and summarize the results. Can we assume equality of variance for the two groups? Why or why not?

Paired samples t-test

  1. Use Compute to create a new variable that is the difference between the between the two body fat percent variables (variables fatpct1 and fatpct2). (Hint: use TransformàCompute)

a. Make a histogram of the difference.

b. Make a Q-Q plot of the differences. How does the distribution differ from the normal distribution? (Hint: Use AnalyzeàDescriptive StatisticsàExplore; under the "Plots" button, click the "Normality Plots with Tests" box)(2 pts)

c. Perform a paired t-test using the Paired-Samples T-Test procedure. Write a brief summary of your results. Be sure to state your null and alternative hypotheses. (Hint: Use the original variables)

d. Run a one-sample t-test on the difference variable. What is the difference you are testing for? Write a brief summary of your results. Be sure to state your null and alternative hypotheses.

e. Compare the results of the two tests. In what ways are the two tests different?

Reference no: EM131096084

Questions Cloud

Patients taking antihypertensive agent : Multiple drugs are often used in treating hypertension. Suppose 10% of patients taking antihypertensive agent A experience gastrointestinal (GI) side effects, whereas 20% of patients taking antihypertensive agent B experience such side effects.
Benefit from price discrimination : Buyers also benefit from price discrimination. Those with lower reservation prices would be excluded from buying the good if the rebate were not offered. In this case, some buyers whose reservation price is at least as high as the discounted price..
Aggregate demand or short-run aggregate supply : Next, state three variables that would cause an increase in the price level--indicate whether changes in the variable increase or decrease aggregate demand or short-run aggregate supply.
Power and violence are opposites : "To sum up: politically speaking, it is insufficient to say that power and violence are not the same. Power and violence are opposites; where the one rules absolutely, the other is absent. Violence appears where power is in jeopardy, but left to its ..
Spss-graphing-basic descriptive statistics : A word document containing answers to the questions below only including appropriately labeled and formatted supporting tables and graphs ~ 120 points
Extensive damage to both public and private oyster grounds : Oil spills in the Gulf of Mexico have been known to cause extensive damage to both public and private oyster grounds along the Louisiana and Mississippi shores. One way to protect shellfish along the shoreline is to release large volumes of fresh ..
Demand or supply for hairdressers when imposed : The market equilibrium wage is currently $12 per hour for hairdressers. Legislation imposed a minimum wage of $11.50 per hour. If there are no changes to either the demand or supply for hairdressers when imposed, the number of hairdressers will be?
How does cultural competency occur : The Purnell Model when working with subcultures (immigration status, gender, political beliefs, socioeconomic status, sexual orientation, educational status, etc.).
Calculate economic growth rate and growth rate of real GDP : China’s real GDP was 7,394 billion yuan in 2005 and 8,000 billion yuan in 2006. China’s population growth rate in 2006 was 0.5 percent. Calculate China’s economic growth rate and growth rate of real GDP per person in 2006.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Committee within the specified range

Government officials in Canberra have recently expressed concern regarding overruns on military contracts.  These unplanned expenditures have been costing Australians millions of dollars every year.  The prime minister impanels a committee of experts..

  If one of these individuals is selected at random find the

in an experiment to study the relationship of hypertension and smoking habits the following data are collected for 180

  The population of douglas fir trees was shown to have an

the population of douglas fir trees was shown to have an approximately normal diameter distribution with mu 50

  Analysis of variance test

One-way Analysis of Variance (ANOVA) test -  How many treatments are there and  What is the total sample size?

  Number of persons selected at random

Problem: 10.5% of the population is 65 or older. Find the probability that the following number of persons selected at random from 25 people are 65 or older.

  Recognizing midpoint of the first class

Recognize the midpoint of the first class. Recognize the class boundaries of the first class.

  Calculate the test statistic for a test of the hypothesis

researchers studied the effects of adding an exercise program to dieting in the achievement of weight loss. a sample of

  Is there evidence that rainfall is normally distributed

Can you support a claim that mean rainfall from seeded clouds exceeds 25 acre-feet? Use α = 0.01. Is there evidence that rainfall is normally distributed

  Value of the mean of the two x distributions

Suppose an x distribution has mean μ = 5. Consider two corresponding x distributions, the first based on samples of size n = 49 and the second based on samples of size n = 81. (a) What is the value of the mean of each of the two  x distributions?

  Mean and standard deviation of distribution of sample mean

Mean SAT score is computed for each sample. If the sample size is n=2,500, find the mean and standard deviation of the distribution of sample means.

  Qualitative research often generates large amounts of data

Qualitative research often generates large amounts of data that must be gathered and analyzed. The use of statistical software such as the Statistical Package for the Social Sciences (SPSS) can significantly simplify and enhance the analysis of qu..

  Obtain the linear correlation coefficient for the data

obtain the linear correlation coefficient for the data. round your answer to three decimal places. the data below show

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd