Explain why an ANOVA is the most appropriate analysis

Assignment Help Applied Statistics
Reference no: EM132303000

SPSS Analysis Assignment - 

This assignment consists of a series of tasks that generally should be completed sequentially. This assignment must be completed individually. In discussing the assignment with your peers (which should be kept to a minimum), your enquiries should be general in nature (about SPSS and not about the assignment questions) and should never reveal the solution to the assignment tasks.

Information and Tasks -

Data Information/Background -

For this assignment, there are two datasets that you will be using: Dataset 1 (SPSS_Assign_Pt2_Dataset1.sav) - You will use this dataset for Tasks 1 to 3. This is similar data to that which was used in the Part 1 assignment (important: you must use this new dataset, NOT your dataset from the Part 1 assignment).

The table below provides information on the variables in this dataset. The data was collated by a not-for-profit organisation named Gapminder, who aim to promote sustainable global development by increased use and understanding of statistics.

While the data is real, the experimental scenarios and research questions herein may not be factual and may have been manipulated for educational purposes.

Note: for the purpose of this assignment, id is not considered a variable, but is instead simply a unique country identification number.

Dataset 2 (SPSS_Assign_Pt2_Dataset2.sav) - You will use this dataset for Tasks 4 to 10.

You should save any changes you make to the dataset and output files as you go along. You will need to submit your final datasets (file extension .sav) at the end of this assignment. Your output (tables and graphs) files directly in SPSS (.spv extension) will also need to be submitted, but will not necessarily be marked (they're there for verification purposes only). The task descriptions below will indicate which tables and/or graphs are necessary for you to include in each section of your assignment document for your submission.

For Tasks 1 to 3, you must use SPSS_Assign_Pt2_Dataset1.sav -

Over tasks 1 to 3, you will be examining the relationship between hemisphere and life expectancy in 2007. To do this, you will be performing an independent t-test (ITT) with the hemisphere and lifeExp07 variables, and specifically investigating the research question: "Do countries in the Northern hemisphere have a higher life expectancy than those in the Southern hemisphere?" For the following tasks, use a one-tailed α = 0.05.

Task 1 - Independent t-tests (Thinking About the Research Question)

Answer these questions succinctly (use the following numbering in your submission; you must not re-write the question):

1. What is the independent variable (IV)? What type of variable (quantitative or qualitative) is the IV? If it is qualitative, state the number of levels (categories) in the variable.

2. What is the dependent variable (DV)? What type of variable (quantitative or qualitative) is the DV? If it is qualitative, state the number of levels (categories) in the variable.

3. Briefly explain (in 1-4 sentences) why an ITT is the most appropriate analysis for this research question. (Hint: think about the variables; you could also compare/contrast why other tests may not be suitable)

4. What is the null hypothesis for this research question? Write this in words only (statistical notation not necessary).

Task 2 - Independent t-tests (Visualising the Data)

Generate an error bar plot of the confidence intervals of the mean of lifeExp07 in each category of hemisphere. Insert this graph into your assignment submission.

Briefly describe (in 1-4 sentences) what this graph shows, with specific mention of:

  • What the middle point of the confidence intervals demonstrates about the groups (i.e. what statistic does this middle point represent? Compare this between the two groups).
  • What the overlap or lack thereof between the intervals suggest about statistical significance.
  • What the difference/similarity in width between the two groups indicates.

Hint 1: Confidence level.

Hint 2: In thinking about the width of the CI, what are the some of the key determinants of the width of a confidence interval?

Task 3 - Independent t-tests (Hypothesis Test)

Perform an ITT to evaluate this research question. Assume the assumption of normality has been met and use the built-in test of homogeneity of variance to test this latter assumption (i.e. do not generate this using the Explore function).

For this task, you must insert your Group Statistics and Independent Samples Test output tables into your assignment document (remember to paste as an image). Also, you should provide commentary on your assumption (homogeneity of variance) and hypothesis test findings (see hint below). Ensure you state the value of the p-value for the ITT and not just its relationship to α (for example, if p = 1 × 10-3, you should write it in this form, and not "p < 0.05"). State the p-value correct to at least 4 decimal places (even if you write it in scientific notation).

Given your statistical conclusion above, what can you conclude in relation to your research question?

Finally, briefly discuss (no more than 2 sentences) what some key differences (if any) in your conclusions or the output you generated would be if you had analysed this as a two-tailed test instead.

Hint 1: Confidence level.

Hint 2: use the pro forma on page 7 of the Module 3 SPSS Workbook to guide your answer for this task, and use the M3 lecture slides and/or refer to the Allen/Bennett text for examples of reporting your results. When describing assumption testing, you should briefly state the null hypothesis of the homogeneity of variance test, and how you've determined if it is violated or not. Be specific.

For Tasks 4 to 7, you must use SPSS_Assign_Pt2_Dataset2.sav

In tasks 4 to 7, you must use one-way analysis of variance (ANOVA) to examine the relationship between continent and life expectancy in 1977 (lifeExp77). In this analysis, assume all assumptions have been met and α = 0.05.

Task 4 - ANOVA (Thinking About the Research Question)

Answer these questions succinctly (use the following numbering in your submission; you must not re-write the question):

1. What is the independent variable (IV)? What type of variable (quantitative or qualitative) is the IV? If it is qualitative, state the number of levels (categories) in the variable.

2. What is the dependent variable (DV)? What type of variable (quantitative or qualitative) is the DV? If it is qualitative, state the number of levels (categories) in the variable.

3. Briefly explain (in 1-4 sentences) why an ANOVA is the most appropriate analysis for this research question.

4. What is the null hypothesis for this research question? What is the alternate hypothesis? Write these in words only (statistical notation not necessary).

Task 5 - ANOVA (Visualising the Data)

Generate an error bar plot of the confidence intervals of the mean of life expectancy for each continent. Insert this graph into your assignment document.

Comment (in only a few concise sentences/dot points) what this graph shows, with specific mention of what the middle point of the confidence intervals shows about the groups, any trends you observe and the results you expect to observe in the hypothesis testing.

Task 6 - ANOVA (Hypothesis Test)

Perform an ANOVA to answer your research question. For this task, only include the Descriptives and ANOVA output tables in this task submission.

Should you reject or accept the global (i.e. overall) null hypothesis? What does this conclusion mean in terms of the research question, and should you proceed with post-hoc analysis?

Task 7 - ANOVA (Post-Hoc Analysis)

Regardless of your conclusion about proceeding to post-hoc analysis in the preceding task, perform both Bonferroni and LSD post-hoc analyses (if you haven't already done so).

How many unique pairwise comparisons do you have for each post-hoc analysis (i.e. for each of LSD and Bonferroni)?

Complete the table in attached file (you should use the template table provided and delete rows as necessary). The name of the first comparison is provided as an example of how to fill in that column. The p-values should be reported to at least 3 decimal places (hint: if SPSS displays the p-value as ".000" to 3dp, what does this indicate?)

Do each your Bonferroni and LSD conclusions match, or do you have any contradictory results (where one indicates a significant difference, but the other does not)? If any contradict, suggest an explanation as to why.

What can you conclude about the relationship between these two variables in the research question?

For Tasks 8 to 10, you must use SPSS_Assign_Pt2_Dataset2.sav

Using the same dataset as above, the researchers now want to see whether the gross domestic product (GDP) per capita has changed from 1977 to 2007. As the data for GDP per capita is positively skewed, both the 1977 and 2007 data have been log-transformed to allow for parametric statistical tests. Use these log-transformed variables for the following tasks.

Task 8 - Paired t-test (Thinking About the Research Question)

Answer these questions succinctly (use the following numbering in your submission; you must not re-write the question):

1. What is the independent variable (IV)? What type of variable (quantitative or qualitative) is the IV? If it is qualitative, state the number of levels (categories) in the variable.

2. What is the dependent variable (DV)? What type of variable (quantitative or qualitative) is the DV? If it is qualitative, state the number of levels (categories) in the variable.

3. Briefly explain (in 1-4 sentences) why a paired t-test is the most appropriate analysis for this research question.

4. What is the null hypothesis for this research question? Write this in words only (statistical notation not necessary).

5. How many countries contributed any data towards this dataset?

Task 9 - Paired t-test (Assumption Testing)

List the assumptions of the paired t-test. Indicate which of these (if any) are testable using SPSS.

Perform assumption testing for this analysis. To test the assumption of normality (both graphically and numerically), you should use the Explore function. Insert the relevant output into your assignment document and comment on each - draw conclusions about whether the assumptions have been met overall.

Regardless of the outcomes of your assumption testing, suggest at least one alternative action that should be taken if the normality assumption is violated (note: you do not need to do this, just provide a suggestion).

Hint 1: there should be four assumptions (see the module lecture slides).

Hint 2: you will also need to use the Compute function as part of this task.

Task 10 - Paired t-test (Hypothesis Testing)

Regardless of the conclusions drawn from the assumption testing above, perform a paired t-test to test the null hypothesis. Include your Paired Samples Statistics and Paired Samples Test tables in your task submission. Also answer the following:

  • What statistical conclusion can you draw?
  • What does this mean in terms of the research question?
  • The data from how many countries were included in this paired t-test analysis? Does this correspond to the number of countries you stated in Task 8 (Question 5)? Why/why not?

Task 11 - Finally Submit Your Assignment, Dataset and Output Files.

Attachment:- Assignment Files.rar

Reference no: EM132303000

Questions Cloud

Something happens when groups of people come together : "Something happens when groups of people come together, called emergence, producing results that are greater than the sum of their parts"
Configuring and troubleshooting medium-size networks : ITNE2003 - Install, Configure, Operate and troubleshoot Medium-Size Networks - Develop Gantt Chart showing execution of the processes
What are the ecological integrity of coastal-marine : What are the Ecological integrity of coastal-marine, sustainability of both human and ecological systems of Kruger Inc? Why? Explain
What are the stakeholder engagement : What are the Stakeholder Engagement, Ecosystem Processes, and Environmental Perturbations of Kruger Inc? Why? Explain
Explain why an ANOVA is the most appropriate analysis : SPSS Analysis Assignment - Griffith University, Briefly explain (in 1-4 sentences) why an ANOVA is the most appropriate analysis for this research question
Aggregate supply and aggregate demand : Use aggregate supply and aggregate demand to predict the effects of fiscal and monetary policy actions on output, unemployment, and inflation.
Explain the benefits and costs of international trade : Explain the benefits and costs of international trade and the role of international trade in the U.S. economy.
Define and calculate gross domestic product : Define and calculate gross domestic product, inflation rate, and unemployment rate.
Why is morality important in business ethics : Why is Morality important in business ethics? What role does in play?

Reviews

len2303000

5/10/2019 4:07:35 AM

This assignment consists of a series of tasks that generally should be completed sequentially. You will need to use the template document provided to compile your assignment into a single document; hereafter, this document is referred to as your “assignment submission”. This assignment must be completed individually. In discussing the assignment with your peers (which should be kept to a minimum), your enquiries should be general in nature (about SPSS and not about the assignment questions) and should never reveal the solution to the assignment tasks. No output is to be shared between students either. Failure to comply with these guidelines is a breach of the University’s Academic Integrity policies and will result in further academic penalty according to the Student Academic Misconduct Policy.

len2303000

5/10/2019 4:07:28 AM

Marking Criteria & Feedback - You will have seen from Part 1 how the marking criteria are structured to assess each part of a task. While the criteria for this assignment will be necessarily different, again, in this Part 2 assignment, it assesses your ability to use SPSS to correctly generate descriptive and inferential analysis output, as well as the interpretation of this output. The marking criteria used will be based on your ability to: Generate the correct output, based on the instructions provided, Interpret and communicate your understanding of the output and Relate your findings back to the research question or background of the data given.

len2303000

5/10/2019 4:07:21 AM

Your mark will be available via My Marks, and an announcement will be made when this is available. Every endeavour will be made to provide results and/or feedback for this assignment prior to the provision of the Part 3 assignment. This part (part 2 of 3) of the SPSS assignments is worth 10% of your final course grade. Some specific expectations: p-values less than 0.001 should be changed to scientific notation. All variables should have correctly defined variable labels and value labels (with units, if necessary). Axes on graphs should be labelled correctly, with variable labels (not variable names) and units. Images should be resized so that they are clear to a reader of your assignment. Tables should be inserted as images, and not simply copy-pasted (unless explicitly stated otherwise).

len2303000

5/10/2019 4:07:15 AM

Read the assignment tasksheet again – chances are you skimmed over an important instruction or piece of information about the data. If relevant, have you tried a web search for general/technical issues and questions? There is a wealth of information on using SPSS on the web. Some of the tasks may require you to include a screenshot/screen capture of your screen/output. Additionally, all tables (unless otherwise stated) should be inserted as images, and screen captures are usually the easiest way to achieve this. The image you submit must be clear enough for the markers to read the text.

len2303000

5/10/2019 4:07:01 AM

You will need to insert graphs and tables that you have generated in SPSS into your assignment submission. You can do that individually for each output item required, or you can do it en masse for all output as follows: Please note, where you are required to submit your output file (Task 11) - you must not submit this in the Word export version. It must be submitted as a .spv out file. Save your assignment file as a PDF document and Save any/all changes you’ve made to your dataset (.sav file). Save your output file(s). If you have more than one output file, ensure each is labelled with the tasks they cover (e.g. “Output Task 1”, “Output Task 1-5”). Do not submit Word-exported files.

len2303000

5/10/2019 4:06:54 AM

For this assignment, you should end up with at least 4 files to submit: your assignment document (in PDF format), your two dataset files, and your output file(s). Note: these data and output files may not be assigned marks based on their content – you are submitting them in case we need to verify any of your work. There will however be marks assigned for completing this task as instructed. IF YOU DO NOT SUBMIT YOUR FILES CORRECTLY, EXPECT TO RECEIVE A PENALTY OR YOUR ASSIGNMENT NOT TO BE MARKED AT ALL . These consequences can be avoided simply by reading instructions carefully. Do not leave submitting to the last minute too.

Write a Review

Applied Statistics Questions & Answers

  Hypothesis testing

What assumptions about the number of pedestrians passing the location in an hour are necessary for your hypothesis test to be valid?

  Calculate the maximum reduction in the standard deviation

Calculate the maximum reduction in the standard deviation

  Calculate the expected value, variance, and standard deviati

Calculate the expected value, variance, and standard deviation of the total income

  Determine the impact of social media use on student learning

Research paper examines determine the impact of social media use on student learning.

  Unemployment survey

Find a statistics study on Unemployment and explain the five-step process of the study.

  Statistical studies

Locate the original poll, summarize the poling procedure (background on how information was gathered), the sample surveyed.

  Evaluate the expected value of the total number of sales

Evaluate the expected value of the total number of sales

  Statistic project

Identify sample, population, sampling frame (if applicable), and response rate (if applicable). Describe sampling technique (if applicable) or experimental design

  Simple data analysis and comparison

Write a report on simple data analysis and comparison.

  Analyze the processed data in statistical survey

Analyze the processed data in Statistical survey.

  What is the probability

Find the probability of given case.

  Frequency distribution

Accepting Manipulation or Manipulating

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd