Perform a suitable hypothesis test

Reference no: EM131984945

Statistics and Data Analysis - Statistical Modelling Assignment

OVERVIEW OF THE ASSIGNMENT

This assignment will test your ability to collect and analyse data to answer a specific business problem. It will also test your understanding of statistical methods and your ability to use them to make inferences about business data and solve business problems, including constructing hypotheses, testing them and interpreting the findings.

The gender gap is the difference between the salaries of men and the salaries of women. The gap arises not only from discrimination in hiring but also from differences in the industries in which women and men work, as well as many other reasons. Using an edited subset of the sample file from the Australian Taxation Office (ATO), your task is to summarise and analyse several aspects of salary and occupation by gender. In addition, you are asked to propose one relevant research question and then collect and analyse a dataset that will answer it.

TASK DESCRIPTION: WRITTEN REPORT

There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email containing a dataset that is specifically allocated to you. This dataset is a subset of the 2013-2014 individual sample file provided by the ATO, edited to include only a subset of the cases and variables. The original dataset can be obtained under the Creative Commons Attribution 3.0 Australia licence. The data dictionary for the edited dataset is given in the following table.

Variable    Description                     Values
Gender      Gender (sex)                    Female or Male
Occ_code    Salary/wage occupation code     0 = Occupation not listed / Occupation not specified
                                            1 = Managers
                                            2 = Professionals
                                            3 = Technicians and Trades Workers
                                            4 = Community and Personal Service Workers
                                            5 = Clerical and Administrative Workers
                                            6 = Sales workers
                                            7 = Machinery operators and drivers
                                            8 = Labourers
                                            9 = Consultants, apprentices and type not specified or not listed
Sw_amt      Salary/wage amount              All numeric
Gift_amt    Gifts or donation deductions    All numeric

Dataset 2: Collect data (e.g. via a survey) that will answer your research question. There are no requirements on the number of variables, the sampling method or the sample size, but you need to justify your approach in Section 1 (see below).

Both datasets should be saved in a single Excel file, on separate worksheets. All data processing should be performed in Excel or StatKey.

Prepare a report in a document file (.doc or .docx) that includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction

a. Give a brief introduction about the assignment, including your research question. Include a short summary of a related article with a proper citation.

b. Dataset 1: Give a short description of this dataset. Is it primary or secondary data? What types of variables are involved? Display the first 5 cases of your dataset.

c. Dataset 2: Explain how you collected the data and discuss its limitations (e.g. whether your sample is biased). Is it primary or secondary data? What types of variables are involved? You don't need to display your data in this section.

2. Section 2: Descriptive Statistics

Use Dataset 1

a. Using a suitable graphical display, describe the relationship between the variables Gender and Occ_code for Dataset 1. Make sure your graph shows the distribution of Gender for each Occ_code.

b. Using a suitable graphical display, describe the relationship between the variables Gender and Sw_amt.

c. Using a suitable numerical summary, describe the relationship between the variables Gender and Sw_amt.

d. Using a suitable graphical display, describe the relationship between the variables Sw_amt and Gift_amt.
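
The numerical summaries behind parts (a) and (c) can be cross-checked outside Excel or StatKey. The Python sketch below is only an illustration and does not replace the required Excel or StatKey processing. The rows are made-up values (not ATO data), and the function names `salary_summary` and `gender_share_by_occupation` are hypothetical.

```python
from collections import Counter, defaultdict
from statistics import mean, median, stdev

# Hypothetical toy rows (Gender, Occ_code, Sw_amt); NOT real ATO data.
rows = [
    ("Female", 2, 72000), ("Male", 2, 81000), ("Female", 5, 54000),
    ("Male", 7, 63000), ("Male", 7, 58000), ("Female", 2, 69000),
    ("Male", 1, 95000), ("Female", 5, 51000),
]

def salary_summary(rows):
    """Return {gender: {n, mean, median, sd}} for Sw_amt."""
    by_gender = defaultdict(list)
    for gender, _occ, salary in rows:
        by_gender[gender].append(salary)
    return {
        g: {"n": len(v), "mean": mean(v), "median": median(v), "sd": stdev(v)}
        for g, v in by_gender.items()
    }

def gender_share_by_occupation(rows):
    """Return {occ_code: {gender: proportion}}; shares sum to 1 per occupation."""
    counts = defaultdict(Counter)
    for gender, occ, _salary in rows:
        counts[occ][gender] += 1
    return {
        occ: {g: c / sum(cnt.values()) for g, c in cnt.items()}
        for occ, cnt in counts.items()
    }

if __name__ == "__main__":
    for gender, stats in salary_summary(rows).items():
        print(gender, stats)
    for occ, shares in sorted(gender_share_by_occupation(rows).items()):
        print(occ, shares)
```

The per-occupation shares correspond to what a 100% stacked bar chart of Gender within each Occ_code would display. The per-gender mean, median and standard deviation are one possible choice of numerical summary for part (c).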

3. Section 3: Inferential Statistics

Use Dataset 1

a. List the top 4 occupations by median salary and find the gender proportions within each of those occupations.

b. Perform a suitable hypothesis test at a 5% level of significance to test whether the proportion of machinery operators and drivers who are male is more than 80%.

c. Perform a suitable hypothesis test at a 5% level of significance to test whether there is a difference in salary amount between the genders.

Use Dataset 2

d. Perform a suitable statistical analysis on Dataset 2 (the one you collected) that will answer your research question.
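
For parts (b) and (c), one possible pair of tests is a right-tailed one-proportion z-test (H0: p = 0.80 against H1: p > 0.80) and a two-sided comparison of mean salaries. The Python sketch below is a hedged illustration for checking Excel or StatKey results, not the prescribed method. It assumes the normal approximation is adequate (for the proportion test, roughly n·p0 ≥ 10 and n·(1 − p0) ≥ 10) and uses a large-sample z approximation with unequal variances in place of a t-test, which is reasonable only when both groups are large. The function names and the example counts are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, variance

def one_proportion_z_test(successes, n, p0):
    """Right-tailed z-test of H0: p = p0 against H1: p > p0."""
    p_hat = successes / n
    se = sqrt(p0 * (1 - p0) / n)          # standard error computed under H0
    z = (p_hat - p0) / se
    p_value = 1 - NormalDist().cdf(z)     # upper-tail probability
    return z, p_value

def two_sample_z_test(x, y):
    """Two-sided large-sample test of H0: mu_x = mu_y (unequal variances).

    Assumption: both samples are large enough for the normal approximation;
    for small samples a t-test would be the more usual choice.
    """
    se = sqrt(variance(x) / len(x) + variance(y) / len(y))
    z = (mean(x) - mean(y)) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

if __name__ == "__main__":
    # Hypothetical counts: 90 males among 100 machinery operators and drivers.
    z, p = one_proportion_z_test(successes=90, n=100, p0=0.80)
    print(f"z = {z:.2f}, p-value = {p:.4f}, reject H0 at 5%: {p < 0.05}")
```

In both cases the decision rule at the 5% level is to reject H0 when the p-value is below 0.05. For part (c), `x` and `y` would hold the Sw_amt values for each gender.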

4. Section 4: Discussion & Conclusion
a. What can you conclude from your findings in the previous sections?
b. Give a suggestion for further research

TASK DESCRIPTION: PRESENTATION/INTERVIEW

A presentation/interview for the assignment is scheduled for Week 11, in your allocated tutorial.

You do NOT need to prepare presentation material (e.g. PowerPoint slides). Instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to reproduce what you have produced in your written report (e.g. generate a chart or numerical summary using Excel or StatKey).

SUBMISSION REQUIREMENT

1. Main report, in a Microsoft Word document file (this is the file that will be marked; it should contain all necessary tables and figures)

2. Dataset, in a Microsoft Excel file (this is just a supporting file)

Main report (Word document):

1. Size: A4
2. Use Assignment Cover Page (download from Moodle) with your details and signature
3. Single space
4. Font: Calibri, 11pt

Dataset (Excel document):
1. Dataset 1 in Sheet 1
2. Dataset 2 in Sheet 2
3. Data processing for each section in other sheets (rename the sheet appropriately)


Perform a suitable hypothesis test : BUS708 Statistics and Data Analysis - Statistical Modelling Assignment - Assignment will test your skill to collect and analyse data to answer business problem


5 DEDUCTION, LATE SUBMISSION AND EXTENSION

Late submission penalty: a deduction of 5% of the total available marks per calendar day, unless an extension is approved. For the extension application procedure, please refer to Section 3.3 of the Subject Outline.

6 PLAGIARISM

Please read Section 3.4, Plagiarism and Referencing, in the Subject Outline. Below is part of the statement:

"Students plagiarising run the risk of severe penalties ranging from a reduction through to 0 marks for a first offence for a single assessment task, to exclusion from KOI in the most serious repeat cases. Exclusion has serious visa implications."


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd