Draw a scatter plot of natural log of total expenditures

Reference no: EM132192155

Analysing Household Data

Have you been part of a national census? Privacy issues aside, a census provides lots of data that can inform government policies and actions - but to be useful, the data needs to be analysed and interpreted.

In this assignment, we will use statistical methods to analyse and interpret real-world demographic data.

The goal of this assignment is to

- Test your understanding of statistical methods and approaches
- Improve your ability to use Excel for manipulation of data (see here for some guides on using Excel)
- Understand the real-world applications and implications of statistics

To complete this assignment you must
- Complete a set of statistical analysis tasks on a unique data set (both tasks and data set will be provided to you)
- Submit a report in Word detailing your response to each task (the final answer and the reasoning, including calculations, that led to it)
- Submit an Excel document that contains your data set and the calculations you used to complete the tasks

Important:
- This is an individual Assignment worth 20%. Each student will use a unique data set to complete the assignment.
- You should use Excel for all your computational work.
- Please provide detailed calculations for all the tasks. Not explaining how you arrived at your conclusions may result in only partial marks being awarded for those tasks.
- Present your computer results in appropriate tabular form. Also, include graphs where required to support your answers.

Instructions:

Analysing Household Data

In this assignment, you will be analysing and interpreting household data.

Step 1: Prepare Data Set
- You need to download and modify the generic data set to identify the sample you will be working on. More information on the data set is available below.
- Important: You do not need to analyse the entire data set - only draw a random sample of 250 households. This must be done first and the rest of your statistical analysis will be based on this data set.

- You will need to know how to manipulate data in Excel. You can use any other compatible spreadsheet tool (for example Numbers for Mac OS, OpenOffice, LibreOffice etc) but be aware that some of the functions differ slightly.

- There are plenty of resources online and in this LMS that will show you how to perform these tasks in Excel. Use these resources (and Google) to improve your skills as you progress. If you have a particularly difficult challenge, share it in the forum and your tutors or peers will help.

Step 2: Analyse Data & Submit Report

- You will be presented with a set of tasks to run on your unique data set (of 250 households). The full list of tasks is given below.

- As much as possible, complete these tasks within the Excel document. You will need to submit your Excel document at the end of the assignment.

- You will also submit a brief report in Word addressing all the tasks below. When responding to the tasks, please explain the reasoning behind your answer, and refer to your Excel sheet.

Data Set: Household data

The data set for this assignment is available on the LMS (Data Set for Assignment 01.xls). It includes information about 2000 households across the following variables.
- Income: Annual income in AUD
- ATaxInc: After-tax annual income in AUD
- Grocery: Annual expenditures on groceries in AUD
- Alcohol: Annual expenditures on alcohol in AUD
- Meals: Annual expenditures on meals eaten out in AUD
- Fuel: Annual expenditures on fuel in AUD
- Cloth: Annual expenditures on clothing in AUD
- Phone: Annual expenditures on phone in AUD
- Utilities: Annual expenditures on utilities (Water, Gas, Electricity) in AUD
- Texp: Annual total expenditures in AUD
- Children: Number of children in a household
- Adults: Number of adults in a household
- OwnHouse: This is a categorical variable that takes the value 1 if a household owns a house and 0 otherwise.
- GHH: Gender of the Head of Household (M: Male, F: Female)
- Highest Degree: Highest Level of Education, coded as follows:

P: Primary
S: Secondary
I: Intermediate
B: Bachelor
M: Master

Tasks for Analysis of Data Set

Complete the following tasks based on the unique data set you generated. These questions should be answered in a Word document, with brief reasoning to justify your answers. Your answers and reasoning should be consistent with the tables and graphs in the Excel sheet.

Task 1
A. Draw a random sample of two hundred and fifty (250) households as per the sample selection procedure. What sampling method have you used to select your sample data? In your opinion, is this the best method of sampling particularly when one is interested in characteristics like the gender of the household head, education levels etc., why or why not?
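
The sample selection procedure itself is not reproduced here, so the following is only a sketch of one possible approach: simple random sampling without replacement. The assignment requires Excel for the actual work; Python is used here purely to illustrate the idea. The household IDs 1 to 2000 and the function name `draw_sample` are hypothetical stand-ins for the rows of the real data set.

```python
import random


def draw_sample(household_ids, n=250, seed=None):
    """Draw a simple random sample of n IDs without replacement."""
    rng = random.Random(seed)  # a fixed seed makes the draw reproducible
    return sorted(rng.sample(household_ids, n))


# Hypothetical row IDs standing in for the 2000 households in the data set.
population = list(range(1, 2001))
sample_ids = draw_sample(population, n=250, seed=42)

print(len(sample_ids), "households selected, first five:", sample_ids[:5])
```

In a spreadsheet, a comparable result can be obtained by adding a column of random numbers, sorting on it, and keeping the first 250 rows.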

B. Compute the descriptive statistics and draw a Box-Whisker plot of expenditures on the following variables (all series in one graph):
(i) Alcohol (ii) Meals (iii) Fuel (iv) Phone
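
As a cross-check on the spreadsheet output, the quantities behind a box-and-whisker plot can be sketched in Python as below. The values in `alcohol` are made up for illustration and are not from the data set. The quartiles use the "inclusive" method, which is assumed here to correspond to Excel's QUARTILE.INC.

```python
import statistics


def describe(values):
    """Return basic descriptive statistics plus the five-number summary."""
    # Inclusive quartiles treat the data as containing its own min and max.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "min": min(values),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(values),
        "iqr": q3 - q1,
    }


# Hypothetical annual alcohol expenditures in AUD (illustration only).
alcohol = [0, 120, 250, 300, 480, 520, 760, 900, 1500]
summary = describe(alcohol)

for name, value in summary.items():
    print(f"{name:>6}: {value}")
```

Running the same function on each of the four series gives comparable columns of figures, which is what contrasting the distributions in part (C) relies on.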

C. Use information from the descriptive statistics and the boxplots in part (B) above to present a summary of your findings by contrasting different features of these distributions.

Task 2
A. Construct a frequency distribution of the expenditures on Utilities, using the following classification (11 classes).

Class 1:  0 - 300
Class 2:  300 - 600
...
Class 10: 2700 - 3000
Class 11: More than 3000

B. Using frequency distribution of the utilities above, what is the percentage of households who spend on Utilities
a. at the most $900 per annum
b. between $1500 and $2700 per annum, and
c. more than $3000 per annum.
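
The binning and the percentage questions can be sketched as follows. The task does not say which class a value exactly on a boundary (for example $300) belongs to, so this sketch assumes the upper limit is inclusive, which is also how Excel's FREQUENCY function treats its bins. The amounts in `utilities` are hypothetical.

```python
import math
from collections import Counter

WIDTH = 300       # class width in AUD
N_CLASSES = 11    # classes 1..10 cover 0-3000, class 11 is "more than 3000"


def utilities_class(amount):
    """Map an amount to its class number (upper limit inclusive, an assumption)."""
    if amount > WIDTH * (N_CLASSES - 1):
        return N_CLASSES
    if amount <= 0:
        return 1
    return math.ceil(amount / WIDTH)


def frequency_table(amounts):
    """Return a list of (class number, count, percent) for all 11 classes."""
    counts = Counter(utilities_class(a) for a in amounts)
    total = len(amounts)
    return [(k, counts[k], 100 * counts[k] / total) for k in range(1, N_CLASSES + 1)]


def percent_in_classes(table, first, last):
    """Sum the percentages of classes first..last inclusive."""
    return sum(pct for k, _, pct in table if first <= k <= last)


# Hypothetical annual utilities expenditures in AUD (illustration only).
utilities = [150, 300, 450, 900, 1200, 1600, 2000, 2700, 2900, 3500]
table = frequency_table(utilities)

at_most_900 = percent_in_classes(table, 1, 3)      # classes up to 900
from_1500_to_2700 = percent_in_classes(table, 6, 9)  # classes 1500-2700
more_than_3000 = percent_in_classes(table, 11, 11)
print(at_most_900, from_1500_to_2700, more_than_3000)
```

Because the questions in part B are phrased on class boundaries, each answer is just a sum of whole-class percentages.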

Task 3
A. Find the top 5% value and the bottom 5% value of the households' annual after-tax income (ATaxInc). What do these two values imply?
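
One reading of "top 5% value" and "bottom 5% value" is the 95th and 5th percentiles of after-tax income. The sketch below follows that reading and uses inclusive interpolation, which is assumed to match Excel's PERCENTILE.INC. The income figures are invented for illustration.

```python
import statistics


def tail_cutoffs(values):
    """Return the 5th and 95th percentiles using inclusive interpolation."""
    # n=20 splits the data into 5% steps, giving 19 cut points.
    cuts = statistics.quantiles(values, n=20, method="inclusive")
    return cuts[0], cuts[-1]


# Hypothetical after-tax incomes: 1000, 2000, ..., 100000 AUD.
incomes = list(range(1000, 101000, 1000))
p5, p95 = tail_cutoffs(incomes)
print("bottom 5% cutoff:", p5, "top 5% cutoff:", p95)
```

Under this reading, about 5% of sampled households fall below the first value and about 5% fall above the second.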

B. The series OwnHouse represents whether a household owns a house or not. Let X be a random variable such that X = Number of households who own a house.

(i) Is this a quantitative or a qualitative variable?

(ii) What would be the probability distribution of this random variable if we choose randomly (a) Only 1 household? (b) 250 households? Provide any relevant condition(s) to justify your answer.
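
If each selected household is treated as an independent trial with the same probability p of owning a house, a count of owners among n households can be modelled with a binomial probability function, with n = 1 as the special case. The sketch below illustrates that model only. The value p = 0.6 is hypothetical, and whether independence is reasonable when sampling without replacement from 2000 households is exactly the kind of condition the question asks you to discuss.

```python
import math


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


p_own = 0.6  # hypothetical ownership probability, not taken from the data

# (a) One household: X can only be 0 or 1.
one_household = {k: binomial_pmf(k, 1, p_own) for k in (0, 1)}

# (b) 250 households: X ranges over 0..250.
n = 250
pmf_250 = [binomial_pmf(k, n, p_own) for k in range(n + 1)]
mean_250 = sum(k * prob for k, prob in enumerate(pmf_250))

print(one_household)
print("total probability:", sum(pmf_250), "expected owners:", mean_250)
```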

C. Draw a scatter plot of the natural log of total expenditures against the natural log of after-tax income, that is, ln(Texp) against ln(ATaxInc), and compute the coefficient of correlation. Describe the relationship you find between the two variables.
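
The correlation step can be sketched as below: take natural logs of both series, then compute the Pearson correlation coefficient of the logged values (LN and CORREL are the corresponding Excel functions). The two lists are invented, and the expenditure values are deliberately generated from a power law so the result is easy to sanity-check.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def log_log_correlation(texp, ataxinc):
    """Correlation between ln(texp) and ln(ataxinc); all values must be > 0."""
    if any(v <= 0 for v in list(texp) + list(ataxinc)):
        raise ValueError("natural log is undefined for non-positive values")
    return pearson_r([math.log(v) for v in ataxinc], [math.log(v) for v in texp])


# Hypothetical data: expenditure follows an exact power law of income,
# so the logged values lie on a straight line.
ataxinc = [20000, 35000, 50000, 80000, 120000]
texp = [2 * x**0.8 for x in ataxinc]
r = log_log_correlation(texp, ataxinc)
print("r =", r)
```

Real household data will not lie exactly on a line, so the coefficient from your sample will be below 1 in absolute value; its sign and size are what the task asks you to interpret.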

Task 4

A. Construct a contingency table between the gender and the level of education.

B. What is the probability that the head of a household is male and his highest level of education is Intermediate?

C. What is the probability that the head of a household is female and has a Bachelor degree?

D. Among male heads of household, what proportion have Secondary as their highest degree?

E. Do you think that the events "gender of household head is female" and "having the Master Degree" are independent?
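
The calculations behind Task 4 can be sketched as follows: count (gender, degree) pairs, then read joint, conditional, and marginal probabilities off the counts. The records are hypothetical, and the independence check shown is only the textbook comparison of P(A and B) with P(A) x P(B); in a sample the two rarely match exactly, so judging "close enough" is left to your reasoning.

```python
from collections import Counter


def contingency_table(records):
    """Count occurrences of each (gender, degree) pair."""
    return Counter(records)


def joint(table, gender, degree):
    """P(gender and degree)."""
    return table[(gender, degree)] / sum(table.values())


def marginal_gender(table, gender):
    """P(gender)."""
    return sum(c for (g, _), c in table.items() if g == gender) / sum(table.values())


def marginal_degree(table, degree):
    """P(degree)."""
    return sum(c for (_, d), c in table.items() if d == degree) / sum(table.values())


def conditional(table, degree, gender):
    """P(degree | gender)."""
    return joint(table, gender, degree) / marginal_gender(table, gender)


# Hypothetical (GHH, Highest Degree) pairs for ten households.
records = (
    [("M", "S")] * 3 + [("M", "I")] * 2 + [("M", "M")] * 1
    + [("F", "B")] * 2 + [("F", "M")] * 1 + [("F", "S")] * 1
)
table = contingency_table(records)

p_male_and_intermediate = joint(table, "M", "I")
p_female_and_bachelor = joint(table, "F", "B")
p_secondary_given_male = conditional(table, "S", "M")

# Independence check: compare P(F and Master) with P(F) * P(Master).
p_f_and_master = joint(table, "F", "M")
p_f_times_master = marginal_gender(table, "F") * marginal_degree(table, "M")
print(p_f_and_master, p_f_times_master)
```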

Reviews

len2192155

12/12/2018 9:17:59 PM

Marks Distribution (Total Marks = 100)
Task 1: 15 + 10 + 10 = 35
Task 2: 7.5 + 7.5 =
Task 3: 5 + 5 + 5 =1
Task 4: 12.5 + 2.5 + + 2.5 + 5 =
Report Organisation: 2x5 = 10
Proper Numbering of the tasks and questions and explanation; Tables with Title/Caption,


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd