Reference no: EM132341047
Statistical Modelling Assignment
OVERVIEW OF THE ASSIGNMENT
This assignment will test your skills of collecting and analysing data to answer a specific business problem. It also gives you the opportunity to apply the theories you have learned in this course such as finding numerical summaries, displaying with appropriate graphs and using statistical inferences to solve business problems, including constructing hypotheses, test them and interpret the findings. You may have to use two data sets. One Data set will be sent you vis KOI email individually and other you need to collect.
Suppose you are working for an agency who analyses Service Station and price history of petrol to give information to NRMA so that they can give a media report on petrol prices. You will be given series of research questions. Use your knowledge that you gain from this course to answer these questions by displaying appropriate outputs of Excel, Statkey or Wolfram alpha. Use these answers to write and executive summary which might helpful for NRMA to prepare for media report.
TASK DESCRIPTION: WRITTEN REPORT
There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.
Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is a subset of Service Station and Price History September 2016 individual sample file, provided by the Australian Government Open Data and has been edited to only include a subset of the cases and variables.
The original dataset attached below and it is under the license of Creative Commons Attribution 3.0 Australia. Data dictionary of the edited dataset is given in the following table.
| Variable |
Description |
Values |
| Service station name |
Name of the service station and its suburb |
Eg Caltex Homebush |
| Address |
Service station address |
Eg 940 Pittwater Road & Hawkesbury Avenue, Dee Why NSW 2099 |
| Suburb |
Which suburb this service station is in |
Eg Mount Druitt |
| Post code |
Australian post code of the suburb |
Eg 2365 |
| Brand |
Which brand of service station |
Eg Caltex Woolworths |
| Fuel Code |
Type of petrol code |
Eg E10 |
| Price |
Price of the petrol price |
Eg $110.9 |
Dataset 2: Collect data (e.g. via a survey) that will answer your research question in Section 4. There is no requirement about the number of variables, sampling methods and sample size, but you need to justify your approaches in Section 1 (see below).
Both datasets should be saved in an Excel file (one file, separate worksheets). All data processing should be performed in Excel or Statkey.
Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure:
Section 1: Introduction
a. Give a brief introduction about the assignment and search a related article and write a paragraph of summary which should be a support for your report. You need to give full citation of the article.
b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What types of variable(s) is involved? Explain briefly what the possible cases are used in this study.
c. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What is/are type of variable(s) involved? Give a description of cases you consider for this data set.
Section 2: Analysis of single variable in Dataset 1.
a. To answer the research question "What is the shape of the distribution of the variable Price?", provide a suitable numerical summary and graphical display for the variable Price of Dataset 1. Give detail comments to answer research question where you need to use all outputs.
b. Now to answer the research question "Is the average price of petrol is in all service station in September 2016 is more than 115 Australian cents?" setup appropriate hypotheses, perform hypotheses test by following all steps of hypotheses test and answer the research question by writing the conclusion of the test.
Section 3: Analysis of two variables in Dataset 1
NRMA always report to media by comparing the price of petrol with major brand of service stations namely Caltex, Caltex Woolworths, Coles Express and 7-Eleven.
a. Give numerical summary and appropriate graphical display for comparing the price of petrol of those four major Brands.
b. Perform a suitable hypothesis test at a 5% level of significance to test whether there a price difference among these four major Brands.
c. Use the conclusion in part b and the outputs in part a to write an accurate information of the petrol price. Your answer should contain that whether there is price differences and if there is, try to find which Brand price is lowest.
Section 4: Collect and analysis Dataset 2
Choose at least 30 KOI students and find out which service station they prefer to buy petrol and provide appropriate numerical and graphical summary. Use these outputs to write a comment.
Discussion and conclusion
a. Write an executive summary by combining all of your finding in the previous sections which must be a valuable for NRMA to report to media
b. Give a suggestion for further research
TASK DESCRIPTION: PRESENTATION/INTERVIEW
A presentation/interview for the assignment is scheduled on Week 11, in your allocated tutorial.
You do NOT need to prepare a presentation material (e.g. power-point slides), instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to reproduce what you have made in your written report (e.g. generate a chart or numerical summary using Excel or Statkey).
Attachment:- Dataset.rar