Analysis of single variable in dataset

Reference no: EM132112520

Statistical Modelling Assignment

OVERVIEW OF THE ASSIGNMENT

This assignment tests your skills in collecting and analysing data to answer a specific business problem. It also gives you the opportunity to apply the theories you have learned in this course, such as finding numerical summaries, displaying data with appropriate graphs and using statistical inference to solve business problems, including constructing hypotheses, testing them and interpreting the findings. You will need to use two datasets. One dataset will be sent to you individually via KOI student email, and you need to find or collect the other.

Suppose you are working for an agency that analyses NSW transport system data to make recommendations for improving the public transport system. You will be given a series of research questions. Use the knowledge you gain from this course to answer these questions by displaying appropriate outputs from Excel, StatKey or Wolfram Alpha. Use these answers to write an executive summary that could serve as a valuable recommendation to Transport for NSW.

TASK DESCRIPTION: WRITTEN REPORT

There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email containing a dataset that is specifically allocated to you. This dataset is a subset of the "Opal Tap On and Tap Off Location - 8th to 14th August 2016" individual sample file, provided by Transport for NSW Open Data, and has been edited to include only a subset of the cases and variables.

The original dataset is available under the Creative Commons Attribution 3.0 Australia licence. The data dictionary of the edited dataset is given in the following table.

Variable | Description | Values
mode | Type of public transport | Bus, Train, Ferry and Light Rail
date | Date on which the tap on/off occurred | Day/month/year
tap | Whether the record is a tap on or a tap off | On and Off
loc | Location of the stop: postcodes for buses, station names for the other modes | Postcodes and names of the stations
count | Total number of taps on or off at the given location on the given date | Number

Dataset 2: Collect data (e.g. via a survey) that will answer the research question given in Section 4. There is no requirement on the number of variables, sampling method or sample size, but you need to justify your approach in Section 1 (see below).

Both datasets should be saved in an Excel file (one file, separate worksheets). All data processing should be performed in Excel or StatKey.

Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction
a. Give a brief introduction to the assignment. Search for a related article and write a one-paragraph summary of it that supports your assignment. You need to give the full citation of the article.
b. Dataset 1: Give a short description of this dataset. Is this primary or secondary data? What types of variables are involved? Explain briefly what the possible cases in this study are.
c. Dataset 2: Explain how you collected the data and discuss its limitations (e.g. whether your sample is biased). Is this primary or secondary data? What type(s) of variable(s) are involved? Describe the cases you consider for this dataset.

2. Section 2: Analysis of single variable in Dataset 1
a. To answer the research question "Which type of public transport was most used by the people of NSW from 8th to 14th August 2016?", provide a suitable numerical summary and graphical display for the variable mode of Dataset 1. Comment in detail to answer the research question.
b. To answer the research question "Do more than 50% of public transport users in NSW use the mode of transport found in part a?", set up appropriate hypotheses, perform a hypothesis test and answer the research question by writing the conclusion of the test.
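
As a rough cross-check of the two steps in Section 2, the sketch below totals the count variable by mode and then runs a right-tailed one-proportion z-test of H0: p = 0.5 against Ha: p > 0.5. The assignment requires Excel or StatKey for the actual processing, so this Python version is only an illustration. The rows are invented and are not taken from Dataset 1, and treating every tap as an independent observation is an assumption that should be justified in the report.

```python
import math
from collections import defaultdict

# Hypothetical (mode, count) rows in the shape of Dataset 1; NOT real data.
rows = [
    ("Train", 120), ("Bus", 60), ("Ferry", 10),
    ("Light Rail", 10), ("Train", 40), ("Bus", 20),
]

def totals_by_mode(rows):
    """Sum the count column for each transport mode."""
    totals = defaultdict(int)
    for mode, count in rows:
        totals[mode] += count
    return dict(totals)

def one_proportion_z_test(successes, n, p0=0.5):
    """Right-tailed z-test of H0: p = p0 against Ha: p > p0."""
    p_hat = successes / n
    se = math.sqrt(p0 * (1 - p0) / n)            # standard error under H0
    z = (p_hat - p0) / se
    p_value = 0.5 * math.erfc(z / math.sqrt(2))  # P(Z >= z) for a standard normal
    return p_hat, z, p_value

totals = totals_by_mode(rows)
n = sum(totals.values())
top_mode = max(totals, key=totals.get)           # most used mode (part a)
p_hat, z, p_value = one_proportion_z_test(totals[top_mode], n)

print(totals)
print(f"{top_mode}: p_hat={p_hat:.3f}, z={z:.2f}, p-value={p_value:.5f}")
```

The normal approximation used here is usually considered reasonable only when both n*p0 and n*(1 - p0) are at least 10, which is the kind of assumption check the hypothesis test asks for.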

3. Section 3: Analysis of two variables in Dataset 1
The NSW Government needs to decide whether to build an underground railway line from Parramatta, Bankstown or Gosford to Central. To prepare a recommendation:
a. Give a numerical summary and an appropriate graphical display for the variable location, considering only those three stations, and for the variable count, considering train data only.
b. Perform a suitable hypothesis test at the 5% level of significance to test whether there is a difference between the mean counts of taps on and taps off.
c. Use the conclusion of the test in part b and the outputs in part a to write a recommendation to the NSW Government.
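
One way to picture the test in part b is a randomization test for a difference in means, similar in spirit to what StatKey offers. The sketch below is a minimal Python illustration, not the required Excel or StatKey workflow. The two lists of counts are invented, and the choice of an unpaired two-sided test is an assumption; if taps on and off are matched by station and date, a paired analysis may be more appropriate.

```python
import random
from statistics import mean

# Hypothetical train counts for tap on and tap off; NOT taken from Dataset 1.
tap_on = [520, 610, 580, 495, 630, 570, 605]
tap_off = [500, 590, 600, 510, 615, 560, 585]

def randomization_test(a, b, reps=5000, seed=0):
    """Two-sided randomization test of H0: the two group means are equal."""
    rng = random.Random(seed)              # fixed seed so the run is repeatable
    observed = mean(a) - mean(b)
    pooled = list(a) + list(b)
    extreme = 0
    for _ in range(reps):
        rng.shuffle(pooled)                # reassign group labels at random
        diff = mean(pooled[:len(a)]) - mean(pooled[len(a):])
        if abs(diff) >= abs(observed):
            extreme += 1
    return observed, extreme / reps

observed, p_value = randomization_test(tap_on, tap_off)
reject_h0 = p_value < 0.05                 # 5% level of significance

print(f"observed difference = {observed:.2f}, p-value = {p_value:.3f}")
print("reject H0" if reject_h0 else "do not reject H0")
```

With these made-up numbers the observed difference is small relative to the spread of the counts, so the test does not reject H0; real data may of course behave differently.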

4. Section 4: Collect and analyse Dataset 2
You are interested in finding whether there is a difference between genders in their preferred transport mode (Bus, Train, Ferry and Light Rail). Considering an appropriate number of cases and variables, give a proper graphical display and use it to write a comment.
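
Before drawing a chart for Section 4, it can help to arrange the survey responses in a two-way table of gender by preferred mode and convert each row to percentages, which is what a side-by-side or segmented bar chart would display. The sketch below assumes each case is one respondent with two categorical variables; the responses and category labels are invented for illustration, and the real work should still be done in Excel or StatKey.

```python
from collections import Counter

# Hypothetical survey responses (gender, preferred mode); invented for illustration.
responses = (
    [("Female", "Train")] * 4 + [("Female", "Bus")] * 4
    + [("Female", "Ferry")] + [("Female", "Light Rail")]
    + [("Male", "Train")] * 5 + [("Male", "Bus")] * 3
    + [("Male", "Ferry")] + [("Male", "Light Rail")]
)

MODES = ["Bus", "Train", "Ferry", "Light Rail"]

def two_way_table(responses):
    """Count respondents for each (gender, mode) combination."""
    counts = Counter(responses)
    genders = sorted({gender for gender, _ in responses})
    return {g: {m: counts[(g, m)] for m in MODES} for g in genders}

def row_percentages(table):
    """Convert each gender's counts to percentages of that gender's total."""
    result = {}
    for gender, row in table.items():
        total = sum(row.values())
        result[gender] = {mode: 100 * count / total for mode, count in row.items()}
    return result

table = two_way_table(responses)
pct = row_percentages(table)

for gender in table:
    print(gender, table[gender], pct[gender])
```

Comparing row percentages rather than raw counts keeps the comparison fair when the two gender groups have different sample sizes.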

5. Section 5: Discussion & Conclusion
Write an executive summary combining all your findings from the previous sections; it should form a valuable recommendation for Transport for NSW. Give a suggestion for further research.

TASK DESCRIPTION: PRESENTATION/INTERVIEW

A presentation/interview for the assignment is scheduled for Week 11, in your allocated tutorial.

You do NOT need to prepare presentation material (e.g. PowerPoint slides); instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to reproduce what you produced in your written report (e.g. generate a chart or numerical summary using Excel or StatKey).

Attachment:- Data 15.rar

Verified Expert

The study design is exploratory: the research is performed mainly to investigate a problem for which a solution has yet to be derived. Descriptive statistics are computed first, as this is the general procedure for understanding the distribution of the data.


Reviews

inf2112520

11/1/2018 3:57:54 AM

For Dataset 2, I was asked to "find whether there is a difference in preference between different gender in terms of their transport mode (Bus, Train, Ferry and Light Rail)" by considering an appropriate number of cases and variables, giving a proper graphical display and using it to write a comment. Thanks for making this assignment very simple and explaining all of its aspects to me.

inf2112520

11/1/2018 3:56:07 AM

Section 5: Discussion and Conclusion 5.a 5 Executive summary: 5 Write an executive summary by combining all your findings in the previous sections which must be a valuable recommendation for NSW Transport. 4.b. 2 Giving further research: 2 1.7 Written presentation

inf2112520

11/1/2018 3:56:01 AM

Correct Choice of Numerical summary: 1 Correct numerical values for all three Categories: 2 Comment: 2 a. Using appropriate numerical summary, describe the variables location with only categories Bankstown, Gosford and Parramatta and Count. 3.b. 6 Correct hypotheses: 1 Correct ANOVA table: 2 Correct p-value: 1 Correct conclusion: 2 c. Perform a suitable hypothesis test at a 5% level of significance to test whether there is a difference between the means of these categories. 3.c 5 Usage of conclusion in part b: 1 Usage of graph and numerical values: 2 Good recommendation: 2 d. Use conclusion in part b and outputs in part a appropriately to write a recommendation. Section 4: Collect and Analysis a data set 2 4.a 5 Correct Choice of graph: 1 Correct graph based on data: 1 Title/label/legends: 1 Use graph to answer the research question: 2 a. Using appropriate graphical display, describe the variables in data set 2

inf2112520

11/1/2018 3:55:28 AM

2.a. 5 Correct choice of numerical summary: 1 Correct numerical values for all four Categories of the variable Mode: 2 Use graph and numerical summary to answer the research question: 2 a. Using suitable numerical summary, to answer the research question. 2.b. 5 Correct Hypotheses: 1 Checking Assumptions: 2 Correct Test Statistics: 2 b. Perform the hypotheses test for proportion with first three steps 2.b. 3 Correct P- Value: 1 Correct conclusion: 2 b. Perform the hypotheses test for proportion with last two steps Section 3: Analysis of two variables 3.a. 4 Correct Choice of graph: 1 Correct graph based on data: 1 Title/label/legends: 1 comment: 1 a. Using appropriate graphical display, describe the variables location with only categories Bankstown, Gosford and Parramatta and Count. 3.a. 5

inf2112520

11/1/2018 3:55:09 AM

Primary/secondary: 1 Types of variables: 1 Description of cases : 1 c. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What type of variable(s) is/are involved? You don’t need to display your data in this section. Section 2: Analysis of single variable 2.a. 5 Correct choice of graph: 1 Correct graph based on data: 1 Title/label/legends: 1 Comments: 2 a. Using suitable graphical display, describe the variable Mode for Dataset 1. Make sure your graph shows the appropriate features.

inf2112520

11/1/2018 3:28:18 AM

Clear description: 2 Primary/secondary: 1 Types of variables: 1 Description of cases: 1 b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What types of variable(s) is involved? Describe the cases. 1.c. 5 Clear data collection description: 1 Limitation: 1

inf2112520

11/1/2018 3:27:39 AM

Section Mark Criteria Question Section 1: Introduction 1.a. 5 Clear and concise intro: 2 Proper citation: 1 A summary of a related article: 2 a. Give a brief introduction about the assignment, including your research question. Include a short summary of a related article with a proper citation. 1.b. 5

inf2112520

11/1/2018 3:27:17 AM

The assignment is correct. There is a very important part: I have uploaded a document that outlines all the requirements, including a short summary (8-10 lines preferably) of an article related to this topic, along with citations and references. It is 30 percent of my grade. I am attaching an additional file that I have collected from my tutor. It is a marking rubric. Please check against this rubric to see if all the marking criteria have been met.

len2112520

9/14/2018 1:58:28 AM

The first file has the requirements. The second file contains Dataset 1. Dataset 2 needs to be created through surveys; you can just make it up. The sample size needs to be at least 20, and both the Word and Excel files have to be included:
1. Main report, in a Microsoft Word document file (this is the file that will be marked; it should contain all necessary tables and figures)
2. Dataset, in a Microsoft Excel file (this is just a supporting file)
Main report (Word document):
1. Size: A4
2. Use the Assignment Cover Page (download from Moodle) with your details and signature
3. Single spacing
4. Font: Calibri, 11pt
Dataset (Excel document):
1. Dataset 1 in Sheet 1
2. Dataset 2 in Sheet 2
3. Data processing for each section in other sheets (rename the sheets appropriately)


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd