Determine the inter-quartile range

Assignment Help Advanced Statistics
Reference no: EM131623487

Presentation

- Your answers must be presented in task number order and be clearly labelled with the appropriate task number.
- Your assignment must be presented in Microsoft (MS) Word. Copy and paste any relevant Excel outputs to this document immediately before (above)any relevant written answers to each task.
- If you are unfamiliar with the use of the MS Word Equations Editor, you may write algebraic/mathematical/statistical symbols and notation in neat handwritten form.
- Your answers must be clear. You must highlight relevant items on any required Excel outputs and make reference to them in your written answers.
- When asked to perform a manual calculation (i.e. the use of MS Excel is not specified) you must show all working. This must include intermediate steps where relevant. Failure to do so will result in a loss of marks.
- Completed assignments are to be presented for correction on A4 paper, stapled in the top left hand corner.You are permitted to print on both sides of the paper. Colour printing is recommended for graphs/charts. If printed as greyscale, be mindful and creative to make the greyscalesdistinct shades.
- Do not submit the assignment with fancy bindings, folders or plastic envelopes.
- Do not include the assignment questions nor the population property data with your submitted assignment.
- You are permitted to consult reference textbooks and notes and to communicate with other students. However, the work you hand-in for correction must be your own. Be aware that the University penalties for plagiarism are severe.

Introduction

The Assignment Data(PopulationPropertyData.xls) file, which you can access from the
Assessment Information page on the unit website contains, in the range A1:I401, real estate sales data for a population of 400 properties around Melbourne in a particular week. You are required to select a random sample of 50 properties from this population. The variables in the data set are as follows:
V1 = Region where property is located (1 = North, 2 = West, 3 = East, 4 = Central)
V2 = Property type (0 = Unit, 1 = House)
V3 = Sale result (1 = Sold at auction, 2 = Passed-in, 3 = Private sale, 4 = Sold before auction). Note that a blank cell for this variable indicates that the property did not sell.
V4 = Building type (1 = Brick, 2 = Brick veneer, 3 = Weatherboard, 4 = Vacant land)
V5 = Number of rooms
V6 = Land size (Square metres)
V7 = Sold Price ($000s)
V8 = Advertised Price ($000s)
Column A (PN), contains the property identification numbers from 001 to 400 properties.

Selecting your Random Sample and Creating your Sample Data File

To select your random sample, you need:
- A printed copy of the Random Number Table handy.
- Open the PopulationPropertyData.xls file on computer screen.
- Create aSamplePropertyData Excel file and keep it open on computer screen.

In order to select the sample data that will form the basis of your assignment you will need to make use of the random number table provided as a pdf file (RandomNumbers.pdf) on the Assessment Information page of the unit website.The provided table of random numbers is, as the title suggests, a sequence of randomly generated numerical digits (0 to 9). These digits are arranged in a table with ten columns (numbered 0 to 9) and one hundred rows (numbered 01 to 00) spread over two pages. The entries in each column of each row consist of six single digits.

Your first task is to select 50 three-digit random (property) numbers ranging from 001 to 400 from the table of random numbers. The type of simple random sampling that we will be engaged in here is termed "without replacement" because we specifically do not want to allow a property number to be selected more than once. If we allowed this to occur we would run the risk of the sample being biased and so not representative of the population. In the population, a particular property only occurs once and so it would not do to allow a particular property to occur more than once in your sample. In this way we can be more assured that the sample is typical of the population and so perform inferential statistical analyses about the population with some confidence.

In order to select your 50random property numbers you will need to first go to a starting position row and column in the random number table (Note ~ not the population property data) defined by the last threedigitsof your VU student identification number (the assignment marker will check your student ID number against the three digits number you use to collect the random sample). The last two digits of your VU ID number identifies the row and the third last digit identifies the column of your (relatively) "unique" starting position.

For demonstration purposes, if the last three digits of your student identification were 7, 4 and 9 (i.e. 749), you would commence your property number selection at the starting position -row 49 and column 7 of the random number table. You are required to colour/highlight the starting row number 49 and the starting column number 7.You should be able to see that the six digit number occupying that position is 217035.

Then, moving across therow,from left to right from the starting position, examine the first three digits of each six digit number and then the second three digits in each of the columns of the table. If any of these three digit numbers are between 001 to 400 inclusive, they are "good" numbers (the population data numbered from 001 to 400). Ignore any number greater than 400 or equal to 000. They are "not-good" numbers

Continue reading across row 49from left to right starting at column 7 as instructed, you would encounter the following three digit good numbers:

217, 035, 306, 150, ...

You need to record the first good property numbers, i.e.217, and open the PopulationPropertyData.xlsExcel filelocated on the Assessment Information page of the unit website. On the spreadsheet, scroll down the PN column to locate 217 (note: do not select the Excel spreadsheet rownumber 217. Select the row with 217 in the PN column). At this row, highlight from 217 under the PN column across to the right up to the V8 column, use Cut and Paste procedure to cut the row of data and paste the data into a new Excel file (name it and save it as SamplePropertyData.xlsx). Next is to repeat the Cut and Paste process for PN 035, and for PN 306 and the subsequent three digit goodnumbers selected from Random Number Table up to the point whenthe row of the spreadsheet in theSamplePropertyDatafile grownup to 50 rows of data. Make sure you copy the column headings, PN, V1, ... V8 into your sample data file as the heading for the columns.

Each time a number is selected from the Random Number Table, insert a strikethrough mark over the selected number on the Random Number Table to mark it off. It is possible that you may come acrosssome three digit good numbers more than once (we call them "repeated" number). The use of the Cut and Paste procedure is the "without replacement" sampling procedureto ensure that no repeated PN number and the corresponding data can be select more than once in this sample selection process. When a repeated number is found, colour/highlight/cross-out it in the Random Number Table to indicate that this good number has not been used to select the sample data (See the Assignment Part I Model Answers file).

Note that if you reach to the end of Row 50 on the first page of the Random Number Tablebutstill not yet to collect 50 good numbers, continue the process on to Row 51 on the top of the second page of the Random Number Table (as the same practice in the Assignment Part I Model Answer). Similarly if you reach to the end of Row 00 on the second page,proceed on to row 01 on the top of the first page. Once 50 good numbers are selected and the 50 rows of data have been copied from the PopulationPropertyData file into the SamplePropertyData file, thiswill form a completed sample data set occupying spreadsheet columns A to I and spreadsheet rows 1 to 51 (Refer to the Assignment Part I Model Answers file on the Assessment Information).

Assignment Part I

Part I of the assignment simply requires the submission of a hard copy of yoursample property datapresented in a maximum of no more than 3 printed pages in total. (See the Assignment Part I Model Answer). This sample data set will form the basis of the statistical presentation and analysis tasks contained in Part II of the assignment.

Task 1

(a) Make a hard copyof your Random Number Table containing the following:

(i) The highlight of the starting row and starting column of the sample selection process. (Refer to the Assignment Part I Model Answer).

(ii) The strikethrough/mark on the three digits good numbers and the cross-out of the repeated number(s). (Refer to the Assignment Part I Model Answer).

(b) Print a hard copy (see note below) of your sample property data (9 columns x 51 rows of data plus the column headings row) from the Excel file (SamplePropertyData) obtained per the above instructions.

Assignment Part II

Answers to the six assignment tasks in Part II must be based on the sample data file that you have created in Part I. All tasks in this assignment require you to obtain an Excel output prior to performing some analysis. Copy and Paste these outputs to your assignment MS- Word document immediately preceding any subsequent analysis. Explanations must be precise and to the point. Charts and tables must have appropriate titles and numerical values must be rounded to an appropriate number of decimal places and accompanied by the correct units of measure.

Task 2

Use Excel to produce a Frequency Column Chart and a Relative Frequency Pie-Chart for your sample to show the number and proportion, respectively, of each building type.

Use these graphical summaries to answer the following questions:

(a) How many properties in your sample consist of brick buildings?

(b) Which building type occurs most frequently in your sample?

(c) What proportion of properties in your sample consists of weather board buildings?

Task 3

(a) Use Excel to sort your sample "Sold Price" data and paste into your MS Word assignment document.
(b) Use the percentile location formula;
, and the three associated rules(Slide 11 of Week 2 Seminar, Session 1) to determine:

(i) The 70th percentile.

(ii) The first and third quartiles.

(c) Briefly explain what the 70th percentile that you have determined informs you about your sample "Sold Price" data.

(d) Determine the Inter-Quartile Range of your sample "Sold Price" data and provide a brief explanation of what information this statistic provides about your sample data.

Task 4

(a) Use Excel to produce a Descriptive Statistics table for your sample "Sold Price" data and paste into your MS Word assignment document.

(b) Use results from Task 3 to determine manually for this data, the upper and lower inner fence limits;

IFUL = Q3 + 1.5 x IQR

and IFLL = Q1 - 1.5 x IQR

(c) Based on the limits calculated in (b), choose from the numerical summary measures provided in the Descriptive Statistics table, and/or measures calculated previously in Task 3;

(i) an appropriate measure of central tendency, and,

(ii) an appropriatemeasure of dispersion for your sample "Sold Price" data.

Provide a brief explanation of the reasoning behind your choice in both cases.

Task 5

Remember to show all working! Failure to do so will result in the loss of marks.

(a) From the Descriptive Statistics table obtained in Task 5, identity threepieces of evidence that indicate whether your sample "Sold Price" data has been obtained from a normally distributed population or not. What is your conclusion? Note: Make sure only onepiece of evidence relates to the shape of the sample data.

(b) Regardless of your conclusion in above, assume the "Sold Price" population data is normally distributed. Applying the Standard Normal tables, calculate how many "Sold Price" observations in your sample would expect to lie within 1.5 standard deviations of the mean (i.e. between z = -1.5 and z = +1.5).

(c) Use the mean and standard deviation from the Descriptive Statistics tableof Task 5 to calculate the bound for 1.5 standard deviation spread from the mean. Using the "Sold Price" sample data, manually count the number of observations fall within the bound. State whether this count matches, approximately, your answer to (b) and hence whether this result confirms (or not) your conclusion in (a).

Task 6

Remember to show all working! Failure to do so will result in the loss of marks.

(a) Use Excel to produce a Descriptive Statistics table for the "Sold Price" variable in your sample suitable for constructing an interval estimate of the population mean "Sold Price". Hence determine:

(i) A point estimate of the mean "Sold Price"of the population of properties.

(ii) A 90% confidence interval estimate of the mean "Sold Price" of the population of properties.

(iii) Make a brief verbal statement explaining the meaning of the confidence interval estimate obtained in (ii) in the context of the variable in this task.

(b) If the population mean "Sold Price" is actually 650 ($000s), would you consider the interval estimate obtained in (a), to be satisfactory? Explain why or why not.

Task 7

Remember to show all working! Failure to do so will result in the loss of marks.

(a) Use Excel to produce a Descriptive Statistics table for the brick veneer properties in your sample suitable for constructing an interval estimate of the population proportion of brick veneer properties. Hence determine:

(i) A point estimate of the proportion of brick veneer properties in the population.
(ii) A 99% confidence interval estimate of the proportion of brick veneer properties in the population.

(b) Using the following formula:

(sample statistic) ± (critical z or t) x (standard error of the sample statistic)

Use the rule of thumb for good normal approximation (Slide 3 of Week 7 Session 2) for proportion, then the Empirical Rule(Slide 8 of Week 5 Session 1) for a Normal distribution to determine a 95% confidence interval estimate of the proportion of brick veneer properties in the population.

(c) Compare, in terms of the precision, the interval manually calculated in (b) with the interval obtained from the Descriptive Statistics table in (a). Explain why the direction of the change in precision is expected.

Reference no: EM131623487

Questions Cloud

Computer hardware and software paper : The purpose of this assignment is to understand what basic hardware and software components make up a computer. Students will research hardware components
Anything except price performance : Walmart's success is due to its low prices. Why would they need to monitor anything except price performance?
Airport manager of a city-owned public airport : The airport's development has been partially funded through federal grants. The airport authority, your boss, asks you the following questions:
How was the lighting technique suited to the genre of film : How was the lighting technique suited to genre of film? Documentary films tend to rely on natural light as a way of creating an overall tone of authenticity.
Determine the inter-quartile range : BE01106 - BUSINESS STATISTICS How many properties in your sample consist of brick buildings and which building type occurs most frequently in your sample?
Calculate the regular earnings and overtime earnings : Calculate the regular earnings, overtime earnings, and gross pay for each employee
Define a similarity in the names of the professor : Professor Smith knew that her colleague, Professor Smitt had terminal cancer. Smith applied for life insurance on Smitt's life and named herself
What was the total average lavilla profit : What was the total average Lavilla profit per day during the ski season of 2004? ski resort was at max capacity, with 1200 skiers, each staying an average.
Corporation social responsibility : In the Unit 1 Discussion Board, you analyzed a corporation's social responsibility with regard to its customers. Not only do corporations have responsibly

Reviews

len1623487

9/1/2017 8:26:41 AM

Introduction The Assignment Data(PopulationPropertyData.xls) file, which you can access from the Assessment Information page on the unit website contains, in the range A1:I401, real estate sales data for a population of 400 properties around Melbourne in a particular week. You are required to select a random sample of 50 properties from this population. The variables in the data set are as follows: V1 = Region where property is located (1 = North, 2 = West, 3 = East, 4 = Central) V2 = Property type (0 = Unit, 1 = House) V3 = Sale result (1 = Sold at auction, 2 = Passed-in, 3 = Private sale, 4 = Sold before auction). Note that a blank cell for this variable indicates that the property did not sell. V4 = Building type (1 = Brick, 2 = Brick veneer, 3 = Weatherboard, 4 = Vacant land) V5 = Number of rooms V6 = Land size (Square metres) V7 = Sold Price ($000s) V8 = Advertised Price ($000s) Column A (PN), contains the property identification numbers from 001 to 400 properties.

len1623487

9/1/2017 8:26:31 AM

• Your answers must be clear. You must highlight relevant items on any required Excel outputs and make reference to them in your written answers. • When asked to perform a manual calculation (i.e. the use of MS Excel is not specified) you must show all working. This must include intermediate steps where relevant. Failure to do so will result in a loss of marks. • Completed assignments are to be presented for correction on A4 paper, stapled in the top left hand corner.You are permitted to print on both sides of the paper. Colour printing is recommended for graphs/charts. If printed as greyscale, be mindful and creative to make the greyscalesdistinct shades. • Do not submit the assignment with fancy bindings, folders or plastic envelopes. • Do not include the assignment questions nor the population property data with your submitted assignment. • You are permitted to consult reference textbooks and notes and to communicate with other students. However, the work you hand-in for correction must be your own. Be aware that the University penalties for plagiarism are severe.

len1623487

9/1/2017 8:26:23 AM

Presentation • Your answers must be presented in task number order and be clearly labelled with the appropriate task number. • Your assignment must be presented in Microsoft (MS) Word. Copy and paste any relevant Excel outputs to this document immediately before (above)any relevant written answers to each task. • If you are unfamiliar with the use of the MS Word Equations Editor, you may write algebraic/mathematical/statistical symbols and notation in neat handwritten form.

len1623487

9/1/2017 8:26:02 AM

Remember to show all working! Failure to do so will result in the loss of marks. (a) Use Excel to produce a Descriptive Statistics table for the brick veneer properties in your sample suitable for constructing an interval estimate of the population proportion of brick veneer properties. Hence determine: (2 marks) (i) A point estimate of the proportion of brick veneer properties in the population. (1 mark) (ii) A 99% confidence interval estimate of the proportion of brick veneer properties in the population. (1 mark)

len1623487

9/1/2017 8:25:53 AM

(sample statistic) ? (critical z or t) ? (standard error of the sample statistic) Use the rule of thumb for good normal approximation (Slide 3 of Week 7 Session 2) for proportion, then the Empirical Rule(Slide 8 of Week 5 Session 1) for a Normal distribution to determine a 95% confidence interval estimate of the proportion of brick veneer properties in the population. (4 marks) (c)Compare, in terms of the precision, the interval manually calculated in (b) with the interval obtained from the Descriptive Statistics table in (a). Explain why the direction of the change in precision is expected. (2 marks)

Write a Review

Advanced Statistics Questions & Answers

  Capacity planning and facility location

Analyze the key concepts related to capacity planning and facility location for your business. Develop an appropriate level, chase, or hybrid aggregate plan to maintain a competitive advantage. Provide a rationale for developing the type of plan yo..

  Write r code for applying given method

Write R code for applying this method in order to compute the nearest correlation matrix when a symmetric matrix is given(see paper for details).

  Calculate the correlation matrix for all variables

Enter the following data into SPSS and calculate the correlation matrix for all four variables. Identify which correlations are significant and state and interpret your findings. Report significant correlation coefficients and p values.

  Value of preferred stock

A firm has an issue of preferred stock outstanding that has a stated annual dividend of $4. The required return on the preferred stock has been estimated to be 10%. Compute the value of the preferred stock.

  Find the probability density of ui

Show that each inter-renewal interval Xi = Si - Si-1  (where S0  = 0) is the sum of two  independent rv s, Yi + Ui where Yi is the ith service time; find the probability density of Ui.

  Construct a frequency distribution for payment method

Construct a frequency distribution for Payment method

  Determining break-even point-profit gained

To produce x number of units of glass vases cost C(x)=12x + 39. My revenue is R(x)=25x. Both cost and revenue and cost are in dollars.

  Determining the number of clusters

What are your personal thoughts about the three different clustering algorithms - Determining the number of clusters

  Consequences of sale and land when forming partnership

Mike and Lisa formed a partnership at the beginning of the year. They were equal partners and they had the same basis. When the partnership was formed, Mike contributed the following items:

  Determine the skewness of the dataset

Compute the mean and standard deviation - find the median, and compare the mean and median to determine the skewness of the dataset.

  Find mean time t for population in original markov process

Find the mean time T for the population in the original Markov process to die out. Note: We have seen before that removing transitions from a Markov chain or process.

  Taskyour task is to answer a set of research questions

taskyour task is to answer a set of research questions using the supplied data set lsquomalaria data set.sav. details

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd