What is the central limit theorem

Assignment Help Basic Statistics
Reference no: EM131044900

Business resource and Statistics practical exam

Question one-

1. What is the Central Limit Theorem? Why and when is it used? Please elaborate on how the Central Limit Theorem and the Empirical Rule are connected. Please use a graph to illustrate your answer.

2. In probability theory, if population mean is μ and standard deviation is σ, we know that the standard deviation of a sampling distribution is σ/√n If a random sample of size n = 49 is obtained from a population with µ = 75 and (population standard deviation) = 14

What can you say about the distribution of the sampling dataset in terms of?

i. The shape of the distribution curve?

ii. The value of the mean?

iii. The value of the standard deviation

Quewstion two -

i. What is meant by the term interquartile range

ii. Describe the steps you need to take to calculate the interquartile range

iii. Find the median, lower quartile, upper quartile and interquartile range of the following data set:

18 20 23 20 23 27 24 23 29

iv. Draw a box and whisker plot to visualise the five number summary of the data set in iii. above

iv. What is a binomial distribution and when is it used? Please list the five key aspects of the binomial distribution

vi. For a binomial distribution we know that the standard deviation, σ = √np(1-P)

Where n is the number of trials and p is the probability of success.

Calculate the standard deviation of the binomial distribution if n=4 and P=0.25

Question Three-

1. Explain the difference between the following terms. It would be useful to explain your answer with the diagram.

A. Data (or Raw Data)

B. Information (or records)

Question four-

Consider the following three data sets; A, B and C

A. = {9, 10, 11, 7, 13}

B. = {10, 10, 10, 10, 10}

C. = {{1, 1, 10, 19, 19}

I. Calculate the mean of each data set

II. Calculate the standard deviation of each data set, given the formula:

III. Which set has the largest standard deviation?

IV. Is it possible to answer question iii) without calculations of the standard deviation? Explain your answer (show the rough or working)

C. Statistics
D. Knowledge

2. List four basic ways of visualising data.

3. Briefly explain the difference between descriptive statistics and inferential statistics.

4. Univariate Analysis involves the examination across cases of one variable at a time. In analysing a variable, there are three major characteristics that we tend to look at:

Distribution
Central Tendency
Dispersion/Variability

Briefly explain each of these terms and give examples of each. You can use drawings to illustrate your answer

5. With regard to the standard deviation of data distributions, what is the Empirical Rule?

Question Five-

Fill in the missing words on the answer sheet provided.

1. A study is under way in Lough Key Forest Park to determine the adult height of oak Trees. Specifically, the study is attempting to determine what factors aid a tree in reaching heights greater than 15m tall. It is estimated that the park contains 25,000 adult oak trees. The study involves collecting heights from 250 randomly selected adult oak trees and analysing the results. Identify the population from which the study was sampled.

A. The 250 randomly selected adult oak trees.

B. The 25,000 adult oak trees in the park.

C. All the adult oak trees taller than 15m.

D. All chestnut trees, of any age, in the park.

2. From the same study identify the variable of interest:

A. The age of an oak tree in Lough Key Forest Park.

B. The height of an oak tree in Lough Key Forest Park.

C. The number of oak trees in Lough Key Forest Park.

D. The species of trees in Lough Key Forest Park.

3. Again from the same study above what is the sample in the study?

A. The 250 randomly selected adult oak trees.

B. The 25,000 adult oak trees in the park.

C. All the adult oak trees taller than 15m.

D. All oak trees, of any age, in the park.

4. To monitor campus security, the security superintendent is taking a survey of the number of students in a concourse each 30 minutes of a 24-hour period with the goal of determining when patrols of the concourse would best serve the most students. If X is the number of students in the concourse each period of time, then X is an example of

A. A categorical random variable.

B. A discrete random variable.

C. A continuous random variable.

D. A statistic.

Question Six-

1. What is the Central Limit Theorem? Why and when is it used? Please elaborate on how the Central Limit Theorem and the Empirical Rule are connected. Please use a graph to illustrate your answer.

2. In probability theory, if population mean is μ and standard deviation is σ, we know that the standard deviation of a sampling distribution is σ/√n If a random sample of size n = 49 is obtained from a population with µ = 75 and (population standard deviation) = 14

What can you say about the distribution of the sampling dataset in terms of?

i. The shape of the distribution curve?

ii. The value of the mean?

iii. The value of the standard deviation?

Question seven

i. What is meant by the term interquartile range

ii. Describe the steps you need to take to calculate the interquartile range

iii. Find the median, lower quartile, upper quartile and interquartile range of the following data set:

18 20 23 20 23 27 24 23 29

iv. Draw a box and whisker plot to visualise the five number summary of the data set in iii. above

i. What is a binomial distribution and when is it used? Please list the five key aspects of the binomial distribution

ii. For a binomial distribution we know that the standard deviation, σ = √np(1-P) Where n is the number of trials and p is the probability of success.

Calculate the standard deviation of the binomial distribution if n=4 and P=0.25

Question Eight

1. What is a "Normal Distribution" of data, and how would you recognise it if it was graphed on a histogram?

2. What are two branches of statistics in brief and explain statistical terms in detail.

2. Explain population parameters and sample statistics. Also define measures of central tendency.

3. Calculate mean, median, mode, variance and standard deviation of the following dataset:

45, 50, 55, 55, 55, 60, 40

. Explain percentiles, quartiles and Inter Quartiles. Describe population, sample proportions and various types of charts used in statistics

4. Explain the concept of probability, and detail calculations of population, variance and standard showing formulas.

5.What is data analytics? Explain data analytics at workplace and provide various methods of turning data to knowledge.

6. Explain data mining and business reporting. Give appropriate examples of business reporting tools.

7.Explain central limit theorem in detail by giving examples.

8. A quiz consists of four multiple-choice questions, each with four possible answer choices (A, B, C, or D). One of which is correct. Suppose that an unprepared student does not read the question, but simply makes a random guess for each question. Let the random variable X equal the number of correct guesses the student makes for the five questions. Is this binomial? And if so what is n and what is p? Calculate variance and standard deviation.

The level of economic activity in a region has been recorded over a period of four years and the data is presented below:

Year                                               Quarter                                                      Activity Level
1                                                           1                                                              105
                                                             2                                                              99
                                                             3                                                              90
                                                             4                                                              110
2                                                           1                                                              111
                                                             2                                                               104
                                                             3                                                               93
                                                             4                                                               119
3                                                           1                                                               118
                                                             2                                                               109
                                                             3                                                                 96
                                                             4                                                                 127
4                                                           1                                                                 126
                                                             2                                                                  115
                                                              3                                                                 100
                                                              4                                                                 135

a. Construct a graph of this data

b. Find a centred four-point moving average and place it on your graph

c. Calculate the corresponding seasonal components(*show the rough or working of it)

Question nine

Sales on article B (‘000 units)

                           Q1      Q2       Q3         Q4
2010                  24.8   36.3     38.1       47.5
2011                 31.2    42.0    43.4         55.9
2012                 40.0    48.8    54.0         69.1
2013                  54.7   57.8    60.3          68.9

a. Plot the time series of the sales figures

b. Find a centred four-point moving average and place it on your graph

c. Calculate the seasonal component for each quarter

Comment on the following:

‘The process of polling is often mysterious, particularly to those who don't see how the views of 1,000 people can represent those of hundreds of millions'.

Question 11

Discuss the relative advantages and disadvantages of the postal questionnaire and the personal interview as a means of collecting data.

b. Compare simple random sampling and quota sampling as methods of selecting a representative sample from a population.

Question 12

Sampling methods are frequently used for the collection of data. State which type of sampling method is being described in the following situations.

i. One school in an area is selected at random and then all pupils in that school are surveyed.

ii. The local authority has a list of all pupils in the area and the sample is selected in such a way that all pupils have an equal probability of selection

iii. An interviewer surveys pupils emerging from every school in the area, attempting to question them randomly but in line with specified numbers of boys and girls in the various age groups.

iv. The local authority has a list of all pupils in the selected area. The first pupil is selected randomly from the list and then after every 100th pupil thereafter is selected for the survey.

B. List the advantages and disadvantages of Quota Sampling.

Question 13

The following transactions have been recorded on an automatic cash dispenser:

Value of transactions

(€):                    Number:
10                         46
20                         57
30                         68
50                         56
100                       47
150                       39
200                       34

Required:

a. Determine the mean

b. What value has the mode

c. Draw an appropriate graph for the above data

Question 14

A. Give two examples of variables which might be correlated

B. Distinguish between positive and negative correlation

C. What range of values can the product moment correlation coefficient use?

D. When should Spearman's rank correlation coefficient be used?

E. What are the advantages and disadvantages of the least squares method of linear regression?

Reference no: EM131044900

Questions Cloud

Company promotional literature : A sample of 900 computer chips revealed that 76% of the chips do not fail in the first 1000 hours of their use. The company's promotional literature states that 78% of the chips do not fail in the first 1000 hours of their use.
Income statement and balance sheet : The tax effects of temporary differences that give rise to deferred tax assets and liabilities are as follows ($ thousands):
What is the probability that he will select : An urn contains 9 white balls and 8 green balls. If Juan choose 9 balls at random from the urn, what is the probability that he will select 5 white ballight and 4 green balls? Round your answer to 3 decimal places.
Develop a bar chart to help you manage this project : A local municipality has issued a request for proposals to conduct a feasibility study to design, implement, and sustain a recycling program in a small town. As PM, you need to develop a bar chart to assign responsibility and maintain a project sc..
What is the central limit theorem : What is the Central Limit Theorem? Why and when is it used? Please elaborate on how the Central Limit Theorem and the Empirical Rule are connected. Please use a graph to illustrate your answer.
Probability that at least one of cards : 5 cards are drawn from a standard deck without replacement. What is the probability that at least one of the cards drawn is a heart? Express your answer as a fraction or a decimal number rounded to four decimal places.
Measure of central tendency : QUESTION 1: To identify the point in a distribution at which 50% of scores fall above and 50% fall below a given score, which measure of central tendency would you report?
Discuss proposal planning and execution of a hcit project : Discuss the proposal, planning and execution of a HCIT project and the significance of these elements. Discuss the significance of proper planning in at least five project knowledge areas.
Find the schedule and cost variances for a project : Find the schedule and cost variances for a project that has an actual cost at month 16 of $540,000, a scheduled cost of $523,000, and an earned value of $535,000

Reviews

Write a Review

Basic Statistics Questions & Answers

  Based on information from the rocky mountain news a random

based on information from the rocky mountain news a random sample of 12 winter days in denver gave a mean pollution

  Explain what your computed population mean

Review the data and for the purpose of this project please consider the 100 listing prices as a population. Explain what your computed population mean and population standard deviation were

  Consider the following give your answers correct to two

consider the following. give your answers correct to two decimal places.a find the standard score z such that the area

  Problem regarding the average production employee

A sample of 25 production employees has been tested twice on a standard test of manual dexterity. The average change in the time required to finish the test was a decrease of 1.5 minutes, with a standard deviation of 0.3 minutes. At the 0.05 level..

  What is the probability that the sample average

What is the probability that fewer than 59 CEOs agree that it is attractive to have a joint venture to increase global competitiveness and what is the probability that between 53 and 61 (inclusive) CEOs agree with that assertion?

  Transparency and stock trading activity explain

Transparency and Stock Trading Activity Explain the relationship between transparency of firms and investor participation (or trading activity) among stock markets. Based on this relationship, how can governments of countries increase the amount o..

  For a given linear regression model built between x and y

for a given linear regression model built between x and y the error sum of squares was found to be 300. the total sum

  Is there a difference in among the means

The following data show samples of three chain stores in three different locations in one town and the amount of dollars spent per customer per visit. At the 0.05 level, is there a difference in among the means?

  Statistical on ghypothesis testing

An automotive manufacture claims mean price of small SUV is $25,071 if hypothesis test is performed how you should interpret a decision,

  Explain why a customer would likely prefer a single line

Explain why a customer would likely prefer a single line, N server (channel) system over N-single server systems like those used by many theme parks.

  A tube of listerine tartar control toothpaste contains 42

a tube of listerine tartar control toothpaste contains 4.2 ounces. as people use the toothpaste the amount remaining in

  Test the budgeted amount

The program office budgets $30,000 per program review at the contractor's site. Your concern for "end of year" spending drills is that you have budgeted enough for the reviews

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd