Calculate the inclusion probabilities

Reference no: EM131295133

Part 1) Central Limit Theorem

The input data consists of the sequence from 1 to 25 (1:25). Show the following three plots in a single row.

a) Show the histogram of the densities of this distribution.

b) Using all samples of this data of size 2, show the histogram of the densities of the sample means.

c) Using all samples of this data of size 5, show the histogram of the densities of the sample means.

d) Compare the means and standard deviations of the above three distributions.
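The comparison in d) can be checked numerically. The following is a minimal sketch in Python (the assignment itself expects R), assuming that "all samples" means every ordered sample drawn with replacement; the helper name `sample_mean_stats` is hypothetical. Under that assumption the mean of the sample means equals the population mean, and their standard deviation equals the population standard deviation divided by the square root of the sample size.

```python
from itertools import product
from math import sqrt
from statistics import fmean, pstdev


def sample_mean_stats(population, n):
    """Return (mean, sd) of the means of every ordered size-n sample.

    Assumption: samples are drawn with replacement, so there are
    len(population) ** n of them.
    """
    means = [sum(s) / n for s in product(population, repeat=n)]
    return fmean(means), pstdev(means)


population = list(range(1, 26))  # the sequence 1:25
pop_mean, pop_sd = fmean(population), pstdev(population)

# Size-2 samples: 25 ** 2 = 625 tuples, cheap to enumerate.
mean2, sd2 = sample_mean_stats(population, 2)

print(f"population:       mean={pop_mean:.3f} sd={pop_sd:.3f}")
print(f"n=2 sample means: mean={mean2:.3f} sd={sd2:.3f}")
print(f"sigma / sqrt(2):  {pop_sd / sqrt(2):.3f}")
```

Calling `sample_mean_stats(population, 5)` works the same way but enumerates 25 ** 5 (about 9.8 million) tuples, so it takes noticeably longer.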

Part 2) Central Limit Theorem

The file queries.csv contains the number of queries Google has had each day over a one-year period (365 days). The data file is also available at https://kalathur.com/cs544/data/queries.csv. Use this link to read the data with the read.csv function when submitting the homework.

a) Show the histogram of the distribution of the number of queries. Compute the mean and standard deviation of the number of queries Google has had per day.

b) Draw 1000 samples of this data of size 5 and show the histogram of the densities of the sample means. Compute the mean of the sample means and the standard deviation of the sample means.

c) Draw 1000 samples of this data of size 20 and show the histogram of the densities of the sample means. Compute the mean of the sample means and the standard deviation of the sample means.

d) Compare the means and standard deviations of the above three distributions.

Part 3) Central Limit Theorem - Negative Binomial distribution

Suppose the input data follows the negative binomial distribution with the parameters size = 5 and prob = 0.5.

a) Generate 1000 random numbers from this distribution. Show the barplot with the proportions of the distinct values of this distribution.

b) With sample sizes of 10, 20, 30, and 40, generate the data for 5000 samples using the same distribution. Show the histograms of the densities of the sample means. Use a 2 x 2 layout.

c) Compare the means and standard deviations of the data from a) with the four sequences generated in b).
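The expected pattern for c) can be sketched as follows, in Python rather than the R the assignment asks for. The sketch assumes the parameterization used by R's rnbinom, where the variable counts failures before the size-th success, so the theoretical mean is size * (1 - prob) / prob = 5 and the standard deviation is sqrt(size * (1 - prob)) / prob = sqrt(10). The helper names `rnbinom_one` and `sample_means` and the seed are hypothetical.

```python
import random
from math import sqrt
from statistics import fmean, pstdev

SIZE, PROB = 5, 0.5


def rnbinom_one(rng, size, prob):
    """Count failures before the size-th success in Bernoulli(prob) trials."""
    failures = successes = 0
    while successes < size:
        if rng.random() < prob:
            successes += 1
        else:
            failures += 1
    return failures


def sample_means(rng, n, num_samples):
    """Means of num_samples independent samples, each of n draws."""
    return [
        fmean(rnbinom_one(rng, SIZE, PROB) for _ in range(n))
        for _ in range(num_samples)
    ]


rng = random.Random(544)  # arbitrary seed, only for reproducibility
theory_mean = SIZE * (1 - PROB) / PROB
theory_sd = sqrt(SIZE * (1 - PROB)) / PROB

results = {}
for n in (10, 20, 30, 40):
    means = sample_means(rng, n, 5000)
    results[n] = (fmean(means), pstdev(means))
    print(f"n={n}: mean={results[n][0]:.3f} sd={results[n][1]:.3f} "
          f"(theory sd={theory_sd / sqrt(n):.3f})")
```

The mean of the sample means should stay close to 5 for every sample size, while the standard deviation shrinks roughly as sqrt(10) / sqrt(n).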

Part 4) Sampling

Use the MU284 dataset from the sampling package. Use a sample size of 20 for each of the following.

a) Show the sample drawn using simple random sampling without replacement. Show the frequencies for each region (REG). Show the percentages of these with respect to the entire dataset.

b) Show the sample drawn using systematic sampling. Show the frequencies for each region (REG). Show the percentages of these with respect to the entire dataset.

c) Calculate the inclusion probabilities using the S82 variable. Using these values, show the sample drawn using systematic sampling. Show the frequencies for each region (REG). Show the percentages of these with respect to the entire dataset.

d) Order the data using the REG variable. Draw a stratified sample using proportional sizes based on the REG variable. Show the frequencies for each region (REG). Show the percentages of these with respect to the entire dataset.

e) Compare the means of the RMT85 variable for these four samples with the entire data.
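For part c), the idea behind inclusion probabilities proportional to a size variable, followed by systematic sampling with those probabilities, can be sketched as below. This is an illustrative Python sketch, not the sampling package's code: the function names `inclusion_probabilities` and `systematic_pps` are hypothetical, and the `sizes` list is made-up data standing in for the S82 column.

```python
import random


def inclusion_probabilities(sizes, n):
    """pi_k = n * x_k / sum(x); units reaching 1 are capped at 1 and the
    remaining sample size is redistributed over the other units."""
    pik = [n * x / sum(sizes) for x in sizes]
    while True:
        capped = [p >= 1 for p in pik]
        free_total = sum(x for x, c in zip(sizes, capped) if not c)
        remaining = n - sum(capped)
        new = [1.0 if c else remaining * x / free_total
               for x, c in zip(sizes, capped)]
        if [p >= 1 for p in new] == capped:
            return new
        pik = new


def systematic_pps(pik, rng):
    """Systematic sampling with unequal probabilities: pick a random start
    u in [0, 1) and select the unit whose cumulative-probability interval
    contains u, u + 1, u + 2, ..."""
    target = rng.random()
    cumulative = 0.0
    selected = []
    for k, p in enumerate(pik):
        cumulative += p
        if cumulative > target:
            selected.append(k)
            target += 1.0
    return selected


sizes = [12, 30, 7, 45, 21, 9, 60, 16]  # hypothetical size variable
n = 3
pik = inclusion_probabilities(sizes, n)
sample = systematic_pps(pik, random.Random(1))
print("inclusion probabilities:", [round(p, 3) for p in pik])
print("selected unit indices:", sample)
```

The inclusion probabilities sum to the sample size, so larger units are more likely to be drawn while the sample size stays fixed at n.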

Attachment:- queries.csv

Verified Expert

Descriptive statistics are typically used to assess the distribution of the variables under consideration. If a variable is continuous, the mean, median, and other descriptive measures are used to characterize its distribution. If the variable is qualitative, a frequency distribution is used instead.



Reviews

len1295133

11/30/2016 12:23:17 AM

The PDF at the end lists what needs to be submitted, normally a doc and an R file. You are strongly encouraged to add comments to the code portions. Doing so will help your instructor understand your programming logic and grade you more accurately.




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd