Important in the analysis of modern data sets

Assignment Help Advanced Statistics
Reference no: EM13705214

1. Why are robust statistics, such as the median or IQR, important in the analysis of modern data sets? Give a reason why (no need to give numeric values that explain what the median and IQR are).
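
As a hedged illustration of the idea behind this question, the short Python sketch below compares how one gross outlier moves the mean and the median. The readings are invented for the example and are not part of the assignment.

```python
from statistics import mean, median

# Hypothetical readings; the values are made up for illustration only.
clean = [10, 11, 12, 13, 14]
# The same readings, with the last one corrupted by a recording error.
contaminated = [10, 11, 12, 13, 1400]

# How far each summary moves when a single value is corrupted.
mean_shift = mean(contaminated) - mean(clean)
median_shift = median(contaminated) - median(clean)

print("shift in mean:  ", mean_shift)
print("shift in median:", median_shift)
```

In this sketch the median does not move at all, while the mean is dragged far from the bulk of the data by the single bad value.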

2. If a random variable has a normal distribution with mean of 90 and standard deviation of 30 units, what is the probability that the variable:

(a) has a value less than 75?

(b) has a value greater than 120?
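
One way to check the two probabilities is with Python's standard library. This is a minimal sketch that standardises each limit as z = (x - 90) / 30 and reads the normal cumulative distribution; the variable names are chosen here for illustration.

```python
from statistics import NormalDist

# Normal distribution with the mean and standard deviation from the question.
x = NormalDist(mu=90, sigma=30)

# (a) P(X < 75): z = (75 - 90) / 30 = -0.5
p_below_75 = x.cdf(75)

# (b) P(X > 120): z = (120 - 90) / 30 = 1.0, so take the upper tail
p_above_120 = 1 - x.cdf(120)

print(round(p_below_75, 4))   # 0.3085
print(round(p_above_120, 4))  # 0.1587
```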

3. Why would a resolution III design ever be considered for experimentation, especially considering the high level of confounding that occurs with these designs? In your answer, also explain what a resolution III design means.

4. If you are a new employee at a company, e.g. a petrochemical corporation, give some characteristic features that will make you realize an EVOP strategy is being applied when you are looking at the company's historical data.

5. You will hear about 6-sigma processes frequently in your career. What does it mean exactly that a process is "6-sigma capable"? Draw a diagram to help illustrate your answer.

6. For any least squares model, does a low value of the correlation coefficient, r, imply that the input and output variables are unrelated? Explain.
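
A small numerical sketch can help when thinking about this question. The data below are invented: y is an exact function of x (y = x squared), yet the linear correlation coefficient comes out as zero because the relationship is not linear.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: y depends entirely on x, but symmetrically around x = 0.
x = [-2, -1, 0, 1, 2]
y = [v ** 2 for v in x]

r = pearson_r(x, y)
print("r =", r)
```

The coefficient r only measures the strength of a linear association, so a value near zero here says nothing about whether a nonlinear relationship exists.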

7. Describe why a box plot is an effective univariate summary. Note: do not explain how the box plot is calculated; rather explain how you use it.

8. An exponentially weighted moving average (EWMA) chart allows one to develop a monitoring chart with either Shewhart chart characteristics, or CUSUM (cumulative sum) characteristics.


(a) In which general situation(s) would a more CUSUM-like behaviour be important to a monitoring system?

(b) Now describe a specific example to illustrate your prior answer.

(c) How would you change your EWMA chart to exhibit more CUSUM-like behaviour?
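
The role of the EWMA weight can be sketched in a few lines of Python. This is an illustrative sketch only: the function name `ewma`, the weight name `lam`, and the step-change data are assumptions made for the example. With a weight of 1 the smoothed value is just the latest observation (Shewhart-like, no memory); with a small weight the chart carries a long memory of past values, which is the sense in which it behaves more like a CUSUM.

```python
def ewma(values, lam, start):
    """Return the EWMA sequence z_t = lam * x_t + (1 - lam) * z_(t-1)."""
    smoothed = []
    previous = start
    for x in values:
        previous = lam * x + (1 - lam) * previous
        smoothed.append(previous)
    return smoothed

# Hypothetical process data: on target (0.0), then a sustained shift to 1.0.
data = [0.0] * 5 + [1.0] * 5

shewhart_like = ewma(data, lam=1.0, start=0.0)  # no memory of the past
cusum_like = ewma(data, lam=0.1, start=0.0)     # long memory of the past

print(shewhart_like)
print([round(z, 3) for z in cusum_like])
```

With the small weight, the smoothed value keeps climbing for as long as the shift persists, so a small but sustained drift accumulates instead of being judged one point at a time.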

9. A method of fitting a least squares model, LTS, Least Trimmed Squares, takes the full set of n data points and trims out (and totally ignores) a subset of the outlier points so that they do not influence the objective function. This is done as a way to get robustness to outliers.

(a) Write out the regular least squares objective function.

(b) Draw an example to show how a robust least squares model would be beneficial.

(c) Describe an alternative modification to the objective function which would also be robust to outliers.
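
To make the two objective functions concrete, here is a minimal Python sketch. It assumes the residuals e_i = y_i - ŷ_i have already been computed from some fitted model; the function names, the residual values, and the choice of h are all hypothetical. Regular least squares sums every squared residual, while the trimmed version sums only the h smallest squared residuals.

```python
def sum_of_squares(residuals):
    """Regular least squares objective: the sum of all squared residuals."""
    return sum(e ** 2 for e in residuals)

def trimmed_sum_of_squares(residuals, h):
    """Trimmed objective: the sum of only the h smallest squared residuals."""
    squared = sorted(e ** 2 for e in residuals)
    return sum(squared[:h])

# Hypothetical residuals; the last one comes from an outlying point.
residuals = [1.0, -1.0, 2.0, 10.0]

ols_objective = sum_of_squares(residuals)             # 1 + 1 + 4 + 100
lts_objective = trimmed_sum_of_squares(residuals, 3)  # 1 + 1 + 4

print(ols_objective, lts_objective)
```

The single outlier contributes most of the regular objective, but it contributes nothing to the trimmed one, which is why it cannot pull the fitted line towards itself.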

10. Name a reason why a company (or yourself) would run a set of saturated fractional factorials.

11. Why is the principle of minimizing "data ink" so important in an effective visualization? Give an engineering example of why this is important.

12. Why are latent variable methods effective for dealing with modern data sets? Your answer must also clearly describe the problem faced with these modern data sets.

13. Explain the intention of blocking in experimental designs.

14. You have two production lines in your company, producing the same product, which is sold to the same customers. Production line TL-419 has a Cpk = 0.90 and line TL-417 has Cp = 1.2 (notice that one is Cpk and the other is Cp).

(a) When should one use Cpk and when should one use Cp to assess the process capability?

(b) Write a few bullet points to your manager to explain which production line should receive most of the $200,000 annual budget for process improvements.
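
The difference between the two indices can be sketched numerically. The snippet below uses the standard definitions Cp = (USL - LSL) / (6σ) and Cpk = min(USL - mean, mean - LSL) / (3σ); the specification limits, standard deviation, and means are invented for illustration and do not describe lines TL-419 or TL-417.

```python
def cp(lsl, usl, sigma):
    """Cp compares the specification width to the process spread only."""
    return (usl - lsl) / (6 * sigma)

def cpk(lsl, usl, mean, sigma):
    """Cpk also accounts for how far the process mean sits from each limit."""
    return min(usl - mean, mean - lsl) / (3 * sigma)

# Hypothetical specification limits and process spread.
LSL, USL, SIGMA = 0.0, 12.0, 2.0

centred = cpk(LSL, USL, mean=6.0, sigma=SIGMA)     # mean at mid-specification
off_centre = cpk(LSL, USL, mean=8.0, sigma=SIGMA)  # mean closer to the USL

print("Cp:", cp(LSL, USL, SIGMA))
print("Cpk (centred):", centred)
print("Cpk (off-centre):", round(off_centre, 3))
```

In this sketch Cp stays the same when the mean moves, because it ignores centring, while Cpk drops as soon as the process drifts towards one limit.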

15. Itconstraints only allow you to run 9 experiments. You must run two experiments per day to finish the experiments within 5 days. Each day there is a different crew of plant operators and staff - they are strongly expected to have an effect on the results.

Write out an experimental table that blocks for the effect of the operators. Your table must show the levels of the 4 factors and have an additional column that indicates which day the experiment should be run (1, 2, 3, 4 or 5). Give bullet point notes that outline the justification for your table.

Hint: blocking can be viewed as adding additional factor(s) to a fractional factorial, with the blocking levels given by the new factor(s).

16. Your new raw material supplier has a Cpk value of 1.2 for a critical quality variable, and your previous supplier's Cpk is 0.95. Your manager doesn't understand this terminology and wants to understand why you recommended the new supplier, even though their material is more expensive. Give a brief explanation, and an illustration (diagram) to help your manager.
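
One way to put numbers behind such an explanation is to estimate how much material falls beyond the nearest specification limit. The sketch below rests on assumptions that are not stated in the question: the quality variable is normally distributed, the process is stable, and only the tail beyond the nearer limit is counted. Under those assumptions the nearest limit sits 3 × Cpk standard deviations from the mean. The function name is hypothetical.

```python
from statistics import NormalDist

def fraction_beyond_nearest_limit(cpk):
    """Tail area beyond the nearest spec limit, assuming a normal process.

    The nearest limit lies 3 * Cpk standard deviations from the mean.
    """
    return NormalDist().cdf(-3 * cpk)

new_supplier = fraction_beyond_nearest_limit(1.2)
old_supplier = fraction_beyond_nearest_limit(0.95)

print(f"new supplier: {new_supplier:.6f}")
print(f"old supplier: {old_supplier:.6f}")
print(f"ratio old/new: {old_supplier / new_supplier:.1f}")
```

Under these assumptions the previous supplier delivers a noticeably larger share of off-specification material, which is the kind of comparison a manager can weigh against the higher purchase price.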






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd