Important in the analysis of modern data sets

Assignment Help Advanced Statistics
Reference no: EM13705214

1. Why are robust statistics, such as the median or IQR, important in the analysis of modern data sets? Give a reason why (no need to give numeric values that explain what the median and MAD are).

2. If a random variable has a normal distribution with mean of 90 and standard deviation of 30 units, what is the probability that the variable:

(a) has a value less than 75?(b) greater than 120?

3. Why would a resolution III design ever be considered for experimentation, especially considering the high level of confounding that occurs with these designs? In your answer, also explain what a resolution III design means.

4. If you are a new employee at a company, e.g. a petrochemical corporation, give some characteristic features that will make you realize an EVOP strategy is being applied when you are looking at the company's historical data.

5. You will hear about 6-sigma processes frequently in your career. What does it mean exactly that a process is "6-sigma capable"? Draw a diagram to help illustrate your answer.

6. For any least squares model, does a low value of the correlation coefficient, r, imply that the input andoutput variables are unrelated? Explain.

7. Describe why a box plot is an effective univariate summary. Note: do not explain how the box plot is calculated; rather explain how you use it.

8. An exponentially weighted moving average (EWMA) chart allows one to develop a monitoring chart with either Shewhart chart characteristics, or CUSUM (cumulative sum) characteristics.


(a) In which general situation(s) would a more CUSUM-like behaviour be important to a monitoring system?

(b) Now describe a specific example to illustrate your prior answer.

(c) How would you change your EWMA chart to exhibit more CUSUM-like behavior ?

9. A method of fitting a least squares model, LTS, Least Trimmed Squares, takes the full set of n data points and trims out (and totally ignores) a subset of the outlier points so that they do not influence the objective function. This is done as a way to get robustness to outliers.

(a) Write out the regular least squares objective function.

(b) Draw an example to show how a robust least squares model would be beneficial.

(c) Describe an alternative modification to the objective function which would also be robust to outliers.

10. Name a reason why a company (or yourself) would run a set of saturated fractional factorials

11. Why is the principle of minimizing "data ink" so important in an effective visualization? Give anengineering example of why this important.

12. Why are latent variable methods effective for dealing with modern data sets? Your answer must also clearly describe the problem faced with these modern data sets.

13. Explain the intention of blocking in experimental designs.

14. You have two production lines in your company, producing the same product, which is sold to the samecustomers. Production line TL-419 has a Cpk = 0:90 and line TL-417 has Cp = 1:2 (notice that one is Cpkand the other is Cp).

1. When should one use Cpk and when should one use Cp to assess the process capability? [2]

2. Write a few bullet points to your manager to explain which production line should receive most ofthe $200,000 annual budget for process improvements.

15. Itconstraints only allow you to run 9 experiments. You must run two experiments per day to finish theexperiments within 5 days. Each day there is a different crew of plant operators and staff - they are stronglyexpected to have an effect on the results.

Write out an experimental table that blocks for the effect of the operators. Your table must show the levelsof the 4 factors and have an additional column that indicates which day the experiment should be run (1, 2,3, 4 or 5). Give bullet point notes that outline the justification for your table.

Hint: blocking can be viewed as adding additional factor(s) to a fractional factorial, with the blocking levelsgiven by the new factor(s).

16. Your new raw material supplier has a Cpk value of 1.2 for a critical quality variable, and your previous supplier's Cpk is 0.95. Your manager doesn't understand this terminology and wants to understand why yourecommended the new supplier, even though their material is more expensive. Give a brief explanation, andan illustration (diagram) to help your manager

Reference no: EM13705214

Questions Cloud

Discuss the market system and the need for ethics : 1. Discuss the market system and the need for ethics in business and distinguish it from the law and concepts of virtue and morality.2. Discuss ethics in the context of relativism, psychological egoism, utilitarianism, deontology, and virtue ethics..
Analyze the dataset from repeated-measures experimental : Assignment consists of two parts. In the first part, you will utilize an existing dataset to analyze the dataset from repeated-measures experimental design. All SPSS output should be pasted into your Word document. In the second part, you will ..
Genentech: after the acquisition by roche : What impact will the Roche buyout have on Genentech? Will it be possible for Roche to own Genentech without destroying its ability to innovate?
Complex variables and rlc circuits : The analysis of several circuits can be summarized in a unified language, that is, the tool of complex variables. The purpose of the project is to shed some light on that.
Important in the analysis of modern data sets : Why are robust statistics, such as the median or IQR, important in the analysis of modern data sets? Give a reason why and why would a resolution III design ever be considered for experimentation, especially considering the high level of confounding..
Describe the biological functions of tola : Describe the biological functions of tolA, tolB, and tolR proteins in pseudomonas? Are these three proteins (ORFs) located on the same operon? why or why not? explain.
Use to develop and express an antibody fragment : List the steps that you would use to develop and express an antibody fragment as a fusion protein? ?
How many h bonds would form : In an antiparallell Beta pleated sheet containing 100 amino acids, 50 in each strand, how many H bonds would form?
What is data-processing cycle : What is data-processing cycle

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Minimize the cost of shipping

Sindbad Marine Co has two ships Barbarossa and Jebel Tarik based in Lattaquieh. Each ship has three sections: front, middle, and back. Sinbad Marine Co has received an offer to transport two commodities:

  Computing budgeted gross profit

For Nolte Company, the budgeted cost for one unit of product is direct materials $10, direct labor $20 and manufacturing overhead 90% of direct labor cost.

  Mtiple comparisonsscoretukey hsdi formatj formatmean

multiple comparisonsscoretukey hsdi formatj formatmean difference i-jstd. errorsig.95 confidence

  Final price adjustment-defective pricing

Compute the amount of the final price adjustment because of defective pricing based on the following:

  Computing desired probability values

Calculate the probability that the project will be completed in 38 weeks. Calculate the probability that the project will be completed in 42 weeks.

  Effectiveness of projections and forecasts

What are the ramifications to the firm to which you are most closely aligned or are analyzing if one or more of your projections/forecasts do not hold true?

  Standard deviation of complaints received per week

What is the probability that a randomly chosen package contains between 47 and 52 clips (inclusive) per package and What is the standard deviation of complaints received per week?

  Monte carlo method probability model

X and Y are both standard normal random variables (mean = 0, standard deviation = 1), statistically independent of each other. Create a simulation model to estimate the probability that X and Y are both positive and that their sum is less or equal..

  A shop is selling laptops at regular price and at half

a shop is selling laptops at regular price and at half price. if the laptops are regular price a day they can be at

  How are students distributed across classes of freshman

What is the proportion of females at StatCrunchU and determine a range of plausible values for this proportion. Is this proportion significantly larger than 0.5?

  Sales and marketing career path-tip sheet

Consider the top 2-3 careers in Sales or Marketing you would like to enter one day. Do some research at places like Monster and compile some data for each of these career paths. In particular, collect salary information, experience and degree requ..

  What is probability pat will pass the quiz

what is the probability that more than 2 packages will be delivered late - what is the probability that exactly 2 packages in the sample arrive late?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd