Calculate the test statistic

Assignment Help Mathematics
Reference no: EM131154854

Part I: Each year the EPA does an analysis on the current models of vehicles sold in the United States. The data provided in the data set EpaFE2016Data.csv is a subset of this analysis for 2016 models. If you are curious you may access the full data set from the EPA website https://www.fueleconomy.gov/feg/download.shtml.

Two variables among this data set are fuel type and whether or not the car is made from an American car company or an International car company.

The fuel types are:
DU = Diesel, ultra low sulfur (15 ppm, maximum)
G = Gasoline (Regular Unleaded Recommended)
GP = Gasoline (Premium Unleaded Recommended)
GPR = Gasoline (Premium Unleaded Required)

a. Use the R code and instructions under Data Analysis 4 link in Canvas to create a table of counts among the two categorical variables and a stacked bar chart of the conditional proportion of fuel type among American and International car companies. Include your results. Briefly describe the plot and table.

b. What proportion of American car company vehicles require premium unleaded gasoline (GPR)?

c. What proportion of International car company vehicles require premium unleaded gasoline (GPR)?

Is there evidence the proportion of 2016 vehicles that require premium gasoline is different among the vehicles made by American and International car companies? Use a significance level of 0.01

d. State the null and alternative hypothesis to answer the question of interest.

e. Check Conditions. If they are not met state so and why. Proceed either way.

f. Calculate the test statistic.Show work by hand or if completed in R copy and paste code.

g. Obtain a p-value based on your calculated test statistic.

h. Calculate the 99% Confidence Interval. Show work by hand or if completed in R copy and paste code.

i. Give a four part conclusion (shown in notes) and thoroughly answer the question of interest.

Part II. In 1978 congress established a Gas Guzzler Tax to discourage the production and purchase of fuel-inefficient vehicles. Every vehicle currently produced is labeled as a Guzzler if its fuel efficiency in MPG is below a certain amount. Trucks, SUVs and Minivans were uncommon in 1978 so are exempt from the Tax. If interested, read more: https://www.epa.gov/fueleconomy/guzzler/

Each vehicle in the data set EpaFE2016Data.csv has a projected Five Year Fuel Cost that estimates how much more or less the fuel cost will be in comparison to the average for similar makes and models. For example, the Honda Fit has a five year fuel cost of -$2750. This implies it is $2750 less expensive over five years to fuel versus the typical vehicle in its class.

Using the data set EpaFE2016Data.csv, compare the five year fuel cost fuel cost in dollars among the 2015 vehicles that are flagged as Guzzlers and Non-Guzzlers.

Use the R code and instructions under Data Analysis 4 link in Canvas to obtain descriptive statistics, a side by side box plot and results from a two sample t test.

a. Include a side-by-side box plot of the data. Is there visual evidence that the five year fuel cost is different between the Guzzler and Non-Guzzler vehicles? Explain.

b. Provide an organized table of the summary statistics. Include the sample means, standard deviations and sample sizes for each group.

c. What type of vehicles are guzzlers? Which vehicle has the highest five year fuel cost?

Do these data provide strong evidence of a difference between the average five year fuel costfor Guzzlers and Non-Guzzlers?

Assume that conditions for inference are satisfied. Use a significance level of 0.10.

d. State the null and alternative hypothesis to answer the question of interest.
e. From the summary statistics, calculate the test statistic and degrees of freedom "by hand". Show work. Conservative degrees of freedom are okay.
f. Obtain a p-value based on your calculated test statistic and degrees of freedom from a t table. (You may use either method to get your df). Show work.
g. From the summary statistics, calculate the 90% Confidence Interval "by hand". Show work.
h. Obtain a p-value from t test and confidence interval using R. Paste the output. Are your answers different? Why, yes/no?
i. Using the R output (from g) give a four part conclusion(shown in notes) and thoroughly answer the question of interest.

Part III. Burning fuel fossils contributes to greenhouse gases in the atmosphere. Included in the EPA analysis of 2016 vehicles israting for greenhouse gas emissions (GHGrating). Compare the greenhouse gas rating among vehicles that use different fuel types. Reference Part I for the description of the four fuel types.

Use the R code under the Data Analysis #4 Instructions to obtain graphical display, perform a Single Factor ANOVA F TEST and test of multiple comparisons.

a. From the side-by-side box plot does there look to be a difference in the average GHG rating among the different fuel types? Include the plot and explain your reasoning.

Does the EPA data provide evidence of a difference between at least one average fuel efficiencybetween the five different drive types? If so, which are statistically different?

Assume that conditions for inference are satisfied. Use a significance level of 0.05.

b. State the appropriate null and alternative hypothesis for the ANOVA F test.

c. Use the F statistic and p-value from the ANOVA table to state whether there is a significant difference between at least two of thefuel typesaverage GHG rating.

a. Paste R output.

b. Include a statement in regards to your significance level.

c. Include a statement in terms of the strength of evidence in terms of the alternative.

d. Using the Tukey's Multiple Comparison procedure output. Are there any individual comparisons that are significant at the 0.05 significance level?

a. Paste R output.

b. List all comparisons that are significant (or state those that are not). Which significant comparison has the largest difference in GHG rating? Give the difference estimate.

Reference no: EM131154854

Questions Cloud

Construct a b+ tree for the given set of key values : Construct a B+ tree for the following set of key values under the assumption that the number of key values that fit in a node is 3
Standardization and naming conventions : A position on whether or not standardization and naming conventions are critical for properly managing files and folders in a Windows environment. Include at least one (1) example or scenario to support your response.
What is a deadlock : What is a Deadlock? Write an algorithm for deadlock detection.
Determine the average shearing stress in the bolts : A composite beam is made by attaching the timber and steel portions shown with bolts of 12-mm diameter spaced longitudinally every 200 mm.
Calculate the test statistic : Is there evidence the proportion of 2016 vehicles that require premium gasoline is different among the vehicles made by American and International car companies? Use a significance level of 0.01. State the null and alternative hypothesis to answer..
Network against attacks and physical damages : Described the steps you will take to guard the network against attacks and physical damages. Described how you will use redundancy to provide 100 percent uptime for the BestPrice.com system. Wrote one to two pages describing how you plan to handle ph..
What is the implementation of a java interface : Prepare your report, you will need to research widely on these Java APIs and models. Your report must cover the issues.
Program must compare the corresponding elements in two array : The program should ask the buyer to enter six digits and should store in an integer array. Six digits should be generated randomly in the range 1 through 20. The digits should be filled in a separate array of size six called lottoDigits.
What was the amount of payments to suppliers of inventory : The accounts payable relates only to the acquisition of inventory. Sales were $789,500 and cost of goods sold was $532,700. What was the amount of payments to suppliers of inventory?

Reviews

Write a Review

Mathematics Questions & Answers

  Importance of geometry in math curriculum

The importance of geometry's role in the math curriculum is debated in many high schools and colleges. Some schools offer the course while others have done away with it.

  Determine an equation for each part of the ride

Integrate the concepts from different units in the Advanced Functions course to design a rollercoaster. Your coaster must meet the following conditions.

  Formulate a differential equation for the water temperature

Formulate a differential equation for the water temperature T(t). Specify your units for time and temperature, and the initial values.

  Set up a definite integral

The curve for x=sqrt(In(y)) for 1

  Computing percentage of total assets

What percentage of total assets was composed of current assets? Total Assets 2004 was $53,902

  How many chocolate bars cathy has

John has 20 more chocolate bars than Rick. Cathy has 5 times as many chocolate bars as John. If Rick has r chocolate bars, how many chocolate bars Cathy has, c?

  Define the fundamental counting principle

Be sure to describe or define the Fundamental Counting Principle, combinations, and permutations in your paper introduction. Include detailed calculations and solutions in the body of your paper.

  Find the variance for the number of defects per batch

A company manufactures batteries in batches of 23 and there is a 3% rate of defects. Find the variance for the number of defects per batch.

  State the inequalities that describe the information

The mountainous country of East Matrix can grow only two crops for export, coffee and cocoa. The country has 500,000 hectares of land available for crops. From the chart, state the inequalities that describe the information

  How long should the company dishwashers be guaranteed

How long should the company's dishwashers be guaranteed if the company wishes to replace no more than 2% of the dishwashers?

  How many sets of five marbles include

A bag contains two red marbles, four green ones, one lavender one, three yellows, and five orange marbles. How many sets of five marbles include either the lavender one or exactly one yellow one but not both colors?

  How would the seismographic readings differ at the distance

Two earthquakes differ by .1 when measured on the Richter scale. How would the seismographic readings differ at the distance of 100 km from the epicenter?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd