Demonstrate an advanced level of synthesis

Assignment Help Mathematics
Reference no: EM133150506

Project Report

Part A

Objective:
The purpose of this project is to provide you with an opportunity to demonstrate an advanced level of synthesis, understanding and communication of the concepts, statistical methods and practical analyses within R that you have learnt throughout this course.

Please remember that STA8005 is a postgraduate level course which requires that students demonstrate an advanced level of knowledge, skills, reasoning and problem- solving. Also, this project is a significant assessment item worth 40% of your final grade. As such you should expect to find it challenging and expect to spend considerable time working on it. I encourage you to start as soon as possible. You do not need to have completed all of the course work and topics to make a start on becoming familiar with the data.

The Data:
A consultancy firm has asked you to explore some data about vehicles and address three specific aspects of interest (Tasks 1, 2 and 3 below) for their client, and then report your process and findings in a written report.

The data file vehicles.txt contains data for 12 variables from 400 vehicles. The variables relate to the size, fuel efficiency and price of the vehicles. Each of the 12 variables are defined below. Before beginning the Tasks, you may need to do some data cleaning due to missing data or outliers. All analysis for the following tasks should be based on your cleaned data set. For the purpose of this exercise assume that the data meets any required MVN assumptions.

Definition of 12 variables in vehicles.txt:

• Name: The vehicle make and model name
• retail: Suggested Retail Price, what the manufacturer thinks the vehicle is worth, including adequate profit for the automaker and the dealer (U.S. Dollars)
• cost: Dealer Cost (or "invoice price"), what the dealership pays the manufacturer (U.S. Dollars)
• engine_size: Engine Size (litres)
• cylinders: Number of Cylinders (4, 6 or 8)
• horsepower: Horsepower (ft-lb/s) (foot-pounds per second)
• city_mpg: City Miles Per Gallon
• highway_mpg: Highway Miles Per Gallon
• weight: Weight (Pounds)
• wheel_base: Wheel Base (inches)
• length: Length (inches)
• width: Width (inches)

Task 1: The client would like to know the number of vehicles in the sample after cleaning. They would also like to know the number of vehicles with 4, 6 or 8 cylinders recorded in the data and the mean and standard deviation of the retail price of each cylinder group.
Action: Clean the data as necessary and describe the changes you have made and the final structure of the data you will analyse. Provide a frequency table of the number of vehicles by cylinder group and describe. Find the mean and standard deviation by cylinder group. Interpret interesting aspects of this data summary.

They would also like to know what the relationships are between the engine_size based on the variables: retail, cylinders, horsepower, city_mpg and highway_mpg. Which engine sizes are most similar to each other and which are most different?
Action: First, create a new variable called engine_gr and recode the engine size variable so that:
Engine size <2 = engine_gr 1
Engine size >=2 & <3 = engine_gr 2
Engine size >=3 & <4 = engine_gr 3
Engine size >=4 & <5 = engine_gr 4
Engine size >=5 engine_gr 5

Provide a table showing the number of vehicles in each engine_gr level and comment. Perform, provide relevant output, and interpret a cluster analysis to show the multivariate relationships among engine sizes (engine_gr). Note: there are several ways you could perform the cluster analysis - be sure to explain what you tried and explain why you decided on your final choice. (25 marks)

Task 2: For 6-cylinder cars only determine if there is a linear relationship between two sets of variables:
• Car size variables: weight, length, wheelbase and width
• Fuel efficiency: horsepower, city_mpg and highway_mpg

Action: Subset the data as needed and briefly describe this new set of data. Select the best method (from those covered in this course only) to explore this Task.
Perform the analysis, provide relevant output, and interpret.

Task 3: Can cylinder number be predicted using the car size and fuel efficiency variables mentioned in Task 2?
Action: Select the best method (from those covered in this course only) to explore this question, perform the analysis, provide relevant output, and interpret.

In order to successfully classify cases into groups or categories there does need to be some differences between those groups. We have covered a test in this course that specifically tests for differences between groups. For those cases in your test set, are there significant differences among the cylinder groups based on the using the car size and fuel efficiency variables mentioned in Task 2?

Action: Select the best method (from those covered in this course only) to explore this question, perform the analysis and interpret. Provide a summary table of results. Include in your answer appropriate p-values for all significance tests performed. How do your results here relate to your results in the first Action of Task 3?

Part B

Question 1:
Recreate and complete the table below by indicating which features are relevant to each method.

1758_Process.jpg

 

Question 2:
Construct by hand a simple nearest-neighbour dendrogram from the distance matrix below. (Note: it is acceptable to draw by hand and insert a photo of your dendrogram)
1 2 3 4
2 1.912370
3 5.382450 7.120542
4 3.385996 5.059430 2.138709
5 1.512238 3.190303 4.575420 2.910661

Question 3:
What are the limitations or disadvantages of multivariate methods generally? (no more than 300 words)

Question 4:
Describe when a Mantel's randomisation test might be used and how the significance of the test statistic is calculated.

Question 5:
Explain your understanding of eigenvectors and eigenvalues (your answer must be in your own words and will be checked using a plagiarism checker) (no more than 300 words).

Attachment:- project report.rar

Reference no: EM133150506

Questions Cloud

Compute amount of share premium related to ordinary shares : Gilas Company issued 20,000 shares of its P10 par ordinary share. Compute amount of share premium related to ordinary shares
Understanding of people from multiple backgrounds : 1. How does coming from different social, cultural, economic, and demographic backgrounds impact students understanding of people from multiple backgrounds?
Assess the chosen company ability : Create a matrix in which you assess the chosen company's capabilities and resources to implement an existing strategic plan.
Institute of management accountant : Who and what is the Institute of Management Accountants? What are the requirements of obtaining the CMA? What is the average salary advantage that CMAs have?
Demonstrate an advanced level of synthesis : Demonstrate an advanced level of synthesis, understanding and communication of the concepts, statistical methods and practical analyses
Prepare production budget for both second and third quarters : Prepare a production budget for both the second and third quarters that shows the number of transmissions to manufacture
Changes necessary to improve quality : How would you, as a quality control manager, manage resistance by employees to changes necessary to improve quality?
What entry is required to eliminate effect of inventory : Cairo 's ending inventory included merchandise that was purchased from Alex for $3,000. What entry is required to eliminate the effect of ending inventory
How much will the three of you spend with apple in ten years : If you tell two friends about the company, with one spending $7 a week and the other spending $9 a week, how much will the three of you spend with Apple

Reviews

len3150506

5/24/2022 10:56:07 PM

Hi this is a statistical r programming project. Please go through the instruction which has additional details.

Write a Review

Mathematics Questions & Answers

  How much must be paid if the invoice is paid

Columbus Fitness Center received three new weight machines on May 15 and the invoice in the amount of $1,215 for these goods arrived on May 1 with discount.

  What is rate of change of power with respect to resistance

The power, P, dissipated when a 9-volt battery is put across a resistance of R ohms is given by P = 81/R.

  Problems based on mathematical data

What is the shape of the data?

  Compare the descriptive statistics you calculated

Compare the descriptive statistics you calculated to the population statisticsreported. Were there differences? Why or why not? Do you think that the sample you picked was representative of the population? Explain your response

  Explain what would account for the presence of a shape

What would account for the presence of a shape (or lack of shape) in the graphing of the data? That is, what factors would promote a linear graph, or a nonlinear graph, or a random pattern graph?

  For which values of t is the curve concave upward

Quiz 1. For the parametric curve given by x = t2 + 1, y = t2 + t, find dy/dx and d2y/dx2 in terms of t. For which values of t is the curve concave upward

  What is the proabiility of overruning the poe

In performing a symetric approximation, you calculate the total system cost $6,725K. The variance is 75625. The POE is $6,475K. What is the proabiility of overruning the POE?

  Manufacturer produces 1000 computer chips for

a manufacturer produces 1000 computer chips for a mission-critical application. each chip costs 100 to manufacture and

  Find the derivative of the function

Find the derivative of the function. s(t) = (cost/1+sint)5 and y = 1/x2.4 - π/√x

  Payments exactly as scheduled

How much total interest will she pay if she makes all of her payments exactly as scheduled, with none missed and nothing extra.

  Find the radius of the hole

A compact disc (CD) is made such that it is 53.0 mm from the edge of the center hole to the edge of the disc.

  Description of conjugacy classes

Suppose K is a conjugacy class of S_n (the symmetric group), with K consisting of even permutations. Suppose that x in K. Show that K splits into two conjugacy classes

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd