Conduct a descriptive statistical analysis

Assignment Help Applied Statistics
Reference no: EM132024912

The exercise deals with new car registrations in Denmark. Buying a car is in Denmark something that normally takes place in the spring or for many persons in the business sector just before Christmas with the car delivered March next year.

Problem 1

The folder problem 1 reports quarterly seasonal data from 1992Q1 to 2018Q1 on new car registrations, imports, the rate of interest and a trend variable. Data are NOT seasonally adjusted.

Variable

Description

NCARREG

New registrations of cars

IMPORTS

Real Imports of goods (DKK billions)

RATE

The yearly bond interest rate of the Danish state

In addition, you will in the folder find the logarithmic transformation of the variables and a trend variable called TREND.

Buying a car normally involves loans, so if the rate of interest is low then it is cheap obtain finance for a car. It is more doubtful with the imports. Although all new cars are imported about half of all registrations are re-registrations of secondhand cars that not necessary affect the imports.

A. Provide time series diagrams for the variables NCARREG, IMPORTS and RATE, and a matrix of correlation. What is observed?

B. Estimate a regression model of the following form for the period 1992Q1 to 2018Q1:

NCARREGt = J0 +J1IMPORTSt + J2RATEt + J3TREND + σt

Perform a model control for this regression.

C. To improve the model dynamics are introduced. The two variables to explain the car registrations are lagged by 4 period's equivalent to a year. The model to be estimated for the period 1993Q1 to 2018Q1 is now of the form:

NCARREGt = J0 +J1IMPORTSt-4 + J2RATEt-4 + J3TREND + σt

Conduct a model control and investigate for the presence of autocorrelation by use of the Durbin Watson test. Save the residuals from this regression and call this variable RESID. Comment on the sign of the included variables. Assume a 10 percent level of significance. Compare also the model found under question B.

D. Expand the initial model above to an autoregressive (AR) model of order 4. This is an AR-4 model for the period 1994Q1 to 2018Q1. Estimate a model of the form:

NCARREGt = J0 + J1IMPORTSt-4 + J2RATEt-4 + J3TREND + J4RESIDt-1 + J5RESIDt-2 + J6RESIDt-3 + J7RESIDt -4 + σt

Why is it interesting for this data set to consider a model of order 4 relative to quarterly statistics?

Estimate the model and conduct a model control. Perform the Durbin Watson test. How are the plots of residuals? Compare with the findings under question C.

Conduct an LM-test where the model under H0 is the model estimated under question C, and the model under H1 is the model estimated under question D. Which model is the best? What happened to the autocorrelation?

E. Use the logarithmic transformed variables LNCARREG, LIMPORTS and LRATE to estimate an Error Correction Model (ECM). Explain the foundations for the ECM- model. Estimate first a long run regression model of the form:

LNCARREGt = J0 +J1IMPORTSt + J2LRATEt + σt

Save the residuals from this model and call them ECMRESID. Estimate second the short run ECM-model:

O4LNCARREGt = J0 +J1O4LIMPORTSt + J2O4LRATEt + 6(LNCARREGt-4 - Z0 - Z1LIMPORTSt-4 - Z2LRATEt-4 ) + σt

Or

O4LNCARREGt = J0 +J1O4LIMPORTSt + J2O4LRATEt + J3ECMRESIDt-4 + σt

Here ? is the fourth order difference operator. Is the ECM-term β3 negatively significant as should be the case for the model to be valid? How good is the model, and how is the model control?

Problem 2

The folder problem 2A reports quarterly NOT seasonal adjusted data from 1992Q1 to 2018Q1 on the new car registrations in Denmark. In addition, 3 seasonal dummies can be found and the logarithmic transformation of the new car registrations in Denmark

Variable

Description

NCARREG

New registrations of cars per quarter

D2

D2 = 1 in second quarter otherwise 0

D3

D3 = 1 in third quarter otherwise 0

D4

D4 = 1 in fourth quarter otherwise 0

A. Examine for quarterly deterministic seasonality and estimate a model of the form

NCARREGt = J0 +J1D2t + J2D3t + J3D4t + σt

How good is the model? Is deterministic seasonality observed? Is it true that the spring is the high season for new car registrations?

B. Examine for stochastic quarterly seasonality by use of the HEGY-test

Set up the hypotheses and find the transformed variables. Plot the transformed variables and comment. What is the outcome of the test?

The folder problem 2B reports monthly NOT seasonal adjusted data from 1992M1 to 2018M3 of the new car registrations in Denmark. In addition, 11 seasonal dummies can be found.

Variable

Description

MNCARREG

New registrations of cars per month

D2

D2 = 1 in second month otherwise 0

D3

.. D12

D3 = 1 in third month otherwise 0

..

D12 = 1 in twelfth month otherwise 0

C. Examine for deterministic monthly seasonality and estimate a model of the form

MNCARREGt = J0 +J1D2t + J2D3t + J3D4t + ... + J11D11t + J12D12t + σt

(I.e. all 11 seasonal dummies included). Is it true that March, May and June are special? Compare also with the model estimated on the quarterly data in question A. Is any information lost by using only the model estimated under question A?

Problem 3

Maybe you have been driving in a car, and you have experienced that not only the other car drivers, but also the other truck drivers on the road are occupied with things that not are related to focused driving for example talking in hand held phone or even worse sending SMS's. In addition, some drivers may also be eating, having the attention to tune in the radio etc.

The folder problem 3 reports statistics from 16 European countries on the shares of truck drivers that are performing distractive activities while driving. Statistics has been collected by traffic police in a number of European countries during February 2018. Here the police have given increased attention to these events causing an increased probability of serious traffic accidents.

Data are divided in to three groups according to traffic in the city areas, land areas and motorways. Data reports the share of drivers with distractive performance out of all drivers. In addition the data are divided into four segments of four countries each namely Scandinavia, small Western countries, large Western countries and East European countries. Data are given in percent shares.

A. Examine the hypothesis that distractive driving is more likely to occur on motorways. Use the simple ANOVA-method. Set up the hypotheses for the test, calculate the ANOVA-table, and provide an interpretation of the outcome. Provide a supplementary analysis.
B. Each group of countries can be considered as a segment. Set up and conduct a two- way ANOVA analysis for the data set. Describe the method and set up the hypotheses behind the test. What is the outcome? Do we observe interaction or segmentation among the two factors? What is the interpretation? Which group(s)/segments of countries are of interest with regard to both factors? Can a ranking be undertaken?

Problem 4

Let us continue our trip on the roads with a break and a cup of coffee! The coffee is provided by a coffee machine that from time to time needs service. When the indicator on the machine lights up then it is time for a service. The producer guarantees that the coffee machine can provide minimum 13,000 cups of coffee before each service. However, sometimes this occurs long time before the 13,000 cups of coffee has been provided. However, the producer guarantees that this will be the case in less than 1 percent of the cases.

The folder problem 4 reports statistics from 25 coffee machines. The table gives the number of coffee cups made before the service indicator showed that a service was needed.

A. Examine the symmetry of the dataset and conduct the Bowman-Shenton test. Set up the hypotheses, describe and perform the test. Is the material Normal Distributed?

B. Set up a sign-test and examine a hypothesis stating that the median is higher than 13,000. Set up the hypotheses, describe and perform the test. What is the outcome? What about the guarantee and the 1 percent?

Problem 5

The folder problem 5 contains a data set of 400 randomly selected incomes from Denmark.

Name

Description

DKINCOME

Income in 1,000 DKK

A. Conduct a descriptive statistical analysis for the variable DKINCOME. The analysis should contain a histogram and commenting on the symmetry. Describe the shape of the distribution of data

B. Compile by use of simple random selection a sample of 10 percent or 40 observations of the variable DKINCOME

C. Compile by use of stratified random sample with systematic sampling inside each stratum a sample of 10 percent or 40 observations of the variable DKINCOME. Use the histogram drawn under question A as a point of departure for setting up the strata's

D. Provide a comparison of your results found under question A-C. Comment of your findings. Use descriptive statistics and a discussion of the shape of the samples compared to the distribution of the total data set. Discuss the implications of the presence of eventual outliers on the process of sampling.

Reference no: EM132024912

Questions Cloud

Construct a stem-and-leaf plot and specify : Construct a stem-and-leaf plot and specify the five-number index (min, Q1, Q2, Q3, & max)
Share of preferred stock has par value : A share of preferred stock has a par value of $30. It pays a 6% dividend. If the required return is 3%, what is the price of the stock?
What is the outlier : Is this a normal distribution? What is the outlier? Are there any outliers in this distribution?
Making annual coupon payments and all selling at face value : Consider three bonds with 6.30% coupon rates, all making annual coupon payments and all selling at face value.
Conduct a descriptive statistical analysis : Advanced Tools for Quantitative Analyses - Conduct a descriptive statistical analysis for the variable DKINCOME. The analysis should contain a histogram
Find the minimum sample size? needed : Find the minimum sample size? needed, using a prior study that found that ?36% of the respondents said they are confident with their ?country's banking system.
How can french wines outside the bordeaux and bourgogne : How can French wines outside the Bordeaux and Bourgogne regions be marketed effectively?
What fraction of shipments will be accepted : Using this information, determine the following: a. What fraction of shipments will be accepted?
Required return exceed foley required return : By how much does Beale's required return exceed Foley's required return?

Reviews

Write a Review

Applied Statistics Questions & Answers

  Determine the critical value for a left tailed test of a

1. Determine the critical value for a left-tailed test of a population mean at the α=0.005 level of significance based on a sample size of n = 30. A) 2.750 B) 2.756 C) -1.699 D) -2.756 E) None of these 2. If beta  is computed to be 0.763 , then the p..

  Prepare a linear regression equation in excel

create a Linear Regression (LR) equation in Excel, assuming all assumptions for linear regression have been met. Use the Excel template provided.

  Make a weibull probability plot of the data

STAT 5105 Applied Survival Data Analysis Assignment. Use JMP to do the followings: Compute a nonparametric estimate of F(t), the proportion of units failing as function of time. Plot your estimates with linear scale. Make a Weibull probability plot ..

  Women entrepreneurs leading the charge

According to a report released by CIBC entitled "Women Entrepreneurs: Leading the Charge," the average age for Canadian businesswomen in 2008 was 41

  Conceptual questionswhat is the difference between

conceptual questionswhat is the difference between descriptive and inferential statistics?what are the five

  Average of three english courses with a standard

1.) The mean number of English courses taken in a two-year time period by male and female college students is believed to be about the same. An experiment is conducted and data are collected from 29 males and 16 females. The males took an average of ..

  Determine the procedure name and parameters involved for eac

Directions: Read the following problem.  In your submission, please include the following information: a. Determine if the problem is either a test of hypothesis, a confidence interval or something else and specify the 'key words' found in the proble..

  First perform the appropriate hypothesis test

You are advised to first perform the appropriate hypothesis test using pencil and paper, along with a calculator and statistical tables, and then use your working to answer the questions below.According to the Australian Bureau of Statistics, the qua..

  Create a histogram for TotalSales and QuantitySold

Create a histogram for "TotalSales" and create a histogram for "QuantitySold." Do these fields appear to have normal distributions? Why or why not

  What is the proportion of customers

Find the upper boundary of the 95% confidence interval for the average unload time and what is the proportion of customers who buy service plans when they buy a computer?

  The sample mean

The population mean is within $50.00 of the sample mean?

  Lakeshia sells 3,600 high end folding therapy

Lakeshia sells 3,600 high end folding therapy

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd