Analysis of variance relating casual to weekday

Assignment Help Basic Statistics
Reference no: EM131241113

Statistics for Data Science Assignment -

For this assignment you will continue to use data derived from Capital Bikeshare trip records from 2011 and 2012, this time analysing patterns in daily numbers of rentals by casual users.

References and Data Sources:

Bache, K. & Lichman, M. (2013). UCI Machine Learning Repository [https://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

Fanaee-T, Hadi, and Gama, Joao, 'Event labeling combining ensemble detectors and background knowledge', Progress in Artificial Intelligence (2013): pp. 1-15, Springer Berlin Heidelberg.

Data file for this assignment:     

The data file for this assignment is called daily.sas7bdat and contains daily counts of bike rentals for 2011 and 2012, derived from Capital Bikeshare trip history data, with additional weather and seasonal information. The data was downloaded from the UCI Machine Learning Repository. Variables in that file are as follows:

Variable

Description

Instant

Record index

dteday

Date

Season

Winter, spring, summer or fall (northern hemisphere)

Yr

0 = 2011, 1 = 2012

Month

Month (January to December)

Weekday

Day of the week (Monday to Sunday)

workingday

Working day = 1, weekend or public holiday = 0

Temp

Normalised temperature in degrees Celsius; observed temperature divided by 41 (max)

Atemp

Normalised 'feels like' temperature in degrees Celsius; values divided by 50 (max)

Hum

Normalised humidity; observed values divided by 100 (max)

Windspeed

Normalised wind speed; values divided by 67 (max)

Casual

Count of casual users

registered

Count of registered users

Count

Total count of bike rentals (casual and registered)

Assignment tasks:          

Question 1 - Carry out a one-way analysis of variance relating casual to weekday. Use contrasts to test at least one a-priori hypothesis of your choice. Examine and comment on residuals. Also carry out appropriate post-hoc comparisons and discuss your results.

Question 2 - Use SAS to perform a one-way ANCOVA relating casual to weekday with atemp as a covariate, including appropriate post-hoc comparisons:

-Confirm that there is a linear relationship between the response variable and the covariate (a scatterplot and a correlation coefficient plus a comment will suffice);

-Check the two additional ANCOVA assumptions (report and comment only on the parts of the output most directly relevant to condition checking):

  • Independence of the covariate and the treatment effect (perform a one-way ANOVA test; there should be no statistically significant difference);
  • Equality of slopes (add and check significance of the interaction term);

-Report and briefly discuss your results.

Technical note: Make sure you obtain and examine Type III Sum of Squares (ss3). Also obtain estimates of 'least squares means' (lsmeans) which are means by treatment adjusted for the covariate.

Question 3 -

(a) Carry out a one-way analysis of variance relating casual to season. Use contrasts to test at least one a-priori hypothesis of your choice. Also carry out appropriate post-hoc comparisons and discuss your results.

(b) Extend your analysis in part (a) to test whether there is evidence of interaction between season and the type of day (working day vs weekend or public holiday). Carry out appropriate post-hoc comparisons and discuss your results.

(c) The distribution of the number of casual users by season is actually not Normal so a Kruskal-Wallis test may be more appropriate to relate casual to season. Carry out this test and for post-hoc analysis, consider comparisons between summer and each of the other seasons. Discuss and compare your results to those in part (a).

Question 4 - Write a summary of your findings from Questions 1 to 3. Keep the technical details of the analyses that led you to these conclusions to the absolute minimum. Rather, focus on practical significance and present your findings in non-specialist terms. One page will be sufficient.

Reference no: EM131241113

Questions Cloud

Space between the plates : A parallel plate capacitor with a plate area of 22.0 cm2 and air in the space between the plates, which are separated by 2.5 mm, is connected to a 24.0-V battery. If the plates are pulled back so that the separation increases to 4.5 mm, how much w..
Determine the 5 year flat volatility for caps and floors : Use DerivaGem to determine the 5-year flat volatility for caps and floors. - The floor rate in a zero-cost 5-year collar when the cap rate is 8%.
Formula for arc price elasticity what is percentage in price : IF the price of a slice of pizza rises from $2,50 to $3, and quantity demanded falls from 10,000 slices to 7,400 slices, using the formula for arc price elasticity what is the percentage in price? The market demand for wheat is Q=100-2p+1pb+2y. If th..
How long it takes charge to leave the capacitor : A 5.0 x 10^6 Ohm resistor is connected in series with a 4.0 x 10^-6 F capacitor. If the capacitor is discharged, what is the value (in seconds) of the "time constant" that characterizes how long it takes charge to leave the capacitor?
Analysis of variance relating casual to weekday : MATH 4044 - Statistics for Data Science Assignment. Carry out a one-way analysis of variance relating casual to weekday. Use contrasts to test at least one a-priori hypothesis of your choice. Examine and comment on residuals. Also carry out appropr..
Determine the pressure drop per 100-m length of horizontal : Determine the pressure drop per 100-m length of horizontal new 0.20-m-diameter cast iron water pipe when the average velocity is 1.7 m/s.
Comment on desirability of computerizing flanders supplies : Comment on the desirability of computerizing Flanders Supplies financial reporting system, the elimination of the work sheet in a computerized accounting system.
Deduce v1 equals v2 when sk equals current forward swap rate : Show that V1 + f = V2, where V1 is the value of a swaption to pay a fixed rate of sK and receive LIBOR between times T1and T2.
Discuss how imperfectly competitive firm resorts to price : Discuss how an imperfectly competitive firm resorts to price discrimination to maximize its profits. One of the criticisms of oligopolies is the adverse impacts these firms have on income distribution. Do you believe that is a valid critism? Discuss ..

Reviews

len1241113

10/13/2016 2:49:23 AM

The submitted assignment needs to be a single file, in either a Microsoft Word (doc or docx) or pdf file format, 25 pages at most excluding any appendices. To achieve maximum marks for each question, you should aim to: Complete the requested statistical analysis in SAS using appropriate tasks or procedures. Provide and interpret only the output most relevant to the question. Do not include every piece of output produced by SAS! Discuss the results in the context of the question.

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd