Competitive Auctions on eBay.com

Reference no: EM13371539

9.1 Competitive Auctions on eBay.com. The eBayAuctions dataset contains information on 1972 auctions transacted on eBay.com during May-June 2004. The goal is to use these data to build a model that will distinguish competitive auctions from noncompetitive ones.

A competitive auction is defined as an auction with at least two bids placed on the item auctioned. The data include variables that describe the item (auction category), the seller (his/her eBay rating), and the auction terms that the seller selected (auction duration, opening price, currency, day-of-week of auction close). In addition, we have the price at which the auction closed. The goal is to predict whether or not the auction will be competitive.

Data Preprocessing. Create dummy variables for the categorical predictors. These include Category (18 categories), Currency (USD, GBP, Euro), EndDay (Monday-Sunday), and Duration (1, 3, 5, 7, or 10 days). Split the data into training and validation datasets using a 60% : 40% ratio.
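The preprocessing step can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the records and the column names (Currency, Duration, OpenPrice) are invented for the example and are not taken from the eBayAuctions data, and the helper functions make_dummies and partition are hypothetical names, not XLMiner features.

```python
import random

def make_dummies(rows, column):
    """Replace a categorical column with one 0/1 indicator per observed level."""
    levels = sorted({row[column] for row in rows})
    encoded = []
    for row in rows:
        # Copy every field except the categorical one being expanded.
        new_row = {k: v for k, v in row.items() if k != column}
        for level in levels:
            new_row[f"{column}_{level}"] = 1 if row[column] == level else 0
        encoded.append(new_row)
    return encoded

def partition(rows, train_fraction=0.6, seed=1):
    """Shuffle the rows and split them into training and validation sets."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = round(train_fraction * len(shuffled))
    return shuffled[:cut], shuffled[cut:]

# Made-up records for illustration only.
auctions = [
    {"Currency": "USD", "Duration": 7, "OpenPrice": 0.99},
    {"Currency": "GBP", "Duration": 5, "OpenPrice": 4.50},
    {"Currency": "EUR", "Duration": 10, "OpenPrice": 9.99},
    {"Currency": "USD", "Duration": 3, "OpenPrice": 1.00},
    {"Currency": "USD", "Duration": 1, "OpenPrice": 25.00},
]

encoded = make_dummies(make_dummies(auctions, "Currency"), "Duration")
train, valid = partition(encoded, train_fraction=0.6)
```

Fixing the seed keeps the 60% : 40% partition reproducible, so the same training and validation sets are used across the parts of the exercise.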

a. Fit a classification tree using all predictors, and use the best-pruned tree. To avoid overfitting, set the minimum number of observations in a leaf node to 50. Also, set the maximum number of levels to be displayed at seven (the maximum allowed in XLMiner). To remain within the limitation of 30 predictors, combine some of the categories of categorical predictors. Write down the results in terms of rules.
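Writing a tree "in terms of rules" means listing one IF-THEN rule per terminal node, where the premise is the chain of split conditions from the root to that node. A minimal sketch, assuming a binary tree stored as nested dictionaries; the tree below, its predictor names (x1, x2), and its split values are entirely hypothetical and do not represent a fitted model for the auction data.

```python
def tree_to_rules(node, conditions=()):
    """Walk a binary tree and return one IF-THEN rule per leaf."""
    if "label" in node:  # terminal node: emit the accumulated conditions
        premise = " AND ".join(conditions) if conditions else "TRUE"
        return [f"IF {premise} THEN class = {node['label']}"]
    var, cut = node["var"], node["cut"]
    left = tree_to_rules(node["left"], conditions + (f"{var} < {cut}",))
    right = tree_to_rules(node["right"], conditions + (f"{var} >= {cut}",))
    return left + right

# Hypothetical tree with invented predictors and split values.
toy_tree = {
    "var": "x1", "cut": 10,
    "left": {"label": "A"},
    "right": {
        "var": "x2", "cut": 5,
        "left": {"label": "B"},
        "right": {"label": "A"},
    },
}

rules = tree_to_rules(toy_tree)
for rule in rules:
    print(rule)
```

Rules that share a class and differ only in a redundant condition can then be merged by hand to reach a smaller rule set.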

b. Is this model practical for predicting the outcome of a new auction?

c. Describe the interesting and uninteresting information that these rules provide.

d. Fit another classification tree (using the best-pruned tree, with a minimum number of observations per leaf node = 50 and maximum allowed number of displayed levels), this time only with predictors that can be used for predicting the outcome of a new auction. Describe the resulting tree in terms of rules. Make sure to report the smallest set of rules required for classification.

e. Plot the resulting tree on a scatterplot: Use the two axes for the two best (quantitative) predictors. Each auction will appear as a point, with coordinates corresponding to its values on those two predictors. Use different colors or symbols to separate competitive and noncompetitive auctions. Draw lines (you can sketch these by hand or use Excel) at the values that create splits. Does this splitting seem reasonable with respect to the meaning of the two predictors? Does it seem to do a good job of separating the two classes?

f. Examine the lift chart and the classification table for the tree. What can you say about the predictive performance of this model?
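For readers working outside XLMiner, the two diagnostics named in part (f) can be computed by hand. The sketch below assumes class labels coded as 0/1 and a model that outputs a score (estimated probability of class 1) for each record; the labels, scores, and the 0.5 cutoff are invented for illustration and are not results from the auction data.

```python
def classification_table(actual, predicted):
    """Count the four cells of a 2x2 classification (confusion) table."""
    table = {"TP": 0, "FP": 0, "TN": 0, "FN": 0}
    for a, p in zip(actual, predicted):
        if p == 1:
            table["TP" if a == 1 else "FP"] += 1
        else:
            table["FN" if a == 1 else "TN"] += 1
    return table

def cumulative_gains(actual, scores):
    """Cumulative count of actual 1s after sorting records by descending score."""
    ranked = sorted(zip(scores, actual), key=lambda pair: pair[0], reverse=True)
    gains, running = [], 0
    for _, a in ranked:
        running += a
        gains.append(running)
    return gains

# Invented labels and scores for illustration only.
actual = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
predicted = [1 if s >= 0.5 else 0 for s in scores]  # assumed cutoff of 0.5

table = classification_table(actual, predicted)
gains = cumulative_gains(actual, scores)
error_rate = (table["FP"] + table["FN"]) / len(actual)
```

Plotting gains against the number of records, next to the straight line from 0 to the total count of 1s, gives the lift chart: the further the curve sits above that line, the better the model ranks records compared with random selection.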

g. Based on this last tree, what can you conclude from these data about the chances of an auction obtaining at least two bids and its relationship to the auction settings set by the seller (duration, opening price, ending day, currency)? What would you recommend for a seller as the strategy that will most likely lead to a competitive auction?

9.2 Predicting Delayed Flights. The file FlightDelays.xls contains information on all commercial flights departing the Washington, D.C., area and arriving at New York during January 2004. For each flight there is information on the departure and arrival airports, the distance of the route, the scheduled time and date of the flight, and so on. The variable that we are trying to predict is whether or not a flight is delayed. A delay is defined as an arrival that is at least 15 minutes later than scheduled.


Data Processing. Create dummies for day of week, carrier, departure airport, and arrival airport. This will give you 17 dummies. Bin the scheduled departure time into 2-hour bins (in XLMiner use Data Utilities > Bin Continuous Data and select 8 bins with equal width). After binning DEP_TIME into 8 bins, this new variable should be broken down into 7 dummies (because the effect will not be linear due to the morning and afternoon rush hours). This will avoid treating the departure time as a continuous predictor because it is reasonable that delays are related to rush-hour times. Partition the data into training and validation sets.
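The binning step and the delay definition can be sketched in plain Python. The sketch assumes the scheduled departure time is expressed in decimal hours and that the schedule spans 6:00 to 22:00, so that 8 equal-width bins are each 2 hours wide; that range is an assumption made for the example, not a fact about FlightDelays.xls, and the function names are hypothetical.

```python
def equal_width_bin(value, low, high, n_bins=8):
    """Return the 0-based index of the equal-width bin that contains value."""
    width = (high - low) / n_bins
    index = int((value - low) // width)
    # Clamp so that value == high falls in the last bin rather than past it.
    return min(max(index, 0), n_bins - 1)

def bin_dummies(bin_index, n_bins=8):
    """Code a bin as n_bins - 1 indicators; bin 0 is the reference level."""
    return [1 if bin_index == k else 0 for k in range(1, n_bins)]

def is_delayed(minutes_late, threshold=15):
    """A flight counts as delayed if it arrives at least 15 minutes late."""
    return 1 if minutes_late >= threshold else 0

# Assumed schedule range of 6:00 to 22:00 (decimal hours).
LOW, HIGH = 6, 22
morning = equal_width_bin(7.0, LOW, HIGH)     # first 2-hour bin
afternoon = equal_width_bin(14.5, LOW, HIGH)  # fifth 2-hour bin
dummies = bin_dummies(afternoon)
```

Using 7 indicators for 8 bins leaves one bin as the reference category, which lets the tree (or any other model) treat each time slot separately instead of assuming a linear effect of departure time.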

a. Fit a classification tree to the flight delay variable using all the relevant predictors. Do not include DEP_TIME (actual departure time) in the model because it is unknown at the time of prediction (unless we are predicting delays after the plane takes off, which is unlikely). In the third step of the classification tree menu, choose:

• "Maximum number levels to be displayed = 6".
• Use the best-pruned tree without a limitation on the minimum number of observations in the final nodes.

Express the resulting tree as a set of rules.

b. If you needed to fly between DCA and EWR on a Monday at 7 AM, would you be able to use this tree? What other information would you need? Is it available in practice? What information is redundant?

c. Fit another tree, this time excluding the day-of-month predictor. (Why?) Select the option of seeing both the full tree and the best pruned tree. You will find that the best pruned tree contains a single terminal node.

i. How is this tree used for classification? (What is the rule for classifying?)
ii. To what is this rule equivalent?
iii. Examine the full tree. What are the top three predictors according to this tree?
iv. Why, technically, does the pruned tree result in a tree with a single node?
v. What is the disadvantage of using the top levels of the full tree as opposed to the best pruned tree?
vi. Compare this general result to that from logistic regression in the example in Chapter 10. What are possible reasons for the classification tree's failure to find a good predictive model?

9.3 Predicting Prices of Used Cars (Regression Trees). The file ToyotaCorolla.xls contains the data on used cars (Toyota Corolla) on sale during late summer of 2004 in The Netherlands. It has 1436 observations containing details on 38 attributes, including Price, Age, KM, HP, and other specifications. The goal is to predict the price of a used Toyota Corolla based on its specifications. (The example in Section 9.8 is a subset of this dataset.)

Data Preprocessing. Create dummy variables for the categorical predictors (Fuel Type and Color). Split the data into training (50%), validation (30%), and test (20%) datasets.

a. Run a regression tree (RT) using the prediction menu in XLMiner with the output variable Price and input variables Age_08_04, KM, Fuel_Type, HP, Automatic, Doors, Quarterly_Tax, Mfg_Guarantee, Guarantee_Period, Airco, Automatic_Airco, CD_Player, Powered_Windows, Sport_Model, and Tow_Bar. Normalize the variables. Keep the minimum number of observations in a terminal node to 1 and the scoring option to Full Tree, to make the run least restrictive.
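To make the two operations in part (a) concrete, the sketch below shows z-score normalization and the search for a single regression-tree split on one numeric predictor, choosing the cut that minimizes the sum of squared errors around the mean in each child node. It is a simplified illustration, not XLMiner's exact procedure, and the age and price values are invented rather than taken from ToyotaCorolla.xls.

```python
from statistics import mean, pstdev

def zscore(values):
    """Standardize values to mean 0 and (population) standard deviation 1."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]

def sse(values):
    """Sum of squared deviations from the mean (0 for an empty list)."""
    if not values:
        return 0.0
    m = mean(values)
    return sum((v - m) ** 2 for v in values)

def best_split(x, y):
    """Return (cut, total_sse) for the single split on x that minimizes SSE of y."""
    best = None
    xs = sorted(set(x))
    # Candidate cuts are midpoints between consecutive distinct x values.
    for lo, hi in zip(xs, xs[1:]):
        cut = (lo + hi) / 2
        left = [yi for xi, yi in zip(x, y) if xi < cut]
        right = [yi for xi, yi in zip(x, y) if xi >= cut]
        total = sse(left) + sse(right)
        if best is None or total < best[1]:
            best = (cut, total)
    return best

# Invented ages (months) and prices for illustration only.
age = [12, 24, 36, 48, 60, 72]
price = [20000, 19000, 18500, 11000, 10500, 9500]

cut, total = best_split(age, price)
age_normalized = zscore(age)
```

A full regression tree repeats this search over every predictor at every node; with a minimum of 1 observation per terminal node, the splitting can continue until each training record sits alone in its own leaf, which is why the full tree fits the training data so closely.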

b. Which appear to be the three or four most important car specifications for predicting the car's price?


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd