Why does shuffling defeat the purpose of k-fold cv

Assignment Help Other Engineering
Reference no: EM131926805

Problem

1. Why is shuffling a dataset before conducting k-fold CV generally a bad idea in finance? What is the purpose of shuffling? Why does shuffling defeat the purpose of k-fold CV in financial datasets?

2. Take a pair of matrices (X, y), representing observed features and labels. These could be one of the datasets derived from the exercises in Chapter 3.

(a) Derive the performance from a 10-fold CV of an RF classifier on (X, y), without shuffling.

(b) Derive the performance from a 10-fold CV of an RF on (X, y), with shuffling.

(c) Why are both results so different?

(d) How does shuffling leak information?

Text Book: Advances in Financial Machine Learning By Marcos Lopez de Prado.

Reference no: EM131926805

Questions Cloud

Compute the rolling standard deviation of two-sampled series : Compute the rolling standard deviation of the two-sampled series. Which one is least heteroscedastic? What is the reason for these results?
What method achieves the lowest test statistic : Apply the Jarque-Bera normality test on returns from the three bar types. What method achieves the lowest test statistic?
What would be the mathematical grounds for disregarding : Suppose that you develop a momentum strategy on a futures contract. What would be the mathematical grounds for disregarding the second result, if any?
What is the key to avoiding selection bias : What if we split the dataset in three sets: training, validation, and testing? What is the key to avoiding selection bias?
Why does shuffling defeat the purpose of k-fold cv : Why is shuffling a dataset before conducting k-fold CV generally a bad idea in finance? Why does shuffling defeat the purpose of k-fold CV in financial dataset?
What can you conclude about productivity improvement : Last year, Chop-M-Up, Inc. implemented a new labor process and redesigned its product, hoping to increase input usage efficiency.
Should the proposed discount be offered : The firm's current average collection period is 60 days, sales are 40,000 units, selling price is $45 per unit, and variable cost per unit is $36.
What break date does the method select : Compute the SDFC (Chow-type) explosiveness test. What break date does this method select? Is this what you expected?
Calculate what the annual payments and interest charges : The market discount rate is assumed to be 6%. Beyond this initial tabulation, estimate the present worth of the annual payments.

Reviews

Write a Review

Other Engineering Questions & Answers

  Characterization technology for nanomaterials

Calculate the reciprocal lattice of the body-centred cubic and Show that the reciprocal of the face-centred cubic (fcc) structure is itself a bcc structure.

  Calculate the gasoline savings

How much gasoline do vehicles with the following fuel efficiencies consume in one year? Calculate the gasoline savings, in gallons per year, created by the following two options. Show all your work, and draw boxes around your answers.

  Design and modelling of adsorption chromatography

Design and modelling of adsorption chromatography based on isotherm data

  Application of mechatronics engineering

Write an essay on Application of Mechatronics Engineering

  Growth chracteristics of the organism

To examine the relationship between fermenter design and operating conditions, oxygen transfer capability and microbial growth.

  Block diagram, system performance and responses

Questions based on Block Diagram, System Performance and Responses.

  Explain the difference in a technical performance measure

good understanding of Mil-Std-499 and Mil-Std-499A

  Electrode impedances

How did this procedure affect the signal observed from the electrode and the electrode impedances?

  Write a report on environmental companies

Write a report on environmental companies

  Scanning electron microscopy

Prepare a schematic diagram below of the major parts of the SEM

  Design a pumping and piping system

creating the pumping and piping system to supply cool water to the condenser

  A repulsive potential energy should be a positive one

Using the data provided on the webvista site in the file marked vdw.txt, try to develop a mathematical equation for the vdW potential we discussed in class, U(x), that best fits the data

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd