- but won’t the later folds yield better results? They are trained on more data
- I should look at more time series problems to see how they do CV
- some people just average the results of these 4 folds, even though the later folds should be more accurate than the middle folds
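A rough sketch to make the question above concrete, assuming the 4 folds come from an expanding-window split (each fold trains on everything before its test block). The helper names `expanding_window_splits` and `average_scores`, and the series length of 100, are made up for illustration. Weighting each fold by its training size is only one possible alternative to the plain average, not an established rule.

```python
def expanding_window_splits(n_samples, n_splits=4):
    """Yield (train_indices, test_indices) with a growing training window.

    Hypothetical helper: the series is cut into n_splits + 1 equal blocks,
    and each fold tests on one block and trains on all data before it.
    """
    test_size = n_samples // (n_splits + 1)
    first_test_start = n_samples - n_splits * test_size
    for start in range(first_test_start, n_samples, test_size):
        yield range(0, start), range(start, start + test_size)


def average_scores(scores, weights=None):
    """Plain mean of fold scores, or a weighted mean if weights are given."""
    if weights is None:
        return sum(scores) / len(scores)
    return sum(s * w for s, w in zip(scores, weights)) / sum(weights)


# Assumed series length, chosen only to show how the training window grows.
splits = list(expanding_window_splits(n_samples=100, n_splits=4))
train_sizes = [len(train) for train, _ in splits]
test_sizes = [len(test) for _, test in splits]

for fold, (train, test) in enumerate(splits, start=1):
    print(f"fold {fold}: train on 0..{train[-1]}, test on {test[0]}..{test[-1]}")
```

Per-fold scores from a real model could then be passed to `average_scores(scores)` for the plain average, or to `average_scores(scores, weights=train_sizes)` so that folds trained on more data count for more.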