what is Resampling ?

Validation set approach

Problems in this setting

  1. highly variable test scores, based on the observations included in the respective sets

    Screenshot 2025-07-31 at 1.42.05 PM.png

  2. data loss, only a subset of the total data corpus are used to train when we are aware that statistical methods perform better with more data

Leave-One-Out Cross Validation

K-Fold cross validation

Bootstrapping