highly variable test scores, based on the observations included in the respective sets
data loss, only a subset of the total data corpus are used to train when we are aware that statistical methods perform better with more data
similar to Validation set approach, but here we select a single data point every point for validation and repeat this several times on the entire dataset
test error is averaged over all the iterations