Step 1: Understanding the Question:
The question asks for the number of validation splits created when using Leave-one-out-Cross-Validation (LOOCV). It is specified that LOOCV is used for model selection *before* the final testing phase. This means LOOCV is applied only to the training dataset.
Step 2: Key Concepts:
- Train-Test Split: The data is first split into a training set and a testing set. The testing set is kept aside and is only used for the final evaluation of the model.
- Leave-one-out-Cross-Validation (LOOCV): This is a specific type of cross-validation where the number of "folds" or "splits" is equal to the number of samples in the dataset being used. For a dataset of size N, LOOCV involves N iterations. In each iteration, one sample is held out for validation, and the model is trained on the remaining N-1 samples.
Step 3: Detailed Explanation:
First, we determine the size of the training dataset.
- Total number of samples = 1000.
- Number of samples for testing = 100.
- Therefore, the number of samples available for training and model selection is:
\[ \text{Training data size} = 1000 - 100 = 900 \]
LOOCV is performed on this training dataset of 900 samples.
According to the definition of LOOCV, the number of validation splits is equal to the number of samples in the dataset it is applied to.
Since the training data has 900 samples, LOOCV will create 900 validation splits. In each split, 1 sample will be used for validation, and the remaining 899 samples will be used for training.
Step 4: Final Answer:
The number of validation splits generated by LOOCV on the training data is 900.