Stratified Sampling
Split the dataset into training/testing based on the proportion in the original dataset.
Cross Validation
- K-fold cross validation
- Randomly split the training set into K distinct subsets(folds)
- Train & evaluate K times, using some folds for evaluation and others for training
Fine Tuning Hyperparameters
Grid search to find the best set of hyperparameters.