Overfitting is a risk. To detect it, we split the data into training and test sets. Using too many features is one source of this risk: adding features always improves (or at least never hurts) the training score, but not necessarily the test score.
As the model gets more complex, both the training and test scores improve at first. At some point, however, the test score stops improving while the training score keeps rising: that is overfitting.
Structural risk minimisation can address this trade-off: use separate training and test sets, train the model on the training set, and rate it on the test set.
The structural risk minimisation curve plots the accuracy of both sets against model complexity.
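A minimal sketch of how such a curve can be computed, assuming scikit-learn and a synthetic 1-D regression problem; the dataset, the use of polynomial degree as the complexity axis, and the degree range are illustrative assumptions, not from the source:

```python
# Sketch: train/test score over model complexity (polynomial degree).
# Assumes scikit-learn; data is synthetic and illustrative.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=200)

# split into training and test sets
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

for degree in range(1, 15):
    # model complexity = degree of the polynomial features
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    # training score keeps rising; test score eventually drops off
    print(degree, model.score(X_train, y_train), model.score(X_test, y_test))
```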
To avoid overfitting:
+ reduce the number of features
+ do model selection
+ use regularisation (sketched below)
+ do cross-validation
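As an illustration of regularisation, a minimal sketch using ridge regression on the same split as above; the pipeline, the fixed degree, and the alpha values are assumptions, with alpha playing the role of lambda:

```python
# Sketch of regularisation as a remedy for overfitting: ridge regression
# penalises large coefficients. Assumes scikit-learn and the
# X_train/X_test split from the previous sketch.
from sklearn.linear_model import Ridge
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.pipeline import make_pipeline

for alpha in [1e-6, 0.01, 1.0, 100.0]:
    # alpha plays the role of lambda: larger = stronger penalty
    model = make_pipeline(PolynomialFeatures(12), StandardScaler(),
                          Ridge(alpha=alpha))
    model.fit(X_train, y_train)
    print(alpha, model.score(X_train, y_train), model.score(X_test, y_test))
```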
Model selection can also be used to choose other model parameters (hyperparameters).
How do we evaluate a model?
One option is k-fold cross-validation: given an algorithm A and a dataset D, divide D into k equal-sized subsets.
For each subset, train the model on the other k-1 subsets and test on the held-out subset. Average the error across the folds.
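A from-scratch sketch of this procedure; the function name k_fold_cv and the scikit-learn-style fit/score interface assumed for algo are hypothetical:

```python
# Sketch of k-fold cross-validation. Assumes `algo` is an estimator
# with fit(X, y) and score(X, y) methods (hypothetical interface).
import numpy as np

def k_fold_cv(algo, X, y, k=5, seed=0):
    indices = np.random.default_rng(seed).permutation(len(X))
    folds = np.array_split(indices, k)          # k roughly equal subsets
    scores = []
    for i in range(k):
        test_idx = folds[i]                     # held-out fold
        train_idx = np.concatenate(
            [folds[j] for j in range(k) if j != i])
        algo.fit(X[train_idx], y[train_idx])    # train on the other k-1 folds
        scores.append(algo.score(X[test_idx], y[test_idx]))
    return np.mean(scores)                      # average score across folds
```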
One caveat is the problem of different sample sizes: each validation set is smaller than the full dataset, so different hyperparameters could be more appropriate for the final model trained on all the data.
Choices that can be tuned this way:
+ which features to remove or add
+ the regularisation strength lambda
+ the degree of polynomial features
A sketch of such a search follows.
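A sketch of searching over these choices with cross-validated grid search, assuming scikit-learn and the (X, y) data from the first sketch; the parameter grid is illustrative:

```python
# Sketch: choose lambda (Ridge alpha) and the polynomial degree by
# 5-fold cross-validation. Assumes scikit-learn; grid values are made up.
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.linear_model import Ridge

pipe = Pipeline([
    ("poly", PolynomialFeatures()),
    ("scale", StandardScaler()),
    ("ridge", Ridge()),
])
param_grid = {
    "poly__degree": [1, 3, 5, 8, 12],   # polynomial features
    "ridge__alpha": [0.01, 0.1, 1.0, 10.0],  # regularisation strength
}
search = GridSearchCV(pipe, param_grid, cv=5)  # 5-fold cross-validation
search.fit(X, y)
print(search.best_params_, search.best_score_)
```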