The problem when choosing a model based just on it's test set error

If we don’t use cross validation sets when choosing the correct model, the d5 model below is likely to be an overly optimistic (overfitted) model for the test set.

One way to break down our dataset into the three sets is:

  1. Training set: 60%
  2. Cross validation set: 20%
  3. Test set: 20%

How To select the best model

We can now calculate three separate error values for the three different sets using the following method:

  1. Optimize the parameters in Θ using the training set for each polynomial degree.