Uploaded image for project: 'Red Hat Enterprise Linux AI'
  1. Red Hat Enterprise Linux AI
  2. RHELAI-4006

Add support for validation dataset

XMLWordPrintable

      Currently, there is no way to provide both a training and validation set to the training code. Ideally it would be possible to provide one of:

      • 2 datasets (one for training, the other for validation)
      • 1 dataset dict that contains a predefined train/val split
      • 1 dataset and a percentage to randomly split the dataset into train and val

      In addition, the user should be able to specify how frequently to evaluate the model on the validation dataset. 

      Then during the main training loop, the model's validation loss will be computed at the desired frequency and logged.

      This is an essential component as it allows us to verify that the model is not overfitting to the training data, but has learned to generalize to unseen data as well.

              rh-ee-fschmitt Fynn Schmitt-Ulms
              rh-ee-fschmitt Fynn Schmitt-Ulms
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: