Uploaded image for project: 'Red Hat Enterprise Linux AI'
  1. Red Hat Enterprise Linux AI
  2. RHELAI-3903

Add unmasking to SDG and deprecate legacy pretraining format

XMLWordPrintable

      Goal:

      To support generating data for use in training with non-Granite student models, we need to support a new unmask parameter in SDG and stop using the legacy pretraining format that embedded the student model chat template in the generated pretraining samples.

      This is needed to unblock third party student model training support.

              bbrownin@redhat.com Ben Browning
              bbrownin@redhat.com Ben Browning
              Oleg Silkin
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: