Class TatoebaEnglishFrenchDataset

  • All Implemented Interfaces:
    ai.djl.training.dataset.Dataset

    public class TatoebaEnglishFrenchDataset
    extends TextDataset
    TatoebaEnglishFrenchDataset is a English-French machine translation dataset from The Tatoeba Project (http://www.manythings.org/anki/).
    • Constructor Detail

      • TatoebaEnglishFrenchDataset

        protected TatoebaEnglishFrenchDataset​(TatoebaEnglishFrenchDataset.Builder builder)
        Creates a new instance of TatoebaEnglishFrenchDataset.
        Parameters:
        builder - the builder object to build from
    • Method Detail

      • prepare

        public void prepare​(ai.djl.util.Progress progress)
                     throws java.io.IOException,
                            ai.djl.modality.nlp.embedding.EmbeddingException
        Throws:
        java.io.IOException
        ai.djl.modality.nlp.embedding.EmbeddingException
      • get

        public ai.djl.training.dataset.Record get​(ai.djl.ndarray.NDManager manager,
                                                  long index)
        Specified by:
        get in class ai.djl.training.dataset.RandomAccessDataset
      • availableSize

        protected long availableSize()
        Specified by:
        availableSize in class ai.djl.training.dataset.RandomAccessDataset