Package ai.djl.basicdataset.nlp
Class TatoebaEnglishFrenchDataset
- java.lang.Object
-
- ai.djl.training.dataset.RandomAccessDataset
-
- ai.djl.basicdataset.nlp.TextDataset
-
- ai.djl.basicdataset.nlp.TatoebaEnglishFrenchDataset
-
- All Implemented Interfaces:
ai.djl.training.dataset.Dataset
public class TatoebaEnglishFrenchDataset extends TextDataset
TatoebaEnglishFrenchDatasetis a English-French machine translation dataset from The Tatoeba Project (http://www.manythings.org/anki/).
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classTatoebaEnglishFrenchDataset.BuilderA builder for aTatoebaEnglishFrenchDataset.-
Nested classes/interfaces inherited from class ai.djl.basicdataset.nlp.TextDataset
TextDataset.Sample
-
-
Field Summary
-
Fields inherited from class ai.djl.basicdataset.nlp.TextDataset
manager, mrl, prepared, samples, sourceTextData, targetTextData, usage
-
-
Constructor Summary
Constructors Modifier Constructor Description protectedTatoebaEnglishFrenchDataset(TatoebaEnglishFrenchDataset.Builder builder)Creates a new instance ofTatoebaEnglishFrenchDataset.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description protected longavailableSize()static TatoebaEnglishFrenchDataset.Builderbuilder()Creates a new builder to build aTatoebaEnglishFrenchDataset.ai.djl.training.dataset.Recordget(ai.djl.ndarray.NDManager manager, long index)voidprepare(ai.djl.util.Progress progress)-
Methods inherited from class ai.djl.basicdataset.nlp.TextDataset
getProcessedText, getRawText, getSamples, getTextEmbedding, getVocabulary, preprocess
-
Methods inherited from class ai.djl.training.dataset.RandomAccessDataset
getData, getData, getData, getData, newSubDataset, newSubDataset, randomSplit, size, subDataset, subDataset, subDataset, subDataset, toArray
-
-
-
-
Constructor Detail
-
TatoebaEnglishFrenchDataset
protected TatoebaEnglishFrenchDataset(TatoebaEnglishFrenchDataset.Builder builder)
Creates a new instance ofTatoebaEnglishFrenchDataset.- Parameters:
builder- the builder object to build from
-
-
Method Detail
-
builder
public static TatoebaEnglishFrenchDataset.Builder builder()
Creates a new builder to build aTatoebaEnglishFrenchDataset.- Returns:
- a new builder
-
prepare
public void prepare(ai.djl.util.Progress progress) throws java.io.IOException, ai.djl.modality.nlp.embedding.EmbeddingException- Throws:
java.io.IOExceptionai.djl.modality.nlp.embedding.EmbeddingException
-
get
public ai.djl.training.dataset.Record get(ai.djl.ndarray.NDManager manager, long index)- Specified by:
getin classai.djl.training.dataset.RandomAccessDataset
-
availableSize
protected long availableSize()
- Specified by:
availableSizein classai.djl.training.dataset.RandomAccessDataset
-
-