public class TatoebaEnglishFrenchDataset extends TextDataset
TatoebaEnglishFrenchDataset is a English-French machine translation dataset from The
Tatoeba Project (http://www.manythings.org/anki/).| Modifier and Type | Class and Description |
|---|---|
static class |
TatoebaEnglishFrenchDataset.Builder
A builder for a
TatoebaEnglishFrenchDataset. |
TextDataset.Samplemanager, prepared, resource, samples, sourceTextData, targetTextData, usage| Modifier | Constructor and Description |
|---|---|
protected |
TatoebaEnglishFrenchDataset(TatoebaEnglishFrenchDataset.Builder builder)
Creates a new instance of
TatoebaEnglishFrenchDataset. |
| Modifier and Type | Method and Description |
|---|---|
protected long |
availableSize() |
static TatoebaEnglishFrenchDataset.Builder |
builder()
Creates a new builder to build a
TatoebaEnglishFrenchDataset. |
ai.djl.training.dataset.Record |
get(ai.djl.ndarray.NDManager manager,
long index) |
void |
prepare(ai.djl.util.Progress progress) |
getProcessedText, getRawText, getSamples, getTextEmbedding, getVocabulary, preprocessgetData, getData, getData, getData, randomSplit, size, subDataset, toArrayprotected TatoebaEnglishFrenchDataset(TatoebaEnglishFrenchDataset.Builder builder)
TatoebaEnglishFrenchDataset.builder - the builder object to build frompublic static TatoebaEnglishFrenchDataset.Builder builder()
TatoebaEnglishFrenchDataset.public void prepare(ai.djl.util.Progress progress)
throws java.io.IOException,
ai.djl.modality.nlp.embedding.EmbeddingException
java.io.IOExceptionai.djl.modality.nlp.embedding.EmbeddingExceptionpublic ai.djl.training.dataset.Record get(ai.djl.ndarray.NDManager manager,
long index)
get in class ai.djl.training.dataset.RandomAccessDatasetprotected long availableSize()
availableSize in class ai.djl.training.dataset.RandomAccessDataset