| Interface | Description |
|---|---|
| Dataset |
Interface representing a dataset.
|
| DatasetResolver | Deprecated
use the more general
DescriptorResolver |
| DatasetResolverFactory | Deprecated
use
DescriptorResolverFactory as DatasetResolver is deprecated
with DescriptorResolver |
| DatasetsFinder<T extends Dataset> |
Finds
Datasets in the file system. |
| DescriptorResolver |
A resolver transforms an existing
Descriptor to a new one |
| DescriptorResolverFactory |
Factory to create a
DescriptorResolver instance |
| FileSystemDataset |
Dataset in a file system, which can be characterized by a root Path. |
| IterableDatasetFinder<T extends Dataset> | |
| PartitionableDataset<T extends PartitionableDataset.DatasetPartition> |
A
Dataset that can be partitioned into disjoint subsets of the dataset. |
| PartitionableDataset.DatasetPartition |
A partition of a
PartitionableDataset. |
| URNIdentified |
An object that can be identified by URN.
|
| Class | Description |
|---|---|
| DatasetConstants | |
| DatasetDescriptor |
A
Descriptor identifies and provides metadata to describe a dataset |
| Descriptor |
A descriptor is a simplified representation of a resource, which could be a dataset, dataset partition, file, etc.
|
| HiveToHdfsDatasetResolver | |
| HiveToHdfsDatasetResolverFactory | |
| IterableDatasetFinderImpl<T extends Dataset> |
Wraps a
DatasetsFinder into an IterableDatasetFinder. |
| NoopDatasetResolver |
The default
DatasetResolver that directly uses Gobblin raw dataset as job dataset |
| PartitionDescriptor |
A
Descriptor to identifies a partition of a dataset |