- withConsumerBlackList(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
Black list used by directory partitioner to filter out unwanted files in a hdfs directory.
- withConsumerBufferCapacity(long) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
The capacity of the hdfs consumer buffer - the blocking queue used for storing messages.
- withConsumerGroupPattern(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
Group pattern used by directory partitioner for advanced partitioning.
- withConsumerNumMaxRetries(long) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
Number of max retries for the hdfs consumer readers per partition.
- withConsumerWhiteList(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
White list used by directory partitioner to filter out unwanted files in a hdfs directory.
- withDatePathFormat(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
In an HdfsWriter implementation that performs time-based output bucketing,
the user may configure a date format (suitable for inclusion in a file path)
using SimpleDateFormat formatting that the Bucketer implementation will
use to generate HDFS paths and filenames.
- withOutputBaseDir(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
The base output directory into which all HDFS output for this job will be written.
- withReaderType(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
The type of the file reader for consumer (avro, plain, etc.)
- withStagingDirectory(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
Staging directory for storing partition description.
- withWriteBatchSizeBytes(long) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
Split output files from all writer tasks based on # of bytes written to optimize
MapReduce utilization for Hadoop jobs that will process the data later.
- withWriteBatchSizeRecords(long) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
Split output files from all writer tasks based on # of bytes written to optimize
MapReduce utilization for Hadoop jobs that will process the data later.
- withWriteCompressionType(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
Simple, human-readable label for various compression options.
- withWriterClassName(String) - Method in class org.apache.samza.system.hdfs.descriptors.HdfsSystemDescriptor
-
The fully-qualified class name of the HdfsWriter subclass that will write for this system.