Interface S3ExcelSource.Builder

    • Method Detail

      • name

        S3ExcelSource.Builder name​(String name)

        The name of the S3 Excel data source.

        Parameters:
        name - The name of the S3 Excel data source.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • paths

        S3ExcelSource.Builder paths​(Collection<String> paths)

        The S3 paths where the Excel files are located.

        Parameters:
        paths - The S3 paths where the Excel files are located.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • paths

        S3ExcelSource.Builder paths​(String... paths)

        The S3 paths where the Excel files are located.

        Parameters:
        paths - The S3 paths where the Excel files are located.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • compressionType

        S3ExcelSource.Builder compressionType​(String compressionType)

        The compression format used for the Excel files.

        Parameters:
        compressionType - The compression format used for the Excel files.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
        See Also:
        ParquetCompressionType, ParquetCompressionType
      • exclusions

        S3ExcelSource.Builder exclusions​(Collection<String> exclusions)

        Patterns to exclude specific files or paths from processing.

        Parameters:
        exclusions - Patterns to exclude specific files or paths from processing.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • exclusions

        S3ExcelSource.Builder exclusions​(String... exclusions)

        Patterns to exclude specific files or paths from processing.

        Parameters:
        exclusions - Patterns to exclude specific files or paths from processing.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • groupSize

        S3ExcelSource.Builder groupSize​(String groupSize)

        Defines the size of file groups for batch processing.

        Parameters:
        groupSize - Defines the size of file groups for batch processing.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • groupFiles

        S3ExcelSource.Builder groupFiles​(String groupFiles)

        Specifies how files should be grouped for processing.

        Parameters:
        groupFiles - Specifies how files should be grouped for processing.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • recurse

        S3ExcelSource.Builder recurse​(Boolean recurse)

        Indicates whether to recursively process subdirectories.

        Parameters:
        recurse - Indicates whether to recursively process subdirectories.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • maxBand

        S3ExcelSource.Builder maxBand​(Integer maxBand)

        The maximum number of processing bands to use.

        Parameters:
        maxBand - The maximum number of processing bands to use.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • maxFilesInBand

        S3ExcelSource.Builder maxFilesInBand​(Integer maxFilesInBand)

        The maximum number of files to process in each band.

        Parameters:
        maxFilesInBand - The maximum number of files to process in each band.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • additionalOptions

        S3ExcelSource.Builder additionalOptions​(S3DirectSourceAdditionalOptions additionalOptions)

        Additional configuration options for S3 direct source processing.

        Parameters:
        additionalOptions - Additional configuration options for S3 direct source processing.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • numberRows

        S3ExcelSource.Builder numberRows​(Long numberRows)

        The number of rows to process from each Excel file.

        Parameters:
        numberRows - The number of rows to process from each Excel file.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • skipFooter

        S3ExcelSource.Builder skipFooter​(Integer skipFooter)

        The number of rows to skip at the end of each Excel file.

        Parameters:
        skipFooter - The number of rows to skip at the end of each Excel file.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • outputSchemas

        S3ExcelSource.Builder outputSchemas​(Collection<GlueSchema> outputSchemas)

        The Glue schemas to apply to the processed data.

        Parameters:
        outputSchemas - The Glue schemas to apply to the processed data.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • outputSchemas

        S3ExcelSource.Builder outputSchemas​(GlueSchema... outputSchemas)

        The Glue schemas to apply to the processed data.

        Parameters:
        outputSchemas - The Glue schemas to apply to the processed data.
        Returns:
        Returns a reference to this object so that method calls can be chained together.