Interface CreateDataSourceFromRdsRequest.Builder

    • Method Detail

      • dataSourceId

        CreateDataSourceFromRdsRequest.Builder dataSourceId​(String dataSourceId)

        A user-supplied ID that uniquely identifies the DataSource. Typically, an Amazon Resource Number (ARN) becomes the ID for a DataSource.

        Parameters:
        dataSourceId - A user-supplied ID that uniquely identifies the DataSource. Typically, an Amazon Resource Number (ARN) becomes the ID for a DataSource.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • dataSourceName

        CreateDataSourceFromRdsRequest.Builder dataSourceName​(String dataSourceName)

        A user-supplied name or description of the DataSource.

        Parameters:
        dataSourceName - A user-supplied name or description of the DataSource.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • rdsData

        CreateDataSourceFromRdsRequest.Builder rdsData​(RDSDataSpec rdsData)

        The data specification of an Amazon RDS DataSource:

        • DatabaseInformation -

          • DatabaseName - The name of the Amazon RDS database.

          • InstanceIdentifier - A unique identifier for the Amazon RDS database instance.

        • DatabaseCredentials - AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon RDS database.

        • ResourceRole - A role (DataPipelineDefaultResourceRole) assumed by an EC2 instance to carry out the copy task from Amazon RDS to Amazon Simple Storage Service (Amazon S3). For more information, see Role templates for data pipelines.

        • ServiceRole - A role (DataPipelineDefaultRole) assumed by the AWS Data Pipeline service to monitor the progress of the copy task from Amazon RDS to Amazon S3. For more information, see Role templates for data pipelines.

        • SecurityInfo - The security information to use to access an RDS DB instance. You need to set up appropriate ingress rules for the security entity IDs provided to allow access to the Amazon RDS instance. Specify a [ SubnetId, SecurityGroupIds] pair for a VPC-based RDS DB instance.

        • SelectSqlQuery - A query that is used to retrieve the observation data for the Datasource.

        • S3StagingLocation - The Amazon S3 location for staging Amazon RDS data. The data retrieved from Amazon RDS using SelectSqlQuery is stored in this location.

        • DataSchemaUri - The Amazon S3 location of the DataSchema.

        • DataSchema - A JSON string representing the schema. This is not required if DataSchemaUri is specified.

        • DataRearrangement - A JSON string that represents the splitting and rearrangement requirements for the Datasource.

          Sample - "{\"splitting\":{\"percentBegin\":10,\"percentEnd\":60}}"

        Parameters:
        rdsData - The data specification of an Amazon RDS DataSource:

        • DatabaseInformation -

          • DatabaseName - The name of the Amazon RDS database.

          • InstanceIdentifier - A unique identifier for the Amazon RDS database instance.

        • DatabaseCredentials - AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon RDS database.

        • ResourceRole - A role (DataPipelineDefaultResourceRole) assumed by an EC2 instance to carry out the copy task from Amazon RDS to Amazon Simple Storage Service (Amazon S3). For more information, see Role templates for data pipelines.

        • ServiceRole - A role (DataPipelineDefaultRole) assumed by the AWS Data Pipeline service to monitor the progress of the copy task from Amazon RDS to Amazon S3. For more information, see Role templates for data pipelines.

        • SecurityInfo - The security information to use to access an RDS DB instance. You need to set up appropriate ingress rules for the security entity IDs provided to allow access to the Amazon RDS instance. Specify a [SubnetId, SecurityGroupIds] pair for a VPC-based RDS DB instance.

        • SelectSqlQuery - A query that is used to retrieve the observation data for the Datasource .

        • S3StagingLocation - The Amazon S3 location for staging Amazon RDS data. The data retrieved from Amazon RDS using SelectSqlQuery is stored in this location.

        • DataSchemaUri - The Amazon S3 location of the DataSchema.

        • DataSchema - A JSON string representing the schema. This is not required if DataSchemaUri is specified.

        • DataRearrangement - A JSON string that represents the splitting and rearrangement requirements for the Datasource.

          Sample - "{\"splitting\":{\"percentBegin\":10,\"percentEnd\":60}}"

        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • rdsData

        default CreateDataSourceFromRdsRequest.Builder rdsData​(Consumer<RDSDataSpec.Builder> rdsData)

        The data specification of an Amazon RDS DataSource:

        • DatabaseInformation -

          • DatabaseName - The name of the Amazon RDS database.

          • InstanceIdentifier - A unique identifier for the Amazon RDS database instance.

        • DatabaseCredentials - AWS Identity and Access Management (IAM) credentials that are used to connect to the Amazon RDS database.

        • ResourceRole - A role (DataPipelineDefaultResourceRole) assumed by an EC2 instance to carry out the copy task from Amazon RDS to Amazon Simple Storage Service (Amazon S3). For more information, see Role templates for data pipelines.

        • ServiceRole - A role (DataPipelineDefaultRole) assumed by the AWS Data Pipeline service to monitor the progress of the copy task from Amazon RDS to Amazon S3. For more information, see Role templates for data pipelines.

        • SecurityInfo - The security information to use to access an RDS DB instance. You need to set up appropriate ingress rules for the security entity IDs provided to allow access to the Amazon RDS instance. Specify a [ SubnetId, SecurityGroupIds] pair for a VPC-based RDS DB instance.

        • SelectSqlQuery - A query that is used to retrieve the observation data for the Datasource.

        • S3StagingLocation - The Amazon S3 location for staging Amazon RDS data. The data retrieved from Amazon RDS using SelectSqlQuery is stored in this location.

        • DataSchemaUri - The Amazon S3 location of the DataSchema.

        • DataSchema - A JSON string representing the schema. This is not required if DataSchemaUri is specified.

        • DataRearrangement - A JSON string that represents the splitting and rearrangement requirements for the Datasource.

          Sample - "{\"splitting\":{\"percentBegin\":10,\"percentEnd\":60}}"

        This is a convenience method that creates an instance of the RDSDataSpec.Builder avoiding the need to create one manually via RDSDataSpec.builder().

        When the Consumer completes, SdkBuilder.build() is called immediately and its result is passed to rdsData(RDSDataSpec).

        Parameters:
        rdsData - a consumer that will call methods on RDSDataSpec.Builder
        Returns:
        Returns a reference to this object so that method calls can be chained together.
        See Also:
        rdsData(RDSDataSpec)
      • roleARN

        CreateDataSourceFromRdsRequest.Builder roleARN​(String roleARN)

        The role that Amazon ML assumes on behalf of the user to create and activate a data pipeline in the user's account and copy data using the SelectSqlQuery query from Amazon RDS to Amazon S3.

        Parameters:
        roleARN - The role that Amazon ML assumes on behalf of the user to create and activate a data pipeline in the user's account and copy data using the SelectSqlQuery query from Amazon RDS to Amazon S3.

        Returns:
        Returns a reference to this object so that method calls can be chained together.
      • computeStatistics

        CreateDataSourceFromRdsRequest.Builder computeStatistics​(Boolean computeStatistics)

        The compute statistics for a DataSource. The statistics are generated from the observation data referenced by a DataSource. Amazon ML uses the statistics internally during MLModel training. This parameter must be set to true if the DataSource needs to be used for MLModel training.

        Parameters:
        computeStatistics - The compute statistics for a DataSource. The statistics are generated from the observation data referenced by a DataSource. Amazon ML uses the statistics internally during MLModel training. This parameter must be set to true if the DataSource needs to be used for MLModel training.
        Returns:
        Returns a reference to this object so that method calls can be chained together.