ProductionVariant.Builder |
ProductionVariant.Builder.acceleratorType(String acceleratorType) |
The size of the Elastic Inference (EI) instance to use for the production variant.
|
ProductionVariant.Builder |
ProductionVariant.Builder.acceleratorType(ProductionVariantAcceleratorType acceleratorType) |
The size of the Elastic Inference (EI) instance to use for the production variant.
|
static ProductionVariant.Builder |
ProductionVariant.builder() |
|
ProductionVariant.Builder |
ProductionVariant.Builder.containerStartupHealthCheckTimeoutInSeconds(Integer containerStartupHealthCheckTimeoutInSeconds) |
The timeout value, in seconds, for your inference container to pass health check by SageMaker Hosting.
|
default ProductionVariant.Builder |
ProductionVariant.Builder.coreDumpConfig(Consumer<ProductionVariantCoreDumpConfig.Builder> coreDumpConfig) |
Specifies configuration for a core dump from the model container when the process crashes.
|
ProductionVariant.Builder |
ProductionVariant.Builder.coreDumpConfig(ProductionVariantCoreDumpConfig coreDumpConfig) |
Specifies configuration for a core dump from the model container when the process crashes.
|
ProductionVariant.Builder |
ProductionVariant.Builder.enableSSMAccess(Boolean enableSSMAccess) |
You can use this parameter to turn on native Amazon Web Services Systems Manager (SSM) access for a
production variant behind an endpoint.
|
ProductionVariant.Builder |
ProductionVariant.Builder.initialInstanceCount(Integer initialInstanceCount) |
Number of instances to launch initially.
|
ProductionVariant.Builder |
ProductionVariant.Builder.initialVariantWeight(Float initialVariantWeight) |
Determines initial traffic distribution among all of the models that you specify in the endpoint
configuration.
|
ProductionVariant.Builder |
ProductionVariant.Builder.instanceType(String instanceType) |
The ML compute instance type.
|
ProductionVariant.Builder |
ProductionVariant.Builder.instanceType(ProductionVariantInstanceType instanceType) |
The ML compute instance type.
|
ProductionVariant.Builder |
ProductionVariant.Builder.modelDataDownloadTimeoutInSeconds(Integer modelDataDownloadTimeoutInSeconds) |
The timeout value, in seconds, to download and extract the model that you want to host from Amazon S3 to the
individual inference instance associated with this production variant.
|
ProductionVariant.Builder |
ProductionVariant.Builder.modelName(String modelName) |
The name of the model that you want to host.
|
default ProductionVariant.Builder |
ProductionVariant.Builder.serverlessConfig(Consumer<ProductionVariantServerlessConfig.Builder> serverlessConfig) |
The serverless configuration for an endpoint.
|
ProductionVariant.Builder |
ProductionVariant.Builder.serverlessConfig(ProductionVariantServerlessConfig serverlessConfig) |
The serverless configuration for an endpoint.
|
ProductionVariant.Builder |
ProductionVariant.toBuilder() |
|
ProductionVariant.Builder |
ProductionVariant.Builder.variantName(String variantName) |
The name of the production variant.
|
ProductionVariant.Builder |
ProductionVariant.Builder.volumeSizeInGB(Integer volumeSizeInGB) |
The size, in GB, of the ML storage volume attached to individual inference instance associated with the
production variant.
|