| Package | Description |
|---|---|
| software.amazon.awssdk.services.sagemaker.model |
| Modifier and Type | Method and Description |
|---|---|
ProductionVariant.Builder |
ProductionVariant.Builder.acceleratorType(ProductionVariantAcceleratorType acceleratorType)
The size of the Elastic Inference (EI) instance to use for the production variant.
|
ProductionVariant.Builder |
ProductionVariant.Builder.acceleratorType(String acceleratorType)
The size of the Elastic Inference (EI) instance to use for the production variant.
|
static ProductionVariant.Builder |
ProductionVariant.builder() |
ProductionVariant.Builder |
ProductionVariant.Builder.containerStartupHealthCheckTimeoutInSeconds(Integer containerStartupHealthCheckTimeoutInSeconds)
The timeout value, in seconds, for your inference container to pass health check by SageMaker Hosting.
|
default ProductionVariant.Builder |
ProductionVariant.Builder.coreDumpConfig(Consumer<ProductionVariantCoreDumpConfig.Builder> coreDumpConfig)
Specifies configuration for a core dump from the model container when the process crashes.
|
ProductionVariant.Builder |
ProductionVariant.Builder.coreDumpConfig(ProductionVariantCoreDumpConfig coreDumpConfig)
Specifies configuration for a core dump from the model container when the process crashes.
|
ProductionVariant.Builder |
ProductionVariant.Builder.initialInstanceCount(Integer initialInstanceCount)
Number of instances to launch initially.
|
ProductionVariant.Builder |
ProductionVariant.Builder.initialVariantWeight(Float initialVariantWeight)
Determines initial traffic distribution among all of the models that you specify in the endpoint
configuration.
|
ProductionVariant.Builder |
ProductionVariant.Builder.instanceType(ProductionVariantInstanceType instanceType)
The ML compute instance type.
|
ProductionVariant.Builder |
ProductionVariant.Builder.instanceType(String instanceType)
The ML compute instance type.
|
ProductionVariant.Builder |
ProductionVariant.Builder.modelDataDownloadTimeoutInSeconds(Integer modelDataDownloadTimeoutInSeconds)
The timeout value, in seconds, to download and extract the model that you want to host from Amazon S3 to the
individual inference instance associated with this production variant.
|
ProductionVariant.Builder |
ProductionVariant.Builder.modelName(String modelName)
The name of the model that you want to host.
|
default ProductionVariant.Builder |
ProductionVariant.Builder.serverlessConfig(Consumer<ProductionVariantServerlessConfig.Builder> serverlessConfig)
The serverless configuration for an endpoint.
|
ProductionVariant.Builder |
ProductionVariant.Builder.serverlessConfig(ProductionVariantServerlessConfig serverlessConfig)
The serverless configuration for an endpoint.
|
ProductionVariant.Builder |
ProductionVariant.toBuilder() |
ProductionVariant.Builder |
ProductionVariant.Builder.variantName(String variantName)
The name of the production variant.
|
ProductionVariant.Builder |
ProductionVariant.Builder.volumeSizeInGB(Integer volumeSizeInGB)
The size, in GB, of the ML storage volume attached to individual inference instance associated with the
production variant.
|
| Modifier and Type | Method and Description |
|---|---|
static Class<? extends ProductionVariant.Builder> |
ProductionVariant.serializableBuilderClass() |
Copyright © 2023. All rights reserved.