| Package | Description |
|---|---|
| com.amazonaws.services.sagemaker.model | |

| Modifier and Type | Method and Description |
|---|---|
| `ProductionVariant` | `ProductionVariant.clone()` |
| `ProductionVariant` | `ProductionVariant.withAcceleratorType(ProductionVariantAcceleratorType acceleratorType)`<br>The size of the Elastic Inference (EI) instance to use for the production variant. |
| `ProductionVariant` | `ProductionVariant.withAcceleratorType(String acceleratorType)`<br>The size of the Elastic Inference (EI) instance to use for the production variant. |
| `ProductionVariant` | `ProductionVariant.withContainerStartupHealthCheckTimeoutInSeconds(Integer containerStartupHealthCheckTimeoutInSeconds)`<br>The timeout value, in seconds, for your inference container to pass the health check by SageMaker Hosting. |
| `ProductionVariant` | `ProductionVariant.withCoreDumpConfig(ProductionVariantCoreDumpConfig coreDumpConfig)`<br>Specifies configuration for a core dump from the model container when the process crashes. |
| `ProductionVariant` | `ProductionVariant.withEnableSSMAccess(Boolean enableSSMAccess)`<br>You can use this parameter to turn on native Amazon Web Services Systems Manager (SSM) access for a production variant behind an endpoint. |
| `ProductionVariant` | `ProductionVariant.withInferenceAmiVersion(ProductionVariantInferenceAmiVersion inferenceAmiVersion)`<br>Specifies an option from a collection of preconfigured Amazon Machine Image (AMI) images. |
| `ProductionVariant` | `ProductionVariant.withInferenceAmiVersion(String inferenceAmiVersion)`<br>Specifies an option from a collection of preconfigured Amazon Machine Image (AMI) images. |
| `ProductionVariant` | `ProductionVariant.withInitialInstanceCount(Integer initialInstanceCount)`<br>Number of instances to launch initially. |
| `ProductionVariant` | `ProductionVariant.withInitialVariantWeight(Float initialVariantWeight)`<br>Determines initial traffic distribution among all of the models that you specify in the endpoint configuration. |
| `ProductionVariant` | `ProductionVariant.withInstanceType(ProductionVariantInstanceType instanceType)`<br>The ML compute instance type. |
| `ProductionVariant` | `ProductionVariant.withInstanceType(String instanceType)`<br>The ML compute instance type. |
| `ProductionVariant` | `ProductionVariant.withManagedInstanceScaling(ProductionVariantManagedInstanceScaling managedInstanceScaling)`<br>Settings that control the range in the number of instances that the endpoint provisions as it scales up or down to accommodate traffic. |
| `ProductionVariant` | `ProductionVariant.withModelDataDownloadTimeoutInSeconds(Integer modelDataDownloadTimeoutInSeconds)`<br>The timeout value, in seconds, to download and extract the model that you want to host from Amazon S3 to the individual inference instance associated with this production variant. |
| `ProductionVariant` | `ProductionVariant.withModelName(String modelName)`<br>The name of the model that you want to host. |
| `ProductionVariant` | `ProductionVariant.withRoutingConfig(ProductionVariantRoutingConfig routingConfig)`<br>Settings that control how the endpoint routes incoming traffic to the instances that the endpoint hosts. |
| `ProductionVariant` | `ProductionVariant.withServerlessConfig(ProductionVariantServerlessConfig serverlessConfig)`<br>The serverless configuration for an endpoint. |
| `ProductionVariant` | `ProductionVariant.withVariantName(String variantName)`<br>The name of the production variant. |
| `ProductionVariant` | `ProductionVariant.withVolumeSizeInGB(Integer volumeSizeInGB)`<br>The size, in GB, of the ML storage volume attached to each individual inference instance associated with the production variant. |
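
Every method listed above returns `ProductionVariant`, which is what allows the `with*` calls to be chained. The following is a minimal sketch of that pattern, not SDK code: the nested `ProductionVariant` class is a hypothetical stand-in that mirrors only four of the listed signatures so that the example runs without the SDK on the classpath, and the names `FluentVariantSketch`, `primary`, `"AllTraffic"`, `"Canary"`, and `"my-model"` are illustrative assumptions.

```java
public class FluentVariantSketch {

    // Hypothetical stand-in for the SDK's ProductionVariant (NOT the real class).
    // It mirrors four of the fluent setters listed above plus clone().
    static class ProductionVariant implements Cloneable {
        String variantName;
        String modelName;
        String instanceType;
        Integer initialInstanceCount;

        ProductionVariant withVariantName(String variantName) {
            this.variantName = variantName;
            return this; // returning this is what makes chaining possible
        }

        ProductionVariant withModelName(String modelName) {
            this.modelName = modelName;
            return this;
        }

        ProductionVariant withInstanceType(String instanceType) {
            this.instanceType = instanceType;
            return this;
        }

        ProductionVariant withInitialInstanceCount(Integer initialInstanceCount) {
            this.initialInstanceCount = initialInstanceCount;
            return this;
        }

        @Override
        public ProductionVariant clone() {
            try {
                // Field-by-field copy; sufficient here because all fields are immutable.
                return (ProductionVariant) super.clone();
            } catch (CloneNotSupportedException e) {
                throw new IllegalStateException(e);
            }
        }
    }

    // Builds one variant with a single chained expression.
    static ProductionVariant primary() {
        return new ProductionVariant()
                .withVariantName("AllTraffic")
                .withModelName("my-model")
                .withInstanceType("ml.m5.large")
                .withInitialInstanceCount(1);
    }

    public static void main(String[] args) {
        ProductionVariant a = primary();
        // Clone first, then adjust the copy; the original keeps its name.
        ProductionVariant b = a.clone().withVariantName("Canary");
        System.out.println(a.variantName + " " + b.variantName + " " + b.modelName);
        // prints "AllTraffic Canary my-model"
    }
}
```

Cloning before calling a `with*` method matters because each `with*` call mutates the object it is invoked on rather than returning a new copy.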

| Modifier and Type | Method and Description |
|---|---|
| `List<ProductionVariant>` | `DescribeEndpointConfigResult.getProductionVariants()`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint. |
| `List<ProductionVariant>` | `CreateEndpointConfigRequest.getProductionVariants()`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint. |
| `List<ProductionVariant>` | `DescribeEndpointConfigResult.getShadowProductionVariants()`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on `ProductionVariants`. |
| `List<ProductionVariant>` | `CreateEndpointConfigRequest.getShadowProductionVariants()`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on `ProductionVariants`. |

| Modifier and Type | Method and Description |
|---|---|
| `DescribeEndpointConfigResult` | `DescribeEndpointConfigResult.withProductionVariants(ProductionVariant... productionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint. |
| `CreateEndpointConfigRequest` | `CreateEndpointConfigRequest.withProductionVariants(ProductionVariant... productionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint. |
| `DescribeEndpointConfigResult` | `DescribeEndpointConfigResult.withShadowProductionVariants(ProductionVariant... shadowProductionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on `ProductionVariants`. |
| `CreateEndpointConfigRequest` | `CreateEndpointConfigRequest.withShadowProductionVariants(ProductionVariant... shadowProductionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on `ProductionVariants`. |

| Modifier and Type | Method and Description |
|---|---|
| `void` | `DescribeEndpointConfigResult.setProductionVariants(Collection<ProductionVariant> productionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint. |
| `void` | `CreateEndpointConfigRequest.setProductionVariants(Collection<ProductionVariant> productionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint. |
| `void` | `DescribeEndpointConfigResult.setShadowProductionVariants(Collection<ProductionVariant> shadowProductionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on `ProductionVariants`. |
| `void` | `CreateEndpointConfigRequest.setShadowProductionVariants(Collection<ProductionVariant> shadowProductionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on `ProductionVariants`. |
| `DescribeEndpointConfigResult` | `DescribeEndpointConfigResult.withProductionVariants(Collection<ProductionVariant> productionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint. |
| `CreateEndpointConfigRequest` | `CreateEndpointConfigRequest.withProductionVariants(Collection<ProductionVariant> productionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint. |
| `DescribeEndpointConfigResult` | `DescribeEndpointConfigResult.withShadowProductionVariants(Collection<ProductionVariant> shadowProductionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on `ProductionVariants`. |
| `CreateEndpointConfigRequest` | `CreateEndpointConfigRequest.withShadowProductionVariants(Collection<ProductionVariant> shadowProductionVariants)`<br>An array of `ProductionVariant` objects, one for each model that you want to host at this endpoint in shadow mode with production traffic replicated from the model specified on `ProductionVariants`. |

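
`withProductionVariants` appears in two overloads, one taking `ProductionVariant...` and one taking `Collection<ProductionVariant>`. In the SDK's generated model classes, the varargs form appends to any existing list while the `Collection` forms (`set*` and `with*`) replace it. The sketch below illustrates that difference. It is not SDK code: both nested classes are hypothetical stand-ins that mirror the listed signatures so the example runs without the SDK, and `VariantListSketch`, `names`, and the variant names `"A"`, `"B"`, `"C"` are illustrative assumptions.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.List;

public class VariantListSketch {

    // Hypothetical stand-in: a variant reduced to its name (NOT the SDK class).
    static class ProductionVariant {
        final String variantName;

        ProductionVariant(String variantName) {
            this.variantName = variantName;
        }
    }

    // Hypothetical stand-in mirroring the listed accessor signatures.
    static class CreateEndpointConfigRequest {
        private List<ProductionVariant> productionVariants;

        List<ProductionVariant> getProductionVariants() {
            return productionVariants;
        }

        // Collection setter: replaces whatever was stored before.
        void setProductionVariants(Collection<ProductionVariant> variants) {
            this.productionVariants = (variants == null) ? null : new ArrayList<>(variants);
        }

        // Varargs overload: appends to the existing list, creating it if needed.
        CreateEndpointConfigRequest withProductionVariants(ProductionVariant... variants) {
            if (productionVariants == null) {
                productionVariants = new ArrayList<>(variants.length);
            }
            productionVariants.addAll(Arrays.asList(variants));
            return this;
        }

        // Collection overload: delegates to the setter, so it replaces.
        CreateEndpointConfigRequest withProductionVariants(Collection<ProductionVariant> variants) {
            setProductionVariants(variants);
            return this;
        }
    }

    // Helper that extracts variant names for easy comparison and printing.
    static List<String> names(CreateEndpointConfigRequest request) {
        List<String> out = new ArrayList<>();
        for (ProductionVariant v : request.getProductionVariants()) {
            out.add(v.variantName);
        }
        return out;
    }

    public static void main(String[] args) {
        CreateEndpointConfigRequest request = new CreateEndpointConfigRequest()
                .withProductionVariants(new ProductionVariant("A"))
                .withProductionVariants(new ProductionVariant("B")); // appended
        System.out.println(names(request)); // prints [A, B]

        List<ProductionVariant> replacement = Arrays.asList(new ProductionVariant("C"));
        request.withProductionVariants(replacement); // replaced
        System.out.println(names(request)); // prints [C]
    }
}
```

When building a request incrementally, the varargs overload is the convenient choice; when the full list is already known or must be reset, passing a `Collection` avoids leftover entries from earlier calls.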
Copyright © 2025. All rights reserved.