| Package | Description |
|---|---|
| com.amazonaws.services.sagemaker.model |
| Modifier and Type | Method and Description |
|---|---|
ProductionVariant |
ProductionVariant.clone() |
ProductionVariant |
ProductionVariant.withAcceleratorType(ProductionVariantAcceleratorType acceleratorType)
The size of the Elastic Inference (EI) instance to use for the production variant.
|
ProductionVariant |
ProductionVariant.withAcceleratorType(String acceleratorType)
The size of the Elastic Inference (EI) instance to use for the production variant.
|
ProductionVariant |
ProductionVariant.withContainerStartupHealthCheckTimeoutInSeconds(Integer containerStartupHealthCheckTimeoutInSeconds)
The timeout value, in seconds, for your inference container to pass health check by SageMaker Hosting.
|
ProductionVariant |
ProductionVariant.withCoreDumpConfig(ProductionVariantCoreDumpConfig coreDumpConfig)
Specifies configuration for a core dump from the model container when the process crashes.
|
ProductionVariant |
ProductionVariant.withInitialInstanceCount(Integer initialInstanceCount)
Number of instances to launch initially.
|
ProductionVariant |
ProductionVariant.withInitialVariantWeight(Float initialVariantWeight)
Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.
|
ProductionVariant |
ProductionVariant.withInstanceType(ProductionVariantInstanceType instanceType)
The ML compute instance type.
|
ProductionVariant |
ProductionVariant.withInstanceType(String instanceType)
The ML compute instance type.
|
ProductionVariant |
ProductionVariant.withModelDataDownloadTimeoutInSeconds(Integer modelDataDownloadTimeoutInSeconds)
The timeout value, in seconds, to download and extract the model that you want to host from Amazon S3 to the
individual inference instance associated with this production variant.
|
ProductionVariant |
ProductionVariant.withModelName(String modelName)
The name of the model that you want to host.
|
ProductionVariant |
ProductionVariant.withServerlessConfig(ProductionVariantServerlessConfig serverlessConfig)
The serverless configuration for an endpoint.
|
ProductionVariant |
ProductionVariant.withVariantName(String variantName)
The name of the production variant.
|
ProductionVariant |
ProductionVariant.withVolumeSizeInGB(Integer volumeSizeInGB)
The size, in GB, of the ML storage volume attached to individual inference instance associated with the
production variant.
|
| Modifier and Type | Method and Description |
|---|---|
List<ProductionVariant> |
CreateEndpointConfigRequest.getProductionVariants()
An list of
ProductionVariant objects, one for each model that you want to host at this endpoint. |
List<ProductionVariant> |
DescribeEndpointConfigResult.getProductionVariants()
An array of
ProductionVariant objects, one for each model that you want to host at this endpoint. |
| Modifier and Type | Method and Description |
|---|---|
CreateEndpointConfigRequest |
CreateEndpointConfigRequest.withProductionVariants(ProductionVariant... productionVariants)
An list of
ProductionVariant objects, one for each model that you want to host at this endpoint. |
DescribeEndpointConfigResult |
DescribeEndpointConfigResult.withProductionVariants(ProductionVariant... productionVariants)
An array of
ProductionVariant objects, one for each model that you want to host at this endpoint. |
| Modifier and Type | Method and Description |
|---|---|
void |
CreateEndpointConfigRequest.setProductionVariants(Collection<ProductionVariant> productionVariants)
An list of
ProductionVariant objects, one for each model that you want to host at this endpoint. |
void |
DescribeEndpointConfigResult.setProductionVariants(Collection<ProductionVariant> productionVariants)
An array of
ProductionVariant objects, one for each model that you want to host at this endpoint. |
CreateEndpointConfigRequest |
CreateEndpointConfigRequest.withProductionVariants(Collection<ProductionVariant> productionVariants)
An list of
ProductionVariant objects, one for each model that you want to host at this endpoint. |
DescribeEndpointConfigResult |
DescribeEndpointConfigResult.withProductionVariants(Collection<ProductionVariant> productionVariants)
An array of
ProductionVariant objects, one for each model that you want to host at this endpoint. |
Copyright © 2022. All rights reserved.