Interface ScalingPolicyMetric.Builder
-
- All Superinterfaces:
Buildable,CopyableBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>,SdkBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>,SdkPojo
- Enclosing class:
- ScalingPolicyMetric
public static interface ScalingPolicyMetric.Builder extends SdkPojo, CopyableBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>
-
-
Method Summary
All Methods Instance Methods Abstract Methods Modifier and Type Method Description ScalingPolicyMetric.BuilderinvocationsPerInstance(Integer invocationsPerInstance)The number of invocations sent to a model, normalized byInstanceCountin each ProductionVariant.ScalingPolicyMetric.BuildermodelLatency(Integer modelLatency)The interval of time taken by a model to respond as viewed from SageMaker.-
Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
-
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
-
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFields
-
-
-
-
Method Detail
-
invocationsPerInstance
ScalingPolicyMetric.Builder invocationsPerInstance(Integer invocationsPerInstance)
The number of invocations sent to a model, normalized by
InstanceCountin each ProductionVariant.1/numberOfInstancesis sent as the value on each request, wherenumberOfInstancesis the number of active instances for the ProductionVariant behind the endpoint at the time of the request.- Parameters:
invocationsPerInstance- The number of invocations sent to a model, normalized byInstanceCountin each ProductionVariant.1/numberOfInstancesis sent as the value on each request, wherenumberOfInstancesis the number of active instances for the ProductionVariant behind the endpoint at the time of the request.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
modelLatency
ScalingPolicyMetric.Builder modelLatency(Integer modelLatency)
The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
- Parameters:
modelLatency- The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
-