public static interface RecommendationJobStoppingConditions.Builder extends SdkPojo, CopyableBuilder<RecommendationJobStoppingConditions.Builder,RecommendationJobStoppingConditions>
| Modifier and Type | Method and Description |
|---|---|
| RecommendationJobStoppingConditions.Builder | flatInvocations(FlatInvocations flatInvocations) Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has reached capacity. |
| RecommendationJobStoppingConditions.Builder | flatInvocations(String flatInvocations) Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has reached capacity. |
| RecommendationJobStoppingConditions.Builder | maxInvocations(Integer maxInvocations) The maximum number of requests per minute expected for the endpoint. |
| RecommendationJobStoppingConditions.Builder | modelLatencyThresholds(Collection<ModelLatencyThreshold> modelLatencyThresholds) The interval of time taken by a model to respond as viewed from SageMaker. |
| RecommendationJobStoppingConditions.Builder | modelLatencyThresholds(Consumer<ModelLatencyThreshold.Builder>... modelLatencyThresholds) The interval of time taken by a model to respond as viewed from SageMaker. |
| RecommendationJobStoppingConditions.Builder | modelLatencyThresholds(ModelLatencyThreshold... modelLatencyThresholds) The interval of time taken by a model to respond as viewed from SageMaker. |
Inherited methods: equalsBySdkFields, sdkFields, copy, applyMutation, build
RecommendationJobStoppingConditions.Builder maxInvocations(Integer maxInvocations)
The maximum number of requests per minute expected for the endpoint.
maxInvocations - The maximum number of requests per minute expected for the endpoint.
RecommendationJobStoppingConditions.Builder modelLatencyThresholds(Collection<ModelLatencyThreshold> modelLatencyThresholds)
The interval of time taken by a model to respond as viewed from SageMaker. The interval includes the local communication time taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
modelLatencyThresholds - The interval of time taken by a model to respond as viewed from SageMaker. The interval includes the local communication time taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
RecommendationJobStoppingConditions.Builder modelLatencyThresholds(ModelLatencyThreshold... modelLatencyThresholds)
The interval of time taken by a model to respond as viewed from SageMaker. The interval includes the local communication time taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
modelLatencyThresholds - The interval of time taken by a model to respond as viewed from SageMaker. The interval includes the local communication time taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
RecommendationJobStoppingConditions.Builder modelLatencyThresholds(Consumer<ModelLatencyThreshold.Builder>... modelLatencyThresholds)
The interval of time taken by a model to respond as viewed from SageMaker. The interval includes the local communication time taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
This is a convenience method that creates an instance of the ModelLatencyThreshold.Builder, avoiding the need to create one manually via ModelLatencyThreshold.builder().
When the Consumer completes, SdkBuilder.build() is called immediately and its result is passed to #modelLatencyThresholds(List.
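The convenience overload described above can be pictured with a minimal, self-contained sketch. The classes below (ConsumerBuilderSketch, Threshold, ConditionsBuilder) are hypothetical stand-ins and not the SDK types; the percentile and valueInMilliseconds fields are assumptions chosen only for illustration. The sketch shows the general pattern: a fresh builder is created for each Consumer, the caller configures it, build() is called right away, and the results are handed to the collection overload.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.List;
import java.util.function.Consumer;

// Hypothetical stand-ins, NOT the SDK classes: they only mimic the builder shape.
final class ConsumerBuilderSketch {

    // Stand-in for ModelLatencyThreshold; both fields are assumptions for illustration.
    static final class Threshold {
        final String percentile;
        final Integer valueInMilliseconds;

        private Threshold(Builder b) {
            this.percentile = b.percentile;
            this.valueInMilliseconds = b.valueInMilliseconds;
        }

        static Builder builder() { return new Builder(); }

        static final class Builder {
            private String percentile;
            private Integer valueInMilliseconds;

            Builder percentile(String percentile) { this.percentile = percentile; return this; }
            Builder valueInMilliseconds(Integer value) { this.valueInMilliseconds = value; return this; }
            Threshold build() { return new Threshold(this); }
        }
    }

    // Stand-in for RecommendationJobStoppingConditions.Builder.
    static final class ConditionsBuilder {
        private List<Threshold> thresholds = Collections.emptyList();

        // Collection overload: the Consumer overload below funnels into this one.
        ConditionsBuilder modelLatencyThresholds(Collection<Threshold> thresholds) {
            this.thresholds = new ArrayList<>(thresholds);
            return this;
        }

        // Consumer overload: create a builder, let the caller configure it, build immediately.
        @SafeVarargs
        final ConditionsBuilder modelLatencyThresholds(Consumer<Threshold.Builder>... configurers) {
            List<Threshold> built = new ArrayList<>();
            for (Consumer<Threshold.Builder> configurer : configurers) {
                Threshold.Builder b = Threshold.builder();
                configurer.accept(b);
                built.add(b.build());
            }
            return modelLatencyThresholds(built);
        }

        List<Threshold> thresholds() { return thresholds; }
    }

    public static void main(String[] args) {
        // The caller never calls Threshold.builder() or build() directly.
        ConditionsBuilder conditions = new ConditionsBuilder()
                .modelLatencyThresholds(t -> t.percentile("P95").valueInMilliseconds(100));
        Threshold first = conditions.thresholds().get(0);
        System.out.println(first.percentile + " <= " + first.valueInMilliseconds + " ms");
    }
}
```

The design choice this illustrates is that the lambda form saves the caller from writing builder() and build() for every nested object, while the stored result is the same as with the collection overload.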
modelLatencyThresholds - a consumer that will call methods on ModelLatencyThreshold.Builder
See Also: #modelLatencyThresholds(java.util.Collection)
RecommendationJobStoppingConditions.Builder flatInvocations(String flatInvocations)
Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has
reached capacity. The default value is Stop. If you want the load test to continue after
invocations have flattened, set the value to Continue.
flatInvocations - Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has reached capacity. The default value is Stop. If you want the load test to continue after invocations have flattened, set the value to Continue.
See Also: FlatInvocations
RecommendationJobStoppingConditions.Builder flatInvocations(FlatInvocations flatInvocations)
Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has
reached capacity. The default value is Stop. If you want the load test to continue after
invocations have flattened, set the value to Continue.
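As a rough illustration of how the String and enum overloads of flatInvocations can relate, here is a self-contained sketch. FlatInvocationsSketch, Mode, ConditionsBuilder and flatInvocationsAsString are hypothetical names and not the SDK types; only the values Stop and Continue come from the text above. The sketch assumes the enum overload simply forwards the enum's string form to the String overload, which may differ from the real implementation.

```java
// Hypothetical stand-ins, NOT the SDK classes.
final class FlatInvocationsSketch {

    // Mirrors the two documented values, "Stop" (the default) and "Continue".
    enum Mode {
        CONTINUE("Continue"),
        STOP("Stop");

        private final String value;

        Mode(String value) { this.value = value; }

        @Override
        public String toString() { return value; }
    }

    // Stand-in for RecommendationJobStoppingConditions.Builder.
    static final class ConditionsBuilder {
        private String flatInvocations; // stored as the raw string (assumption)

        // String overload: accepts the raw value as given.
        ConditionsBuilder flatInvocations(String flatInvocations) {
            this.flatInvocations = flatInvocations;
            return this;
        }

        // Enum overload: type-safe, forwards the enum's string form (assumption).
        ConditionsBuilder flatInvocations(Mode flatInvocations) {
            return flatInvocations(flatInvocations == null ? null : flatInvocations.toString());
        }

        String flatInvocationsAsString() { return flatInvocations; }
    }

    public static void main(String[] args) {
        // Keep the load test running after invocations flatten.
        ConditionsBuilder viaEnum = new ConditionsBuilder().flatInvocations(Mode.CONTINUE);
        ConditionsBuilder viaString = new ConditionsBuilder().flatInvocations("Continue");
        System.out.println(viaEnum.flatInvocationsAsString()
                .equals(viaString.flatInvocationsAsString()));
    }
}
```

Under these assumptions the two overloads store the same value; the enum form guards against typos, while the String form can carry a value that the enum does not list.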
flatInvocations - Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has reached capacity. The default value is Stop. If you want the load test to continue after invocations have flattened, set the value to Continue.
See Also: FlatInvocations
Copyright © 2023. All rights reserved.