public static final class PredictionServiceGrpc.PredictionServiceFutureStub extends io.grpc.stub.AbstractFutureStub<PredictionServiceGrpc.PredictionServiceFutureStub>
A service for online predictions and explanations.
| Modifier and Type | Method and Description |
|---|---|
| protected PredictionServiceGrpc.PredictionServiceFutureStub | build(io.grpc.Channel channel, io.grpc.CallOptions callOptions) |
| com.google.common.util.concurrent.ListenableFuture<CountTokensResponse> | countTokens(CountTokensRequest request)<br>Perform token counting. |
| com.google.common.util.concurrent.ListenableFuture<DirectPredictResponse> | directPredict(DirectPredictRequest request)<br>Perform a unary online prediction request for Vertex first-party products and frameworks. |
| com.google.common.util.concurrent.ListenableFuture<DirectRawPredictResponse> | directRawPredict(DirectRawPredictRequest request)<br>Perform an online prediction request through gRPC. |
| com.google.common.util.concurrent.ListenableFuture<ExplainResponse> | explain(ExplainRequest request)<br>Perform an online explanation. |
| com.google.common.util.concurrent.ListenableFuture<PredictResponse> | predict(PredictRequest request)<br>Perform an online prediction. |
| com.google.common.util.concurrent.ListenableFuture<com.google.api.HttpBody> | rawPredict(RawPredictRequest request)<br>Perform an online prediction with an arbitrary HTTP payload. |
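Every RPC method on this stub returns a ListenableFuture, which extends java.util.concurrent.Future, so a caller can wait for the result with a deadline. The following is a minimal sketch of that pattern, not part of the generated API: the class `FutureAwaitSketch` and the method `awaitOrDefault` are hypothetical names, and a `CompletableFuture` stands in for the future a call such as `predict(request)` would return, so the example runs without gRPC on the classpath.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

/** Hypothetical helper (not part of PredictionServiceGrpc): waits on a Future with a deadline. */
final class FutureAwaitSketch {

    /**
     * Blocks for at most timeoutMillis. Returns the result if it arrives in time,
     * otherwise cancels the future and returns the fallback value.
     */
    static <T> T awaitOrDefault(Future<T> future, long timeoutMillis, T fallback) {
        try {
            return future.get(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // Assumption: for a future returned by a gRPC stub, cancelling it
            // is expected to cancel the underlying call as well.
            future.cancel(true);
            return fallback;
        } catch (ExecutionException e) {
            // The call itself failed; surface the underlying cause to the caller.
            throw new IllegalStateException("call failed", e.getCause());
        } catch (InterruptedException e) {
            // Restore the interrupt flag so callers can still observe it.
            Thread.currentThread().interrupt();
            return fallback;
        }
    }

    public static void main(String[] args) {
        // Stand-in for a completed RPC, e.g. the future returned by stub.predict(request).
        Future<String> done = CompletableFuture.completedFuture("prediction");
        System.out.println(awaitOrDefault(done, 100, "fallback"));

        // Stand-in for an RPC that never completes within the deadline.
        Future<String> pending = new CompletableFuture<>();
        System.out.println(awaitOrDefault(pending, 10, "fallback"));
    }
}
```

Blocking with a deadline is only one option. Because the stub returns a ListenableFuture rather than a plain Future, callers who prefer not to block can register a callback instead.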
protected PredictionServiceGrpc.PredictionServiceFutureStub build(io.grpc.Channel channel, io.grpc.CallOptions callOptions)
build in class io.grpc.stub.AbstractStub<PredictionServiceGrpc.PredictionServiceFutureStub>
public com.google.common.util.concurrent.ListenableFuture<PredictResponse> predict(PredictRequest request)
Perform an online prediction.
public com.google.common.util.concurrent.ListenableFuture<com.google.api.HttpBody> rawPredict(RawPredictRequest request)
Perform an online prediction with an arbitrary HTTP payload. The response includes the following HTTP headers:
* `X-Vertex-AI-Endpoint-Id`: ID of the [Endpoint][google.cloud.aiplatform.v1beta1.Endpoint] that served this prediction.
* `X-Vertex-AI-Deployed-Model-Id`: ID of the Endpoint's [DeployedModel][google.cloud.aiplatform.v1beta1.DeployedModel] that served this prediction.
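The two response headers can be read with a case-insensitive lookup, since HTTP header names are case-insensitive. The sketch below is illustrative only: this page does not say how the gRPC stub surfaces the headers, so a plain `Map<String, String>` stands in for whatever header or metadata container the caller has, and the class `VertexHeadersSketch` and its methods are hypothetical names.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;
import java.util.TreeMap;

/** Hypothetical helper (not part of the library): reads the Vertex AI response headers. */
final class VertexHeadersSketch {
    // Header names as documented for rawPredict.
    static final String ENDPOINT_ID = "X-Vertex-AI-Endpoint-Id";
    static final String DEPLOYED_MODEL_ID = "X-Vertex-AI-Deployed-Model-Id";

    /** Copies the headers into a map whose key lookup ignores case. */
    static Map<String, String> caseInsensitive(Map<String, String> headers) {
        Map<String, String> copy = new TreeMap<>(String.CASE_INSENSITIVE_ORDER);
        copy.putAll(headers);
        return copy;
    }

    /** ID of the Endpoint that served the prediction, if the header is present. */
    static Optional<String> endpointId(Map<String, String> headers) {
        return Optional.ofNullable(caseInsensitive(headers).get(ENDPOINT_ID));
    }

    /** ID of the DeployedModel that served the prediction, if the header is present. */
    static Optional<String> deployedModelId(Map<String, String> headers) {
        return Optional.ofNullable(caseInsensitive(headers).get(DEPLOYED_MODEL_ID));
    }

    public static void main(String[] args) {
        // Assumed input: headers already collected by the caller, here with lowercase keys.
        Map<String, String> headers = new HashMap<>();
        headers.put("x-vertex-ai-endpoint-id", "123");
        headers.put("x-vertex-ai-deployed-model-id", "456");
        System.out.println(endpointId(headers).orElse("unknown"));
        System.out.println(deployedModelId(headers).orElse("unknown"));
    }
}
```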
public com.google.common.util.concurrent.ListenableFuture<DirectPredictResponse> directPredict(DirectPredictRequest request)
Perform a unary online prediction request for Vertex first-party products and frameworks.
public com.google.common.util.concurrent.ListenableFuture<DirectRawPredictResponse> directRawPredict(DirectRawPredictRequest request)
Perform an online prediction request through gRPC.
public com.google.common.util.concurrent.ListenableFuture<ExplainResponse> explain(ExplainRequest request)
Perform an online explanation. If [deployed_model_id][google.cloud.aiplatform.v1beta1.ExplainRequest.deployed_model_id] is specified, the corresponding DeployedModel must have [explanation_spec][google.cloud.aiplatform.v1beta1.DeployedModel.explanation_spec] populated. If [deployed_model_id][google.cloud.aiplatform.v1beta1.ExplainRequest.deployed_model_id] is not specified, all DeployedModels must have [explanation_spec][google.cloud.aiplatform.v1beta1.DeployedModel.explanation_spec] populated.
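The precondition above can be expressed as a small predicate. The sketch below is a simplified model for illustration, not the service's implementation: `ExplainPreconditionSketch` and `canExplain` are hypothetical names, the endpoint is reduced to a map from deployed model ID to whether its explanation_spec is populated, and an empty string is treated as "deployed_model_id not specified" (the proto3 default for an unset string field).

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical model of the explain precondition (not the server's actual logic). */
final class ExplainPreconditionSketch {

    /**
     * @param hasExplanationSpec maps each DeployedModel ID on the endpoint to
     *                           whether its explanation_spec is populated
     * @param deployedModelId    the request's deployed_model_id; null or empty
     *                           means it was not specified
     * @return true if the documented precondition for explain is satisfied
     */
    static boolean canExplain(Map<String, Boolean> hasExplanationSpec, String deployedModelId) {
        if (deployedModelId != null && !deployedModelId.isEmpty()) {
            // A specific model was requested: only that model needs an explanation_spec.
            return hasExplanationSpec.getOrDefault(deployedModelId, false);
        }
        // No model requested: every DeployedModel must have an explanation_spec.
        return hasExplanationSpec.values().stream().allMatch(Boolean::booleanValue);
    }

    public static void main(String[] args) {
        Map<String, Boolean> endpoint = new LinkedHashMap<>();
        endpoint.put("model-a", true);   // explanation_spec populated
        endpoint.put("model-b", false);  // explanation_spec missing

        System.out.println(canExplain(endpoint, "model-a")); // requested model is explainable
        System.out.println(canExplain(endpoint, "model-b")); // requested model lacks a spec
        System.out.println(canExplain(endpoint, ""));        // not all models have a spec
    }
}
```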
public com.google.common.util.concurrent.ListenableFuture<CountTokensResponse> countTokens(CountTokensRequest request)
Perform token counting.
Copyright © 2024 Google LLC. All rights reserved.