public static final class PredictionServiceGrpc.PredictionServiceBlockingStub extends io.grpc.stub.AbstractBlockingStub<PredictionServiceGrpc.PredictionServiceBlockingStub>
A service for online predictions and explanations.
| Modifier and Type | Method and Description |
|---|---|
| `protected PredictionServiceGrpc.PredictionServiceBlockingStub` | `build(io.grpc.Channel channel, io.grpc.CallOptions callOptions)` |
| `CountTokensResponse` | `countTokens(CountTokensRequest request)` Perform token counting. |
| `DirectPredictResponse` | `directPredict(DirectPredictRequest request)` Perform a unary online prediction request for Vertex first-party products and frameworks. |
| `DirectRawPredictResponse` | `directRawPredict(DirectRawPredictRequest request)` Perform an online prediction request through gRPC. |
| `ExplainResponse` | `explain(ExplainRequest request)` Perform an online explanation. |
| `PredictResponse` | `predict(PredictRequest request)` Perform an online prediction. |
| `com.google.api.HttpBody` | `rawPredict(RawPredictRequest request)` Perform an online prediction with an arbitrary HTTP payload. |
| `Iterator<StreamingPredictResponse>` | `serverStreamingPredict(StreamingPredictRequest request)` Perform a server-side streaming online prediction request for Vertex LLM streaming. |
| `Iterator<GenerateContentResponse>` | `streamGenerateContent(GenerateContentRequest request)` Generate content with multimodal inputs with streaming support. |
protected PredictionServiceGrpc.PredictionServiceBlockingStub build(io.grpc.Channel channel, io.grpc.CallOptions callOptions)
Specified by: build in class io.grpc.stub.AbstractStub<PredictionServiceGrpc.PredictionServiceBlockingStub>
public PredictResponse predict(PredictRequest request)
Perform an online prediction.
public com.google.api.HttpBody rawPredict(RawPredictRequest request)
Perform an online prediction with an arbitrary HTTP payload. The response includes the following HTTP headers:
* `X-Vertex-AI-Endpoint-Id`: ID of the [Endpoint][google.cloud.aiplatform.v1beta1.Endpoint] that served this prediction.
* `X-Vertex-AI-Deployed-Model-Id`: ID of the Endpoint's [DeployedModel][google.cloud.aiplatform.v1beta1.DeployedModel] that served this prediction.
public DirectPredictResponse directPredict(DirectPredictRequest request)
Perform a unary online prediction request for Vertex first-party products and frameworks.
public DirectRawPredictResponse directRawPredict(DirectRawPredictRequest request)
Perform an online prediction request through gRPC.
public Iterator<StreamingPredictResponse> serverStreamingPredict(StreamingPredictRequest request)
Perform a server-side streaming online prediction request for Vertex LLM streaming.
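The server-streaming methods on this stub return a plain `java.util.Iterator`, so the caller consumes responses by looping until the iterator is exhausted. The following is a minimal sketch of that loop, not part of the generated API: `StreamDrain` and `drain` are hypothetical names, and any `Iterator` can stand in for the one returned by `serverStreamingPredict` or `streamGenerateContent`. With a real stub, `hasNext()` is expected to block until the next message arrives or the stream ends.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.function.Consumer;

// Hypothetical helper for illustration; not part of PredictionServiceGrpc.
final class StreamDrain {
    private StreamDrain() {}

    /**
     * Consumes every element of a response iterator in arrival order,
     * hands each one to onChunk, and returns the elements that were seen.
     */
    static <T> List<T> drain(Iterator<T> responses, Consumer<? super T> onChunk) {
        List<T> seen = new ArrayList<>();
        // Assumption: with a blocking gRPC stub, hasNext() waits for the
        // next streamed message and returns false once the stream completes.
        while (responses.hasNext()) {
            T chunk = responses.next();
            onChunk.accept(chunk); // e.g. print or append partial output
            seen.add(chunk);
        }
        return seen;
    }
}
```

Under these assumptions a caller could pass the iterator straight through, for example `StreamDrain.drain(stub.serverStreamingPredict(request), System.out::println)`, where `stub` and `request` are built elsewhere.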
public ExplainResponse explain(ExplainRequest request)
Perform an online explanation. If [deployed_model_id][google.cloud.aiplatform.v1beta1.ExplainRequest.deployed_model_id] is specified, the corresponding DeployedModel must have [explanation_spec][google.cloud.aiplatform.v1beta1.DeployedModel.explanation_spec] populated. If [deployed_model_id][google.cloud.aiplatform.v1beta1.ExplainRequest.deployed_model_id] is not specified, all DeployedModels must have [explanation_spec][google.cloud.aiplatform.v1beta1.DeployedModel.explanation_spec] populated.
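The precondition above can be read as a small predicate over the endpoint's deployed models. The sketch below restates it with stand-in types. `ExplainPrecondition`, `Deployed`, and `canExplain` are hypothetical names, and `Optional<String>` models "specified or not specified" for `deployed_model_id`. It illustrates the documented rule only and says nothing about how the service enforces it.

```java
import java.util.List;
import java.util.Optional;

// Stand-in types for illustration only; the real messages are protobuf classes.
final class ExplainPrecondition {
    private ExplainPrecondition() {}

    /** Minimal stand-in for a DeployedModel: its ID and whether explanation_spec is populated. */
    static final class Deployed {
        final String id;
        final boolean hasExplanationSpec;

        Deployed(String id, boolean hasExplanationSpec) {
            this.id = id;
            this.hasExplanationSpec = hasExplanationSpec;
        }
    }

    /**
     * Returns true when the documented precondition for explain holds:
     * - deployed_model_id specified: that model must have explanation_spec;
     * - deployed_model_id not specified: every model must have explanation_spec.
     */
    static boolean canExplain(List<Deployed> deployedModels, Optional<String> deployedModelId) {
        if (deployedModelId.isPresent()) {
            return deployedModels.stream()
                    .filter(m -> m.id.equals(deployedModelId.get()))
                    .findFirst()
                    .map(m -> m.hasExplanationSpec)
                    .orElse(false); // assumption: an unknown ID is treated as not explainable
        }
        // Note: allMatch is vacuously true for an empty list; this sketch
        // does not model an endpoint with no deployed models.
        return deployedModels.stream().allMatch(m -> m.hasExplanationSpec);
    }
}
```

A client-side check like this can fail fast before issuing the RPC, but the server's response remains the authority on whether an explanation is available.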
public CountTokensResponse countTokens(CountTokensRequest request)
Perform token counting.
public Iterator<GenerateContentResponse> streamGenerateContent(GenerateContentRequest request)
Generate content with multimodal inputs with streaming support.
Copyright © 2024 Google LLC. All rights reserved.