- generate() - Method in class ai.djl.serving.wlm.WorkerIdGenerator
-
generate a new worker id.
- getBatchSize() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns the batch size.
- getBatchSize() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the default batchSize for workers.
- getBatchSize() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the configured batch size.
- getBytes(ByteBuf) - Static method in class ai.djl.serving.util.NettyUtils
-
Returns the bytes for the specified ByteBuf.
- getChildGroup() - Method in class ai.djl.serving.util.ServerGroups
-
Returns the client event loop group.
- getClientChannel() - Method in class ai.djl.serving.util.Connector
-
Returns the client channel class.
- getCode() - Method in class ai.djl.serving.http.ErrorResponse
-
Returns the error code.
- getConfigFile() - Method in class ai.djl.serving.Arguments
-
Returns the configuration file path.
- getConnector(Connector.ConnectorType) - Method in class ai.djl.serving.util.ConfigManager
-
Returns the models server socket connector.
- getCorsAllowedHeaders() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the CORS allowed headers setting.
- getCorsAllowedMethods() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the CORS allowed method setting.
- getCorsAllowedOrigin() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the CORS allowed origin setting.
- getDefaultWorkers() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the default number of workers for a new registered model.
- getId() - Method in class ai.djl.serving.http.DescribeModelResponse.Worker
-
Returns the worker's ID.
- getInput() - Method in class ai.djl.serving.wlm.Job
-
Returns the input data.
- getInstance() - Static method in class ai.djl.serving.util.ConfigManager
-
Returns the singleton ConfigManager instance.
- getInstance() - Static method in class ai.djl.serving.wlm.ModelManager
-
Returns the singleton ModelManager instance.
- getIntParameter(QueryStringDecoder, String, int) - Static method in class ai.djl.serving.util.NettyUtils
-
Read the parameter's integer value for the key from the uri.
- getIoRatio() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the native IO ratio.
- getJobQueueSize() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the default job queue size.
- getLoadModels() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the model urls that to be loaded at startup.
- getMaxBatchDelay() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns the maximum delay in milliseconds to aggregate a batch.
- getMaxBatchDelay() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the default maxBatchDelay for the working queue.
- getMaxBatchDelay() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the maximum delay in milliseconds to aggregate a batch.
- getMaxIdleTime() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns the maximum idle time for worker threads.
- getMaxIdleTime() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the default max idle time for workers.
- getMaxIdleTime() - Method in class ai.djl.serving.wlm.ModelInfo
-
returns the configured maxIdleTime of workers.
- getMaxRequestSize() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the maximum allowed request size in bytes.
- getMaxWorkers() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns the desired maximum number of workers.
- getMaxWorkers() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the configured maximum number of workers.
- getMessage() - Method in class ai.djl.serving.http.ErrorResponse
-
Returns the error message.
- getMinWorkers() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns the desired minimum number of workers.
- getMinWorkers() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the configured minimum number of workers.
- getModel() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the loaded ZooModel.
- getModelDir() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the model cache directory.
- getModelName() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns the model name.
- getModelName() - Method in class ai.djl.serving.http.ListModelsResponse.ModelItem
-
Returns the model name.
- getModelName() - Method in class ai.djl.serving.wlm.Job
-
Returns the model name that associated with this job.
- getModelName() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the model name.
- getModels() - Method in class ai.djl.serving.Arguments
-
Returns the model urls that specified in command line.
- getModels() - Method in class ai.djl.serving.http.ListModelsResponse
-
Returns a list of models.
- getModels() - Method in class ai.djl.serving.wlm.ModelManager
-
Returns the registry of all models.
- getModelServerHome() - Static method in class ai.djl.serving.util.ConfigManager
-
Returns the model server home directory.
- getModelStore() - Method in class ai.djl.serving.Arguments
-
Returns the model store location.
- getModelStore() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the model store location.
- getModelUrl() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns the model URL.
- getModelUrl() - Method in class ai.djl.serving.http.ListModelsResponse.ModelItem
-
Returns the model url.
- getModelUrl() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the model url.
- getModelUrlPattern() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the allowed model url pattern regex.
- getNettyClientThreads() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the configured netty client threads.
- getNettyThreads() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the configured netty threads.
- getNextPageToken() - Method in class ai.djl.serving.http.ListModelsResponse
-
Returns the next page token.
- getNumberOfGpu() - Method in class ai.djl.serving.util.ConfigManager
-
Returns the number of GPUs to be used.
- getOptions() - Static method in class ai.djl.serving.Arguments
-
Builds the command line options.
- getParameter(QueryStringDecoder, String, String) - Static method in class ai.djl.serving.util.NettyUtils
-
Reads the parameter's value for the key from the uri.
- getPluginFolder() - Method in class ai.djl.serving.util.ConfigManager
-
return the folder where the model search for plugins.
- getProperty(String, String) - Method in class ai.djl.serving.util.ConfigManager
-
Returns the value with the specified key in this configuration.
- getQueueSize() - Method in class ai.djl.serving.wlm.ModelInfo
-
Returns the configured size of the workers queue.
- getRequestId() - Method in class ai.djl.serving.http.Session
-
Returns the request ID.
- getRequestId(Channel) - Static method in class ai.djl.serving.util.NettyUtils
-
Returns the request ID for the specified channel.
- getRequestId() - Method in class ai.djl.serving.wlm.Job
-
Returns the request id.
- getServerChannel() - Method in class ai.djl.serving.util.Connector
-
Returns the server channel class.
- getServerGroup() - Method in class ai.djl.serving.util.ServerGroups
-
Returns the server event loop group.
- getSocketAddress() - Method in class ai.djl.serving.util.Connector
-
Returns the TCP socket listening address.
- getSocketPath() - Method in class ai.djl.serving.util.Connector
-
Returns the unix domain socket path.
- getSocketType() - Method in class ai.djl.serving.util.Connector
-
Returns the socket type.
- getSslContext() - Method in class ai.djl.serving.util.ConfigManager
-
Returns a SSLContext instance.
- getStartTime() - Method in class ai.djl.serving.http.DescribeModelResponse.Worker
-
Returns the worker's start time.
- getStartupModels() - Method in class ai.djl.serving.wlm.ModelManager
-
Returns a set of models that was loaded at startup.
- getStatus() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns the model's status.
- getStatus() - Method in class ai.djl.serving.http.DescribeModelResponse.Worker
-
Returns the worker's status.
- getStatus() - Method in class ai.djl.serving.http.StatusResponse
-
Returns the status.
- getType() - Method in class ai.djl.serving.http.ErrorResponse
-
Returns the error type.
- getType() - Method in class ai.djl.serving.util.Connector
-
Returns the type of the connector.
- getWorkers() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns all workers information of the model.
- GpuAssignmentStrategy - Interface in ai.djl.serving.wlm
-
Strategies to find next available GpuID.
- InferenceRequestHandler - Class in ai.djl.serving.http
-
A class handling inbound HTTP requests for the management API.
- InferenceRequestHandler() - Constructor for class ai.djl.serving.http.InferenceRequestHandler
-
default constructor.
- inferModelNameFromUrl(String) - Static method in class ai.djl.serving.wlm.ModelInfo
-
Infer model name form model URL in case model name is not provided.
- init(Arguments) - Static method in class ai.djl.serving.util.ConfigManager
-
Initialize the global ConfigManager instance.
- init(ConfigManager) - Static method in class ai.djl.serving.wlm.ModelManager
-
Initialized the global ModelManager instance.
- initChannel(Channel) - Method in class ai.djl.serving.ServerInitializer
- InternalServerException - Exception in ai.djl.serving.http
-
Thrown when an internal server failure occurs.
- InternalServerException(String) - Constructor for exception ai.djl.serving.http.InternalServerException
-
Constructs an InternalServerException with the specified detail message.
- InternalServerException(String, Throwable) - Constructor for exception ai.djl.serving.http.InternalServerException
-
Constructs an BadRequestException with the specified detail message and cause.
- InvalidRequestHandler - Class in ai.djl.serving.http
-
A class handling unhandled inbound HTTP requests.
- InvalidRequestHandler() - Constructor for class ai.djl.serving.http.InvalidRequestHandler
-
- isDebug() - Method in class ai.djl.serving.util.ConfigManager
-
Returns if debug is enabled.
- isFinished() - Method in class ai.djl.serving.wlm.PermanentBatchAggregator
-
Checks if this BatchAggregator and the thread can be shutdown or if this aggregator
waits for more data.
- isFinished() - Method in class ai.djl.serving.wlm.TemporaryBatchAggregator
-
Checks if this BatchAggregator and the thread can be shutdown or if this aggregator
waits for more data.
- isGpu() - Method in class ai.djl.serving.http.DescribeModelResponse.Worker
-
Return if the worker using GPU.
- isLoadedAtStartup() - Method in class ai.djl.serving.http.DescribeModelResponse
-
Returns if the models was loaded at startup.
- isRunning() - Method in class ai.djl.serving.ModelServer
-
Return if the server is running.
- isSsl() - Method in class ai.djl.serving.util.Connector
-
Return if the connector requires SSL.
- isUds() - Method in class ai.djl.serving.util.Connector
-
Returns if the connector is using unix domain socket.
- ScaleCapacityExceededException - Exception in ai.djl.serving.wlm
-
Is thrown when capacity of workers is reached during autoscaling.
- ScaleCapacityExceededException() - Constructor for exception ai.djl.serving.wlm.ScaleCapacityExceededException
-
No arguments.
- ScaleCapacityExceededException(String) - Constructor for exception ai.djl.serving.wlm.ScaleCapacityExceededException
-
construct using a message.
- ScaleCapacityExceededException(Throwable) - Constructor for exception ai.djl.serving.wlm.ScaleCapacityExceededException
-
construct using a cause.
- ScaleCapacityExceededException(String, Throwable) - Constructor for exception ai.djl.serving.wlm.ScaleCapacityExceededException
-
construct using a message and a clause.
- ScaleCapacityExceededException(String, Throwable, boolean, boolean) - Constructor for exception ai.djl.serving.wlm.ScaleCapacityExceededException
-
construct using a message cause and flags.
- scaleWorkers(int, int) - Method in class ai.djl.serving.wlm.ModelInfo
-
Sets new workers capcities for this model and returns a new configured ModelInfo object.
- sendError(ChannelHandlerContext, Throwable) - Static method in class ai.djl.serving.util.NettyUtils
-
Sends error to client with exception.
- sendError(ChannelHandlerContext, HttpResponseStatus, Throwable) - Static method in class ai.djl.serving.util.NettyUtils
-
Sends error to client with HTTP status and exception.
- sendError(HttpResponseStatus, String) - Method in class ai.djl.serving.wlm.Job
-
Sends error to the client.
- sendHttpResponse(ChannelHandlerContext, FullHttpResponse, boolean) - Static method in class ai.djl.serving.util.NettyUtils
-
Send HTTP response to client.
- sendJsonResponse(ChannelHandlerContext, Object) - Static method in class ai.djl.serving.util.NettyUtils
-
Sends the json object to client.
- sendJsonResponse(ChannelHandlerContext, Object, HttpResponseStatus) - Static method in class ai.djl.serving.util.NettyUtils
-
Sends the json string to client with specified status.
- sendJsonResponse(ChannelHandlerContext, String) - Static method in class ai.djl.serving.util.NettyUtils
-
Sends the json string to client.
- sendJsonResponse(ChannelHandlerContext, String, HttpResponseStatus) - Static method in class ai.djl.serving.util.NettyUtils
-
Sends the json object to client with specified status.
- sendOutput(Output) - Method in class ai.djl.serving.wlm.Job
-
Sends the response back to the client.
- ServerGroups - Class in ai.djl.serving.util
-
A class that holds all opened ChannelGroup and EventLoopGroup.
- ServerGroups(ConfigManager) - Constructor for class ai.djl.serving.util.ServerGroups
-
Constructs a new ServerGroups instance.
- ServerInitializer - Class in ai.djl.serving
-
A special ChannelInboundHandler which offers an easy way to initialize a
Channel once it was registered to its EventLoop.
- ServerInitializer(SslContext, Connector.ConnectorType, FolderScanPluginManager) - Constructor for class ai.djl.serving.ServerInitializer
-
Creates a new HttpRequestHandler instance.
- ServiceUnavailableException - Exception in ai.djl.serving.http
-
Thrown when the server is unable to serve the HTTP request.
- ServiceUnavailableException(String) - Constructor for exception ai.djl.serving.http.ServiceUnavailableException
-
Constructs an ServiceUnavailableException with the specified detail message.
- Session - Class in ai.djl.serving.http
-
A class that holds HTTP session information.
- Session(String, HttpRequest) - Constructor for class ai.djl.serving.http.Session
-
Constructs a new Session instance.
- setBatchSize(int) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the batch size.
- setCode(int) - Method in class ai.djl.serving.http.Session
-
Sets the HTTP response code.
- setGpu(boolean) - Method in class ai.djl.serving.http.DescribeModelResponse.Worker
-
Sets if the worker using GPU.
- setId(int) - Method in class ai.djl.serving.http.DescribeModelResponse.Worker
-
Sets the worker's ID.
- setLoadedAtStartup(boolean) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the load at startup status.
- setMaxBatchDelay(int) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the maximum delay in milliseconds to aggregate a batch.
- setMaxIdleTime(int) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the max idle time for worker threads.
- setMaxWorkers(int) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the desired maximum number of workers.
- setMinWorkers(int) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the desired minimum number of workers.
- setModelName(String) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the model name.
- setModelUrl(String) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the model URL.
- setNextPageToken(String) - Method in class ai.djl.serving.http.ListModelsResponse
-
Sets the next page token.
- setScheduled() - Method in class ai.djl.serving.wlm.Job
-
Marks the job has been scheduled.
- setStartTime(Date) - Method in class ai.djl.serving.http.DescribeModelResponse.Worker
-
Sets the worker's start time.
- setStatus(String) - Method in class ai.djl.serving.http.DescribeModelResponse
-
Sets the model's status.
- setStatus(String) - Method in class ai.djl.serving.http.DescribeModelResponse.Worker
-
Sets the worker's status.
- shutdown(boolean) - Method in class ai.djl.serving.util.ServerGroups
-
Shutdowns all opened channels and event loops.
- start() - Method in class ai.djl.serving.ModelServer
-
Main Method that prepares the future for the channel and sets up the ServerBootstrap.
- startAndWait() - Method in class ai.djl.serving.ModelServer
-
Starts the model server and block until server stops.
- StatusResponse - Class in ai.djl.serving.http
-
A class that holds model server status.
- StatusResponse() - Constructor for class ai.djl.serving.http.StatusResponse
-
Constructs a new StatusResponse instance.
- StatusResponse(String) - Constructor for class ai.djl.serving.http.StatusResponse
-
Constructs a new StatusResponse instance with status line.
- stop() - Method in class ai.djl.serving.ModelServer
-
Stops the model server.