A Private AI Services Container API Reference
Learn about the REST APIs available to use with Oracle Private AI Services Container.
models
Use GET requests to return a list of all currently deployed models.
Syntax
/v1/models
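As a rough illustration of calling this endpoint, the following Python sketch builds the request URL and extracts the model IDs from the "data" array of the response. The base URL and all function names are assumptions made for the example, and authentication is not handled.

```python
import json
import urllib.request
from typing import List

# Hypothetical address; replace with wherever your container is listening.
BASE_URL = "http://localhost:8080"


def models_url(base_url: str = BASE_URL) -> str:
    """Build the URL for GET /v1/models."""
    return base_url.rstrip("/") + "/v1/models"


def model_ids(payload: dict) -> List[str]:
    """Extract the model IDs from a parsed /v1/models response."""
    return [model["id"] for model in payload.get("data", [])]


def list_models(base_url: str = BASE_URL) -> List[str]:
    """Send GET /v1/models and return the IDs of the deployed models."""
    with urllib.request.urlopen(models_url(base_url)) as response:
        return model_ids(json.load(response))
```

Calling list_models() against a running container would return the deployed model IDs as a list of strings.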
Returns
A list of deployed models.
{
"data": [
{
"id": "h126414603234059290",
"modelSize": "string",
"modelDeployedTime": "2025-12-22T15:49:11.745Z",
"modelCapabilities": [
"TEXT_EMBEDDINGS"
]
}
]
}
models/{id}
Use GET requests to return information about a specific model.
Syntax
/v1/models/{id}
Parameters
id (string): A unique model name. This parameter is required.
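Because id becomes part of the URL path, a client may want to percent-encode it before sending the request. The Python sketch below shows one way to do that and to check a returned model for the TEXT_EMBEDDINGS capability. The base URL and function names are illustrative assumptions, not part of the API.

```python
from urllib.parse import quote

# Hypothetical address; replace with wherever your container is listening.
BASE_URL = "http://localhost:8080"


def model_url(model_id: str, base_url: str = BASE_URL) -> str:
    """Build the URL for GET /v1/models/{id}, percent-encoding the id."""
    if not model_id:
        raise ValueError("id is required")
    return base_url.rstrip("/") + "/v1/models/" + quote(model_id, safe="")


def supports_text_embeddings(model: dict) -> bool:
    """Check the modelCapabilities list of a parsed model response."""
    return "TEXT_EMBEDDINGS" in model.get("modelCapabilities", [])
```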
Returns
Information about the model specified by model ID.
{
"id": "L55808652807957200809612118083123839056757756025",
"modelSize": "string",
"modelDeployedTime": "2025-12-22T16:52:28.365Z",
"modelCapabilities": [
"TEXT_EMBEDDINGS"
]
}
embeddings
Use POST requests to generate embeddings for the input using a deployed model.
Syntax
/v1/embeddings
Parameters
x-convert-images (boolean): Indicates whether images in the input list require conversion to JPG. The default value is false.
Example Input
Note that input can be a string or an array of strings.
{
"input": "string",
"model": "string"
}
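The request body above can be assembled programmatically. The following Python sketch prepares a POST request that accepts either a single string or a list of strings as input. It assumes that x-convert-images is sent as an HTTP header, which the x- prefix suggests but this reference does not state. The base URL and function name are also assumptions.

```python
import json
import urllib.request
from typing import List, Union

# Hypothetical address; replace with wherever your container is listening.
BASE_URL = "http://localhost:8080"


def build_embeddings_request(
    model: str,
    text_input: Union[str, List[str]],
    convert_images: bool = False,
    base_url: str = BASE_URL,
) -> urllib.request.Request:
    """Prepare (but do not send) a POST /v1/embeddings request."""
    is_string_list = isinstance(text_input, list) and all(
        isinstance(item, str) for item in text_input
    )
    if not (isinstance(text_input, str) or is_string_list):
        raise TypeError("input must be a string or a list of strings")

    body = json.dumps({"input": text_input, "model": model}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if convert_images:
        # Assumption: the parameter is passed as an HTTP header.
        headers["x-convert-images"] = "true"
    return urllib.request.Request(
        base_url.rstrip("/") + "/v1/embeddings",
        data=body,
        headers=headers,
        method="POST",
    )
```

The prepared request can then be sent with urllib.request.urlopen(request).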
Example Output
- Embedding results:
{
"data": [
{
"embedding": [
0
],
"index": 0
}
],
"model": "string"
}
- 400: Error processing the request data.
- 404: Model not found.
- 500: An error occurred during the score operation for this model.
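A client typically needs to pull the vectors out of the "data" array and translate error codes into messages. The Python sketch below does both. It assumes that "index" identifies the position of the corresponding input, which this reference does not state explicitly, and the function names are illustrative.

```python
from typing import Dict, List

# Error codes and messages as listed for the embeddings endpoint.
ERROR_MESSAGES: Dict[int, str] = {
    400: "Error processing the request data.",
    404: "Model not found.",
    500: "An error occurred during the score operation for this model.",
}


def vectors_by_index(payload: dict) -> List[List[float]]:
    """Return the embedding vectors ordered by their "index" field."""
    items = sorted(payload["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def describe_error(status: int) -> str:
    """Map an HTTP status code to the documented error message."""
    return ERROR_MESSAGES.get(status, "Unexpected status " + str(status))
```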
health
Use GET requests to verify that the container is ready to use.
Syntax
/health
Example Output
- 200: Private AI Services Container is up and running.
- 401: Unauthorized
- 500: Internal server error
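A startup script might poll this endpoint until it returns 200. The Python sketch below is one possible approach. The base URL, retry count, and delay are arbitrary assumptions, and the probe is passed in as a function so the retry loop can be exercised without a running container.

```python
import time
import urllib.error
import urllib.request
from typing import Callable

# Hypothetical address; replace with wherever your container is listening.
BASE_URL = "http://localhost:8080"


def health_status(base_url: str = BASE_URL) -> int:
    """Send GET /health and return the HTTP status code.

    Connection failures (urllib.error.URLError) are not caught here.
    """
    try:
        with urllib.request.urlopen(base_url.rstrip("/") + "/health", timeout=5) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 401 and 500 responses arrive as HTTPError; report their code.
        return err.code


def wait_until_ready(
    probe: Callable[[], int],
    attempts: int = 10,
    delay_seconds: float = 1.0,
) -> bool:
    """Call probe() until it returns 200 or the attempts run out."""
    for attempt in range(attempts):
        if probe() == 200:
            return True
        if attempt < attempts - 1:
            time.sleep(delay_seconds)
    return False
```

For example, wait_until_ready(health_status) would poll the default address up to ten times.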
metrics
Use GET requests to return a list of metric names exposed by the application.
Syntax
/metrics
Returns
A list of metric names exposed by the application.
Example Output
- Successful response with metric names:
{
"names": [
"embeddings_call_error_total",
"embeddings_call_latency",
"embeddings_call_success_total",
"embeddings_call_total",
"embeddings_last_latency",
"http.server.requests",
"jvm.memory.used",
"process.cpu.usage",
"system.cpu.usage"
]
}
- 401: Unauthorized
- 500: Internal server error
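Since the response is a flat list of names, a client may want to narrow it down, for instance to the embeddings-related metrics. A minimal Python sketch follows, with an illustrative function name and a sample payload taken from the output above.

```python
from typing import List


def names_with_prefix(payload: dict, prefix: str) -> List[str]:
    """Return the metric names from a parsed /metrics response that start with prefix."""
    return [name for name in payload.get("names", []) if name.startswith(prefix)]


# Shortened sample of the documented response.
sample = {
    "names": [
        "embeddings_call_error_total",
        "embeddings_call_total",
        "http.server.requests",
        "jvm.memory.used",
    ]
}
embedding_metrics = names_with_prefix(sample, "embeddings_")
```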
metrics/{metricName}
Use GET requests to return detailed information for a metric, including measurements and available tags. Supports optional tag filters using repeated tag query parameters in the form key:value.
Syntax
/metrics/{metricName}
Parameters
metricName (string): Metric name as returned by GET /metrics. This parameter is required.
tag (array<string>): Tag filter(s) in the form key:value. Repeat for multiple tags.
Example Output
- Metric details:
{
"name": "embeddings_call_error_total",
"description": "Total number of errors from embeddings calls.",
"baseUnit": "count",
"measurements": [
{
"statistic": "COUNT",
"value": 3
}
],
"availableTags": [
{
"tag": "model"
},
{
"tag": "status",
"values": [
"success",
"error"
]
}
]
}
- 400: Invalid tag filter
- 404: Metric not found
- 500: Internal server error
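The repeated tag query parameter can be built with the Python standard library. The sketch below constructs a URL with one tag=key:value pair per filter and reads a single statistic from the response. The base URL and function names are assumptions for illustration.

```python
from typing import List, Optional, Tuple
from urllib.parse import quote, urlencode

# Hypothetical address; replace with wherever your container is listening.
BASE_URL = "http://localhost:8080"


def metric_url(
    metric_name: str,
    tags: Optional[List[Tuple[str, str]]] = None,
    base_url: str = BASE_URL,
) -> str:
    """Build the URL for GET /metrics/{metricName} with optional tag filters."""
    url = base_url.rstrip("/") + "/metrics/" + quote(metric_name, safe="")
    if tags:
        # One "tag" parameter per filter, each in the form key:value.
        pairs = [("tag", key + ":" + value) for key, value in tags]
        url += "?" + urlencode(pairs, safe=":")
    return url


def measurement_value(payload: dict, statistic: str) -> Optional[float]:
    """Return the value of the named statistic, or None if it is absent."""
    for measurement in payload.get("measurements", []):
        if measurement.get("statistic") == statistic:
            return measurement.get("value")
    return None
```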
api
Use GET requests to return the OpenAPI specification for this API in YAML format.
Syntax
/v1/api
Returns
Returns the OpenAPI (YAML) document as a string.