Run a Python Function on Grouped Data with Script Owner Specified
post
/api/py-scripts/v1/group-apply/{scriptName}/{ownerName}
Run a Python function on data grouped by column values with script owner specified.
Request
Supported Media Types
- application/json
Path Parameters
-
ownerName: string
The owner of the Python script
-
scriptName: string
The name of the Python script
A JSON str of (name - value) pairs specifying arguments of the request and additional arguments
Root Schema : EmbedScriptComputeGrp
Type:
Show Source
object
-
asyncFlag(optional):
boolean
Whether to execute the job asynchronously.
-
envName(optional):
string
Name of the conda environment.
-
graphicsFlag(optional):
boolean
Whether to capture images rendered in the script to the result.
-
groupBy:
string
Columns to group by. To set a list of columns, provide a comma-separated list.
-
input:
string
Query statement specifying input data.
-
orderBy(optional):
string
Columns to order by. To set a list of columns, provide a comma-separated list.
-
parallel(optional):
integer(int32)
Degree of parallel.
-
parallelFlag(optional):
boolean
Whether to execute job in parallelism.
-
parameters(optional):
string
A JSON str of (name - value) pairs specifying keyword arguments to be passed into the script.
-
service(optional):
string
Allowed Values:
[ "LOW", "MEDIUM", "HIGH" ]
LEVEL of the service. Default is LOW. -
timeout(optional):
integer(int32)
Minimum Value:
1800
Maximum Value:43200
The timeout limit (in seconds, default 1800s) of asynchronous job. Must be used together with `asyncFlag`=true.
Example Request (application/json)
{"input":"select * from GRADE", "groupBy":"GENDER", "orderBy":"GENDER, STATUS", "parameters":"{\"oml_input_type\":\"default\"}", "parallelFlag":true, "asyncFlag":true, "timeout":3600}
Response
Supported Media Types
- application/json
200 Response
By default, returns the job result.
201 Response
If asyncFlag=True, returns the location header where the status of the job can be fetched.
Headers
400 Response
Invalid parameters specified, output exceeding size limit or other script execution error.
500 Response
Problem connecting to Broker, executing job or other unexpected error.
Examples
The following example runs the script named group_count. and specifies the script owner.
curl -i -X POST --header "Authorization: Bearer ${token}" \
--header 'Content-Type: application/json' --header 'Accept: application/json' \
-d '{"input":"select * from IRIS", "parameters":"{\"oml_input_type\":\"pandas.DataFrame\"}", "groupBy":"Species", "orderBy":"Sepal_Length"}' \
"<oml-cloud-service-location-url>/oml/api/py-scripts/v1/group-apply/group_count/<owner_name>"
Response Headers
The response headers are the following:
HTTP/1.1 200 OK
Date: Mon, 17 Aug 2020 21:06:54 GMT
Content-Type: application/json
Content-Length: 5045
Connection: keep-alive
Cache-Control: no-cache, no-store, private
X-Frame-Options: SAMEORIGIN
X-XSS-Protection: 1;mode=block
Strict-Transport-Security: max-age=31536000; includeSubDomains
X-Content-Type-Options: nosniff
Set-Cookie: JSESSIONID=node0bucgs4gt7hci1gxvgtqc1hjdv647.node0; Path=/oml; Secure; HttpOnly
Expires: Thu, 01 Jan 1970 00:00:00 GMT
Response Body
A portion of the response body in JSON format is the following.
{"result":{"versicolor_8":{"SEPAL_LENGTH":{"0":5.7},"SPECIES":{"0":"versicolor"},"COUNT":{"0":5}},
"versicolor_7":{"SEPAL_LENGTH":{"0":5.6},"SPECIES":{"0":"versicolor"},"COUNT":{"0":2}},...}