Transcribe Speech to Text with a Speech Action
Oracle Cloud Infrastructure Speech transcribes natural language speech to text. You can get accurate, text-normalized, and time-stamped transcriptions.
Capabilities
OCI Speech is an AI service that converts audio files containing human speech into accurate text transcriptions. It uses automatic speech recognition (ASR) technology to provide grammatically correct transcriptions. It can handle low-fidelity media recordings and transcribe challenging recordings such as meetings or call center calls. Oracle Integration supports using OCI Speech in an integration through the speech action.
Prerequisites
See Prerequisites for the requirements you must satisfy in the Oracle Cloud Infrastructure Console.
Invoke Oracle Cloud Infrastructure Speech from an Integration
- Add a Speech action to an integration in either of the following ways:
- On the side of the canvas, click Actions and drag the OCI Speech action to the appropriate location.
- Click at the location where you want to add the OCI Speech action, then select OCI Speech.
- Enter a name and optional description.
- Select the action you want to perform. You can create, update, or delete a transcription job. You can also get information about a transcription job or list the transcription jobs that are available in a compartment.
Action: Select the transcription job action to perform.
- Create transcription job: Oracle Integration accepts a request payload containing the compartment id, model, input location, and output location, and uses it to create a transcription job. You can also specify the compartment id in the Compartment field in Step 5.
- Get transcription job: Oracle Integration accepts the transcription job id as a path parameter and retrieves that transcription job.
- List transcription jobs: Oracle Integration accepts the id of the compartment that contains the transcription jobs as a query parameter and returns a list of the transcription jobs available in that compartment. You can also specify the compartment id in the Compartment field in Step 5.
- Update transcription job: Oracle Integration accepts the id of the transcription job you want to update as a path parameter, along with a request payload containing the details to modify, such as display name and description. The action updates the specified transcription job with the new details you provide.
- Delete transcription job: Oracle Integration accepts the id of the transcription job you want to delete as a path parameter and deletes that job. The output transcription file stored in the output location bucket in the object store is not deleted.
- Click Continue.
- Select the following information, then click Continue.
Compartment: This field is displayed only when you select the Create transcription job or List transcription jobs action in Step 3. Select the Oracle Cloud Infrastructure compartment in which your Oracle Integration is installed.
Output bucket: This field is displayed only when you select the Create transcription job action in Step 3. Select a bucket to store the text output generated by the OCI Speech action.
- On the Summary page, click Finish.
- Open the mapper and define the mappings between the source and target elements as needed for the selected transcription job action.
Note:
You can optionally specify Compartment Id and Bucket Name in the mapper to override the values you selected initially for Compartment and Output Bucket, respectively, in Step 5.
- Perform the following source-to-target mappings for the Create transcription job action:
- Map the source Is Punctuation Enabled to the target Is Punctuation Enabled.
- Map the source Compartment Id to the target Compartment Id.
- Map the source Display Name to the target Display Name.
- Map the source Description to the target Description.
- Map the source Domain to the target Domain.
- Map the source Language Code to the target Language Code.
- Map the source Model Type to the target Model Type.
- Map the source Is Diarization Enabled to the target Is Diarization Enabled.
- Map the source Location Type to the target Location Type.
- Map the source Object Locations to the target Object Locations.
- Map the source Namespace Name to the target Namespace Name.
- Map the source Bucket Name to the target Bucket Name.
- Map the source Prefix to the target Prefix.
- Perform the following source-to-target mapping for the Get transcription job action:
- Map the source Transcription Job Id to the target Transcription Job Id.
- Perform the following source-to-target mapping for the List transcription jobs action:
- Map the source Compartment Id to the target Compartment Id.
You can optionally configure target elements such as Lifecycle State, Display Name, Id, Limit, Page, Sort Order, and Sort By.
- Perform the following source-to-target mappings for the Update transcription job action:
- Map the source Transcription Job Id to the target Transcription Job Id.
- Map the source Display Name to the target Display Name.
- Map the source Description to the target Description.
- Perform the following source-to-target mapping for the Delete transcription job action:
- Map the source Transcription Job Id to the target Transcription Job Id.
- Exit the mapper.
The speech action is now configured.
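The Create transcription job mappings listed in the mapper step can be pictured as a single request payload. The following is a minimal Python sketch of that idea, not the payload Oracle Integration actually builds: the function name `build_create_job_payload`, the camelCase key names, the nesting, and the sample values are all assumptions derived loosely from the element names in the mapper. It also illustrates the note about overrides, where a Compartment Id or Bucket Name supplied in the mapper takes precedence over the value selected in Step 5.

```python
import json


def build_create_job_payload(selected, mapped):
    """Assemble a hypothetical Create transcription job request body.

    selected: Compartment and Output bucket chosen in Step 5.
    mapped:   values supplied in the mapper; a Compartment Id or
              Bucket Name given here overrides the Step 5 selection.
    Key names and nesting are illustrative assumptions.
    """
    compartment_id = mapped.get("compartmentId") or selected["compartmentId"]
    output_bucket = mapped.get("bucketName") or selected["outputBucket"]

    return {
        "compartmentId": compartment_id,
        "displayName": mapped.get("displayName"),
        "description": mapped.get("description"),
        "normalization": {
            "isPunctuationEnabled": mapped.get("isPunctuationEnabled"),
        },
        "modelDetails": {
            "domain": mapped.get("domain"),
            "languageCode": mapped.get("languageCode"),
            "modelType": mapped.get("modelType"),
            "isDiarizationEnabled": mapped.get("isDiarizationEnabled"),
        },
        "inputLocation": {
            "locationType": mapped.get("locationType"),
            "objectLocations": mapped.get("objectLocations", []),
        },
        "outputLocation": {
            "namespaceName": mapped.get("namespaceName"),
            "bucketName": output_bucket,
            "prefix": mapped.get("prefix"),
        },
    }


if __name__ == "__main__":
    # Placeholder values for illustration only.
    selected = {"compartmentId": "compartment-from-step-5",
                "outputBucket": "bucket-from-step-5"}
    mapped = {"displayName": "demo-job",
              "languageCode": "en-US",
              "isPunctuationEnabled": True,
              "bucketName": "bucket-from-mapper"}
    print(json.dumps(build_create_job_payload(selected, mapped), indent=2))
```

In this sketch, elements left unmapped appear as null in the printed JSON; the schema shown in the mapper remains the authority on which elements are required.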
Here's what happens when you activate and run the integration based on the transcription job action you selected in Step 3:
- Create transcription job: Converts the speech you provided to text and stores the text output in the output bucket you selected.
- Get transcription job: Retrieves the transcription job with the specified Id.
- List transcription jobs: Returns a list of transcription jobs available in the specified compartment.
- Update transcription job: Updates the specified transcription job with the new details you provided.
- Delete transcription job: Deletes the specified transcription job (but it will not delete the output transcription file that is stored in the output location bucket in the object store).
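The five actions differ mainly in where their inputs travel: a path parameter, a query parameter, a request payload, or a combination. The sketch below models that split in Python using only the standard library. The names `ACTIONS` and `plan_request` are hypothetical, the camelCase parameter names are assumptions based on the mapper element names, and treating the optional List elements (Lifecycle State, Limit, and so on) as query parameters is also an assumption. It is a way to reason about the actions, not a description of how Oracle Integration calls OCI Speech.

```python
from urllib.parse import urlencode

# Input placement per action, following the action descriptions in Step 3.
ACTIONS = {
    "Create transcription job": {"path": None, "query": (), "body": True},
    "Get transcription job": {"path": "transcriptionJobId", "query": (), "body": False},
    "List transcription jobs": {
        "path": None,
        "query": ("compartmentId", "lifecycleState", "displayName", "id",
                  "limit", "page", "sortOrder", "sortBy"),
        "body": False,
    },
    "Update transcription job": {"path": "transcriptionJobId", "query": (), "body": True},
    "Delete transcription job": {"path": "transcriptionJobId", "query": (), "body": False},
}


def plan_request(action, params=None, body=None):
    """Split the inputs for an action into path, query, and body parts."""
    spec = ACTIONS[action]
    params = dict(params or {})
    plan = {"path_param": None, "query": "", "body": None}

    # Get, Update, and Delete identify the job through a path parameter.
    if spec["path"]:
        if not params.get(spec["path"]):
            raise ValueError(f"{action} requires {spec['path']}")
        plan["path_param"] = params[spec["path"]]

    # List takes the compartment id, plus any optional filters, as a query.
    if spec["query"]:
        if not params.get("compartmentId"):
            raise ValueError(f"{action} requires compartmentId")
        chosen = {k: params[k] for k in spec["query"] if params.get(k) is not None}
        plan["query"] = urlencode(chosen)

    # Create and Update carry their details in a request payload.
    if spec["body"]:
        plan["body"] = dict(body or {})
    return plan


if __name__ == "__main__":
    print(plan_request("List transcription jobs",
                       {"compartmentId": "example-compartment", "limit": 10}))
    print(plan_request("Update transcription job",
                       {"transcriptionJobId": "example-job"},
                       {"displayName": "renamed-job"}))
```

Keeping the placement rules in one table makes it easy to see that only Create and Update need a payload mapping, while Get, Update, and Delete all depend on the Transcription Job Id.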