Learn About Deploying an AI Meetings App
This solution is centered around artificial intelligence, integrating generative AI and Oracle Cloud services to automate and enhance processes. Using a low code platform (Oracle Visual Builder), the application development and deployment are streamlined, making it easy to leverage AI services and Oracle Cloud Infrastructure Object Storage.
Oracle Database 23ai and OCI AI services are used to create an AI meetings app to leverage cutting-edge technology to enhance team meetings with seamless real-time audio transcription, efficient summarization, and easy interrogation of transcriptions. The AI meetings app extracts sentiment, reduces administrative tasks, and provides data-driven insights to turn meeting data into actionable intelligence and boost overall productivity.
Architecture
After the app is successfully configured, the Oracle Visual Builder interface is used to upload audio and video to a bucket in OCI Object Storage. Oracle Cloud Infrastructure Speech is then invoked for audio transcription. The transcription is diarised, to detect the number of meeting attendees, and to separate the text by speaker.
After transcription is finished, the file is read from object storage, and sent to the AI services for analysis:
- Oracle Cloud Infrastructure Generative AI summarizes the transcription.
- Oracle Cloud Infrastructure Language analyzes the sentiment and detects entities such as products, locations, and human mentions.
- Our custom RAG (Retrieval-Augmented Generation) is running on Oracle Database 23ai. The service is running on a virtual machine with an exposed API. The API is invoked, and we provide the transcript and questions for the RAG to respond with the processed data.
The following diagram illustrates this solution:
smart-meeting-app-architecture.zip
This architecture supports the following components:
- Region
An Oracle Cloud Infrastructure region is a localized geographic area that contains one or more data centers, called availability domains. Regions are independent of other regions, and vast distances can separate them (across countries or even continents).
- Object storage
Oracle Cloud Infrastructure Object Storage provides quick access to large amounts of structured and unstructured data of any content type, including database backups, analytic data, and rich content such as images and videos. You can safely and securely store and then retrieve data directly from the internet or from within the cloud platform. You can scale storage without experiencing any degradation in performance or service reliability. Use standard storage for "hot" storage that you need to access quickly, immediately, and frequently. Use archive storage for "cold" storage that you retain for long periods of time and seldom or rarely access.
- Oracle Cloud Infrastructure Language
Oracle Cloud Infrastructure Language reduces this time and effort by providing key language processing capabilities as production-ready pre-trained models to uncover insights in unstructured text and eliminates the need for machine learning expertise. You can use OCI Language to automate text analysis at scale and understand unstructured text in documents, customer feedback interactions or support tickets regarding any issue or reviews. This will enable you to extract insights for improving customer experience and increasing efficiency.
OCI Language empowers developers to apply capabilities like sentiment analysis, key-phrase extraction, text classification, named entity recognition, and more into their applications. Developers can integrate pretrained NLP capabilities into applications, without needing data scientists to create customized models. OCI Language can be accessed either through the OCI console, OCI SDKs in Python, Java, Go, Typescript, .Net, REST APIs, or the OCI CLI.
Using OCI Language provides the following benefits:
- Improve customer experience: Explore how customers use the product(s), extract sentiments about certain areas of interest, and identify key frustrations to address them in a timely manner.
- Identify important data: Extract named entities from customer feedback to identify people, products, and organizations mentioned.
- Ensure security and privacy: OCI Language upholds customer privacy with language models that do not store any data for training, debugging, or other purposes. In addition, OCI Language can be used to identify any potential personally identifiable information (PII) to protect customer privacy.
- Oracle Cloud Infrastructure Speech
Oracle Cloud Infrastructure (OCI) Speech is one of the several cloud-native AI services. You can use the Speech service to convert audio files to readable text that is stored in JSON format.
Speech harnesses the power of spoken language by allowing you to easily convert audio files containing human speech into highly accurate text transcriptions. The service is an OCI native application that you can access using a web application, REST API, SDK, CLI, or Console.
Speech uses automatic speech recognition (ASR) technology to provide a grammatically correct transcription of video and audio files. Speech handles low-fidelity audio recordings and transcribes challenging recordings like meetings or call center calls. Using Speech, you can turn files stored in OCI Object Storage or a data asset into accurate, normalized, timestamped, and profanity-filtered text. This functionality is available with downstream services. For example, you could use additional services such as language and forecasting to analyze call sentiment, target content for advertising, index your media folders and create a media search engine using Oracle Cloud Infrastructure Lakehouse.
- Oracle Cloud Infrastructure Generative AI
Oracle Cloud Infrastructure (OCI) Generative AI is a fully managed service available via an API to seamlessly integrate these versatile language models into a wide range of use cases, including writing assistance, summarization, and chat. The OCI Generative AI service includes the following foundational models:
- Generation: Give instructions to generate text or extract information from your text.
- Summarization: Summarize text with your instructed format, length, and tone.
- Embedding: Convert text to vector embeddings to use in applications for semantic searches, text classification, or text clustering.
- Oracle Database 23ai
Oracle Database 23ai delivers the most complete and simple converged database for developers looking to build new microservice, graph, document, and relational applications.Oracle has announced the plan to add semantic search capabilities using AI vectors to Oracle Database 23ai. The collection of features, called AI Vector Search, includes a new vector data type, vector indexes, and vector search SQL operators that enable the Oracle Database to store the semantic content of documents, images, and other unstructured data as vectors, and use these to run fast similarity queries.
- Oracle Visual Builder
Oracle Visual Builder is an intuitive development experience on top of a development and hosting platform that empowers you to create engaging responsive applications. Focusing on ease of use and a visual development approach, it provides an easy way for you to create applications that are hosted in Oracle’s secure and scalable cloud platform.
Before You Begin
About Required Services and Roles
This solution requires the following services:
- Oracle Cloud Infrastructure Speech
- Oracle Cloud Infrastructure Language
- Oracle Cloud Infrastructure Generative AI
- Oracle Visual Builder
These are the roles needed for each service.
Service Name: Role | Required to... |
---|---|
Oracle Cloud Infrastructure Speech: manage ai-service-speech-family |
Create and manage transcription jobs. |
Oracle Cloud Infrastructure Language: manage ai-service-language-family |
Manage language projects and sentiments. |
Oracle Cloud Infrastructure Generative AI: manage generative-ai-family
|
Manage AI models, generate text generation, and summarization. |
Oracle Visual Builder: ServiceDeveloper |
Use the visual design tool to create, manage, secure, and publish visual applications. Design pages, work with business objects, and test applications. |
See Oracle Products, Solutions, and Services to get what you need.