Summarize chat by using Oracle Generative AI on a dedicated AI cluster

Today, businesses rely on their support centers to resolve the external customers' issues. As part of the support engagement process, the support engineer uses chat communication to efficiently engage with the customer. After the chat, to improve the customer experience, the engineer must spend additional time sharing summary notes on what was discussed .
With help of Oracle Cloud Infrastructure(OCI) Generative AI service, the engineer can summarize the content of the chat and automate the end-to-end communication process. Additionally, OCI Generative AI allows you to refine the models and customize them using your own data. You can leverage the dedicated AI cluster to fine-tune custom models or to host endpoints for custom models. The Learning models can effectively summarize the conversation in a human-friendly readable tone, thereby saving the engineer's valuable time and increasing the productivity of customer engagement.

This reference architecture describes how OCI Generative AI service summarizes chat communication between the users.

Architecture

This reference architecture leverages Oracle Digital Assistant (ODA) to simulate conversation with human users over the internet. ODAs are AI-driven interfaces that help users accomplish a variety of tasks in natural language conversations. You can integrate ODA with custom knowledge-based search to provide the user more information on the context. This architecture also uses an OCI API Gateway to secure and manage the APIs from Generative AI and Digital Assistant. You can use these APIs to integrate with any application; for example, this architecture integrates with Oracle B2C Service Cloud. By using the webhook code, the ODA bot can transfer chats to a live agent for detailed analysis and issue resolution. Once the chat communication is closed,OCI Generative AI uses the fine-tuned model hosted on the Dedicated AI Cluster to generate a summary of the chat. The summary is shared with the customer by using OCI Email Delivery Service. This architecture also takes advantage of OCI Logging and Monitoring services for better visibility and resource control. If fine-tuning is not a requirement, then you can use the managed service to call the base model, eliminating the need for a dedicated AI Cluster.
In this reference architecture, the following OCI services are provisioned:
  • OCI Generative AI Service with Dedicated AI Cluster for Text Summarization
  • Oracle Digital Assistant
  • OCI API Gateway
  • OCI Email Delivery
  • Oracle B2C Service Cloud

The following diagram illustrates this reference architecture.


Description of summarize-chat-oci-gen-ai.png follows
Description of the illustration summarize-chat-oci-gen-ai.png

summarize-chat-oci-gen-ai-oracle.zip

The architecture has the following components:
  • Region

    An OCI region is a localized geographic area that contains one or more data centers, called availability domains. Regions are independent of other regions, and vast distances can separate them (across countries or even continents).

  • Virtual Cloud Network (VCN) and Subnets

    A VCN is a customizable, software-defined network that you set up in an OCI region. Like traditional data center networks, VCNs give you complete control over your network environment. A VCN can have multiple non-overlapping CIDR blocks that you can change after you create the VCN. You can segment a VCN into subnets, which can be scoped to a region or to an availability domain. Each subnet consists of a contiguous range of addresses that don't overlap with the other subnets in the VCN. You can change the size of a subnet after creation. A subnet can be public or private.

  • Object Storage

    Object storage provides quick access to large amounts of structured and unstructured data of any content type, including database backups, analytic data, and rich content such as images and videos. You can safely and securely store and then retrieve data directly from the internet or from within the cloud platform. You can seamlessly scale storage without experiencing any degradation in performance or service reliability. Use standard storage for "hot" storage that you need to access quickly, immediately, and frequently. Use archive storage for "cold" storage that you retain for long periods of time and seldom or rarely access.

  • OCI Generative AI
    OCI Generative AI is a fully managed service available via an API to seamlessly integrate these versatile language models into a wide range of use cases, including writing assistance, summarization, and chat. The OCI Generative AI service includes the following foundational models:
    • Generation: Give instructions to generate text or extract information from your text.
    • Summarization: Summarize text with your instructed format, length, and tone.
    • Embedding: Convert text to vector embeddings to use in applications for semantic searches, text classification, or text clustering.
  • Dedicated AI Clusters

    Dedicated AI clusters are compute resources that you can use for fine-tuning custom models or for hosting endpoints for custom models. The clusters are dedicated to your models and not shared with users in other tenancies. Custom Model OCI Generative AI lets you use your own data to refine the models.

  • Oracle Digital Assistant

    Digital Assistant is a platform for creating conversational interfaces or chatbots. A chatbot is a computer program that simulates and processes human conversation (written or spoken), allowing humans to interact with applications and data as if they were communicating with a real person. Developers assemble digital assistants from one or more skills. Skills can be based on prebuilt skills provided by Oracle or third parties, custom developed, or based on one of the many skill templates available. Digital Assistant routes the user’s request to the most appropriate skill to satisfy the user’s request. Skills combine a multilingual NLP deep learning engine, a powerful dialogue flow engine, and integration components to connect to back-end systems.

  • API Gateway

    Oracle API Gateway enables you to publish APIs with private endpoints that are accessible from within your network, and which you can expose to the public internet if required. The endpoints support API validation, request and response transformation, CORS, authentication and authorization, and request limiting.

  • Email Delivery

    OCI Email Delivery is a highly scalable, cost effective, and reliable email delivery service for sending high-volume, application-generated emails for mission-critical marketing, notification, and transactional communications such as receipts, fraud detection alerts, multifactor identity verification, and password resets.

  • Identity and Access Management (IAM)

    OCI Identity and Access Management (IAM) is the access control plane for OCI and Oracle Cloud Applications. The IAM API and the user interface enable you to manage identity domains and the resources within the identity domain. Each OCI IAM identity domain represents a standalone identity and access management solution or a different user population.

  • Logging
    Logging is a highly scalable and fully managed service that provides access to the following types of logs from your resources in the cloud:
    • Audit logs: Logs related to events emitted by the Audit service.
    • Service logs: Logs emitted by individual services such as API Gateway, Events, Functions, Load Balancing, Object Storage, and VCN flow logs.
    • Custom logs: Logs that contain diagnostic information from custom applications, other cloud providers, or an on-premises environment.
  • Monitoring

    OCI Monitoring service actively and passively monitors your cloud resources using metrics to monitor resources and alarms to notify you when these metrics meet alarm-specified triggers.

Acknowledgments

Author: Rajsagar Rawool