Summarize chat by using Oracle Generative AI on a dedicated AI cluster
This reference architecture describes how OCI Generative AI service summarizes chat communication between the users.
Architecture
- OCI Generative AI Service with Dedicated AI Cluster for Text Summarization
- Oracle Digital Assistant
- OCI API Gateway
- OCI Email Delivery
- Oracle B2C Service Cloud
The following diagram illustrates this reference architecture.
Description of the illustration summarize-chat-oci-gen-ai.png
summarize-chat-oci-gen-ai-oracle.zip
- Region
An OCI region is a localized geographic area that contains one or more data centers, called availability domains. Regions are independent of other regions, and vast distances can separate them (across countries or even continents).
- Virtual Cloud Network (VCN) and Subnets
A VCN is a customizable, software-defined network that you set up in an OCI region. Like traditional data center networks, VCNs give you complete control over your network environment. A VCN can have multiple non-overlapping CIDR blocks that you can change after you create the VCN. You can segment a VCN into subnets, which can be scoped to a region or to an availability domain. Each subnet consists of a contiguous range of addresses that don't overlap with the other subnets in the VCN. You can change the size of a subnet after creation. A subnet can be public or private.
- Object Storage
Object storage provides quick access to large amounts of structured and unstructured data of any content type, including database backups, analytic data, and rich content such as images and videos. You can safely and securely store and then retrieve data directly from the internet or from within the cloud platform. You can seamlessly scale storage without experiencing any degradation in performance or service reliability. Use standard storage for "hot" storage that you need to access quickly, immediately, and frequently. Use archive storage for "cold" storage that you retain for long periods of time and seldom or rarely access.
- OCI Generative AI
OCI Generative AI is a fully managed service available via an API to seamlessly integrate these versatile language models into a wide range of use cases, including writing assistance, summarization, and chat. The OCI Generative AI service includes the following foundational models:
- Generation: Give instructions to generate text or extract information from your text.
- Summarization: Summarize text with your instructed format, length, and tone.
- Embedding: Convert text to vector embeddings to use in applications for semantic searches, text classification, or text clustering.
- Dedicated AI Clusters
Dedicated AI clusters are compute resources that you can use for fine-tuning custom models or for hosting endpoints for custom models. The clusters are dedicated to your models and not shared with users in other tenancies. Custom Model OCI Generative AI lets you use your own data to refine the models.
- Oracle Digital Assistant
Digital Assistant is a platform for creating conversational interfaces or chatbots. A chatbot is a computer program that simulates and processes human conversation (written or spoken), allowing humans to interact with applications and data as if they were communicating with a real person. Developers assemble digital assistants from one or more skills. Skills can be based on prebuilt skills provided by Oracle or third parties, custom developed, or based on one of the many skill templates available. Digital Assistant routes the user’s request to the most appropriate skill to satisfy the user’s request. Skills combine a multilingual NLP deep learning engine, a powerful dialogue flow engine, and integration components to connect to back-end systems.
- API Gateway
Oracle API Gateway enables you to publish APIs with private endpoints that are accessible from within your network, and which you can expose to the public internet if required. The endpoints support API validation, request and response transformation, CORS, authentication and authorization, and request limiting.
- Email Delivery
OCI Email Delivery is a highly scalable, cost effective, and reliable email delivery service for sending high-volume, application-generated emails for mission-critical marketing, notification, and transactional communications such as receipts, fraud detection alerts, multifactor identity verification, and password resets.
- Identity and Access Management (IAM)
OCI Identity and Access Management (IAM) is the access control plane for OCI and Oracle Cloud Applications. The IAM API and the user interface enable you to manage identity domains and the resources within the identity domain. Each OCI IAM identity domain represents a standalone identity and access management solution or a different user population.
- Logging
Logging is a highly scalable and fully managed service that provides access to the following types of logs from your resources in the cloud:
- Audit logs: Logs related to events emitted by the Audit service.
- Service logs: Logs emitted by individual services such as API Gateway, Events, Functions, Load Balancing, Object Storage, and VCN flow logs.
- Custom logs: Logs that contain diagnostic information from custom applications, other cloud providers, or an on-premises environment.
- Monitoring
OCI Monitoring service actively and passively monitors your cloud resources using metrics to monitor resources and alarms to notify you when these metrics meet alarm-specified triggers.