Optimization features of Oracle Exadata and Oracle RAC
Oracle Exadata and Oracle RAC offer advanced optimization features for machine learning by leveraging distributed parallelism, storage-tier processing, and dynamic resource allocation to enable scalability, high-performance model building, and real-time scoring.
Distributed Parallelism and Scalability:
- Oracle RAC enables distributed parallelism across cluster nodes, improving efficiency for data processing and machine learning tasks.
- Exadata's architecture supports scalable performance through in-memory processing and autoscaling, ensuring consistent performance even during peak workloads.
Storage-Tier Processing with Smart Scan:
Exadata Smart Scan technology processes SQL predicates and machine learning model scoring directly at the storage tier. This reduces data movement and speeds up query execution by 2-5 times compared to non-Smart Scan in-database scoring.
Data Loading and Model Caching:
- Oracle Machine Learning (OML) loads data incrementally into memory, eliminating the need for the entire data set to fit in memory.
- Models are efficiently cached and shared across queries, minimizing memory overhead and improving multi-user performance.
High-Performance Scoring:
- Machine learning models are integrated as SQL functions, enabling high-performance scoring in both batch and online transactional processing (OLTP) environments.
- OML leverages Exadata’s storage-tier optimization for real-time scoring using current table data.
Autoscaling on Autonomous Database:
The Autonomous Database dynamically adjusts computing power to handle multiple users and simultaneous queries.
Multi-User Support:
- Disk-Aware Structures optimize memory allocation using the database memory manager, thereby efficient performance in multi-user environments.
- For partitioned models, only necessary component models are loaded, reducing memory usage and increasing speed.
Security and Auditing:
In-database machine learning models follow database security schemes, including access control, privilege management, and audit tracking. This enables compliance and secure use of machine learning models across different environments.
Parent topic: What is In-Database Machine Learning