CPU and GPU Support

Coherence RAG supports deployment on both CPU and GPU infrastructures. While CPU-based deployment offers cost-effective scalability for moderate workloads, GPU acceleration significantly enhances performance, making it ideal for high-throughput and low-latency applications. Users can configure their environment based on hardware availability and performance requirements.