CoreWeave and Perplexity Partner to Scale AI Inference Workloads

March 5, 2026

CoreWeave, Inc., The Essential Cloud for AI, has announced a multi-year strategic partnership with Perplexity to support the company’s growing inference workloads on the CoreWeave Cloud. In addition, both organizations will collaborate to pilot new services aimed at improving AI performance, scalability, and operational efficiency.

As AI-powered applications increasingly operate in real-world environments, companies must ensure that their infrastructure delivers high performance and reliability. Perplexity develops AI-native products and services designed to run continuously, where inference speed and consistency directly influence user experience. Therefore, the company requires an infrastructure platform that can support high-performance computing while maintaining low latency and predictable operational costs.

AI Authority Trend: CoreWeave Launches ARENA to Help Businesses Test AI Workloads at Production Scale

To address these needs, the CoreWeaveCloud platform provides infrastructure specifically designed for AI workloads. The platform delivers consistent performance, enabling organizations to manage large-scale inference tasks while scaling resources quickly as demand grows. Moreover, the infrastructure allows companies to move seamlessly from development to long-term production without needing to redesign existing systems or tools.

Under the terms of the partnership, Perplexity will run its next-generation inference workloads on CoreWeave’s platform. By leveraging dedicated NVIDIA GB200 NVL72-powered clusters, CoreWeave will provide the computing capacity required to support Perplexity’s rapid growth. At the same time, the infrastructure will meet the advanced performance requirements of the company’s Sonar and Search API ecosystem.

In addition to supporting inference workloads, CoreWeave will deploy Perplexity Enterprise Max across its internal operations. This integration will enable employees to search both the web and internal knowledge sources, conduct multi-step research tasks, visualize and analyze data, and interact with advanced AI models through a single platform.

“We’re proud to partner with Perplexity as they scale their inference workloads on CoreWeave’s AI cloud,” said Max Hjelm, senior vice president of revenue at CoreWeave. “AI applications running in production require more than just access to raw infrastructure – they require best-in-class performance and reliability as well as a cloud platform designed end-to-end for AI that simplifies compute operations.”

AI Authority Trend: NVIDIA Expands Partnership with CoreWeave to Power 5GW of AI Infrastructure by 2030

Furthermore, Perplexity has already begun running inference workloads using the CoreWeave Kubernetes Service as part of the initial deployment phase. The company is also utilizing W&B Models to train, fine-tune, and manage models throughout the entire lifecycle, from early experimentation to full-scale production.

“We were impressed by the combination of CoreWeave’s technical aptitude and partner-first mindset that help AI-native companies accelerate their growth and scaling goals,” said Dmitry Shevelenko, chief business officer at Perplexity. “CoreWeave is an essential partner in our efforts to optimize our infrastructure and the models we use to provide Perplexity users across industries with the strongest AI tools and agents on the market.”

Importantly, the partnership aligns with Perplexity’s broader multi-cloud strategy, which focuses on leveraging specialized infrastructure providers for advanced AI workloads. At the same time, the collaboration highlights CoreWeave’s role as a dedicated AI cloud provider supporting organizations that operate large-scale AI systems in demanding production environments.

CoreWeave continues to set performance benchmarks across the AI cloud industry. The company recently achieved industry-leading MLPerf benchmark results and remains the only AI cloud provider to earn the top Platinum ranking in both SemiAnalysis ClusterMAX 1.0 and 2.0 evaluations, which assess cloud performance, efficiency, and reliability for AI workloads.

AI Authority Trend: CoreWeave Integrates NVIDIA Rubin to Power Next-Gen AI Workloads

To share your insights, please write to us at info@intentamplify.com

Tags: AI models, AI performance, AI tools, AI workloads, AI-native products, AI-powered applications, CoreWeave, Perplexity

AI Tech Staff Writer

AI staff writer with a passion for exploring the latest in AI technology. Specializing in original rewrites and insightful coverage of cutting-edge advancements. Dedicated to delivering clear, engaging news and analysis on the evolving AI landscape to keep readers informed and ahead of the curve.

CoreWeave and Perplexity Partner to Scale AI Inference Workloads

AI Tech Staff Writer

Share With

Recent Posts

Core Education Launches CoreXP AI Operating Model for Universities

Redblock Launches 1-Click AI Agent Deployment for SailPoint

Altera Expands FPGA Capabilities for Physical AI in Robotics and Edge Applications

CoreWeave and Perplexity Partner to Scale AI Inference Workloads

Peridio and SolidRun Partner to Accelerate Deployment of Physical AI Platforms

Keysight and Qualcomm Demonstrate ML-Based CSI Compression for 5G-Advanced

Contact Us

Quick Links

Insights

Get in touch

Follow Us

Our Other Brands

Download the AI Technology Insights Media Kit