ElevenLabs, a London-based company specializing in artificial intelligence voice technology, is expanding its strategic partnership with Google Cloud. The collaboration, announced on , will leverage Google Cloud’s infrastructure and the latest NVIDIA Blackwell GPUs to scale ElevenLabs’ enterprise AI voice tools.
This multi-year agreement aims to provide ElevenLabs with the computational resources needed to support larger deployments for its enterprise customers and accelerate its research and development efforts. The core of the expansion centers on utilizing Google Cloud’s G4 virtual machines, powered by NVIDIA RTX PRO 6000 Blackwell GPUs. These GPUs represent a significant leap in performance for generative AI workloads, enabling faster model training and more reliable service delivery at scale.
The demand for this increased capacity stems from the growing adoption of ElevenLabs’ technology across various industries. Enterprises are deploying AI agents powered by ElevenLabs to handle customer support, internal training, and sales interactions in a multitude of languages. The company’s tools also facilitate the localization of extensive content libraries – translating and voicing materials into over 70 languages – and the creation of consistent brand voices for multimedia assets.
The Blackwell GPUs are particularly crucial because of their architecture, designed to efficiently handle the large-model workloads characteristic of modern AI. Next-generation accelerators like Blackwell offer substantial performance gains compared to previous generations, making complex AI tasks more feasible and cost-effective. This aligns with a broader industry trend towards specialized AI hardware to power increasingly sophisticated intelligent services.
Beyond the infrastructure upgrade, the partnership extends to software integration. Google Cloud’s AI stack, including its Gemini and Veo models, will be integrated into ElevenLabs’ Agents and Creative platforms. This integration is intended to enhance the reasoning capabilities, multi-step planning, and media generation features of ElevenLabs’ offerings. Gemini and Veo are expected to contribute to more natural and nuanced AI-generated voices and interactions.
A key component of this expanded collaboration is the availability of ElevenLabs’ solutions on the Google Cloud Marketplace. This simplifies the procurement process for businesses looking to integrate advanced voice capabilities into their operations, streamlining compliance and billing. The Marketplace listing allows organizations to quickly deploy and scale conversational agents for a variety of use cases, including customer support, internal training, and inbound sales.
ElevenLabs’ ambition is to make natural, real-time voice AI accessible globally. The company’s technology allows for the localization and consistent generation of voices across a wide range of languages, a capability increasingly sought after by enterprises aiming to improve customer engagement and transform their content strategies. The ability to deliver consistent brand voice across multiple languages is a significant differentiator in a globalized market.
For Google Cloud, the deepened partnership with ElevenLabs reinforces its position as a leading platform for high-performance AI applications. By offering best-in-class infrastructure, including access to cutting-edge GPUs, and a comprehensive marketplace encompassing compute resources, AI models, and third-party solutions, Google Cloud aims to attract and support innovative AI companies like ElevenLabs.
The expansion of this collaboration comes at a time of rapid growth for ElevenLabs. The company has reportedly achieved a valuation in the billions and is actively positioning voice-centric AI as a fundamental component of enterprise digital transformation. This suggests a growing recognition of the value of AI-powered voice technology in enhancing business processes and customer experiences.
The integration of ElevenLabs’ technology with Google Cloud’s Gemini 2.0 Flash model, as highlighted in a recent blog post, further demonstrates the synergy between the two companies. Gemini 2.0 Flash is designed for ultra-fast response times and reliable function calling, making it well-suited for voice-driven applications requiring low latency and high accuracy. This combination promises to deliver a more responsive and intelligent user experience.
