Sign up for major executives in San Francisco on July 11-12, to listen to how leaders are integrating and optimizing AI investments for results. Master Extra
Nvidia introduced nowadays at its yearly GPU Technology Conference (GTC) that it’s now offering a service that will enable enterprises to accessibility its AI supercomputing infrastructure and computer software as a result of a web browser.
The service — referred to as DGX Cloud — supplies dedicated clusters of Nvidia’s DGX AI supercomputers, which are created for deep studying and other highly developed AI applications. The business claimed that DGX Cloud would simplify the complexity of buying, deploying and controlling on-premises infrastructure for AI.
>>Follow VentureBeat’s ongoing Nvidia GTC spring 2023 protection<<
Nvidia has been iterating on different DGX form factors over the last several years, including building out small units with the DGX Station in 2021. The DGX Cloud now takes Nvidia hardware to the cloud, where it is offered as a service to offer organizations the full power of the converged platform, putting it in the cloud and making it accessible via a web browser interface.
Join us in San Francisco on July 11-12, where top executives will share how they have integrated and optimized AI investments for success and avoided common pitfalls.
“What we’ve done over the years with DGX is not just build the state-of-the-art supercomputer, but we built a software stack that sits on top of it, that turns this into a turnkey training-as-a-service platform,” Manuvir Das, VP of enterprise computing at Nvidia, said during a press briefing. “You just provide your job, point to your dataset, hit go and all of the orchestration and everything underneath is taken care of.”
Early users of DGX Cloud claim improved training speed
While the public service is only being formally announced now, Nvidia has had some early users of the DGX Cloud.
Among them is biopharmaceutical vendor Amgen, which has been using the DGX Cloud to accelerate machine learning (ML) training speed. Das noted that Amgen has reported three times faster performance for its training using DGX cloud versus other approaches.
Insurance technology vendor CCC Intelligent Solutions (CCC) is also a DGX Cloud user. Das said that CCC is an insurance company that is using AI to process claims from accidents in a matter of seconds versus days. CCC is using DGX Cloud to help with scaling demand.
“We run some experiments on-premises on Nvidia DGX systems, but we may have spikes where we want to add, for example, 10 million more data points and do another run,” Neda Hantehzadeh, director of data science at CCC, wrote in a blog post. “If we need additional capacity, we can switch to DGX Cloud.”
DGX Cloud service lands in the public cloud with Bluefield 3 DPU
DGX Cloud is being delivered as a service in the public cloud.
Das said that it is currently available on Oracle Cloud Infrastructure (OCI) as well as Equinix. He added that it’s coming soon to Microsoft Azure and Google Cloud.
The Oracle deployment was highlighted in a press briefing by Kevin Deierling, VP of networking at Nvidia. As part of its DGX Cloud deployment, Oracle is also making use of Nvidia’s Bluefield-3 data processing unit (DPU). Nvidia first announced the Bluefield-3 back in April 2021, but it has taken a couple of years to bring to full production. A DPU is a dedicated computing unit for data processing, which helps to accelerate overall performance by reducing the overhead on CPU and GPU resources.
“The Bluefield-3 has twice as many ARM processor cores, more accelerators, and runs workloads up to eight times faster than our previous generation DPU,” Deierling said. “The Bluefield-3 offloads, accelerates and isolates workloads across cloud, HPC, enterprise and accelerated AI use cases.”
VentureBeat’s mission is to be a digital town square for technical decision-makers to gain knowledge about transformative enterprise technology and transact. Discover our Briefings.