Date: April 04, 2024
VNvidia and Google Cloud have joined hands to launch a new cloud hardware offering, the L4 platform, optimized to accelerate ‘AI-powered’ video performance.Nvidia and Google Cloud have joined hands to launch a new cloud hardware offering, the L4 plat
Nvidia has partnered with Google Cloud to launch a new cloud hardware offering, the L4 platform, optimized for video-focused applications.
The L4 platform is designed to accelerate "AI-powered" video performance, serving as a general-purpose GPU with video decoding, transcoding, and video streaming capabilities. It will be available later this year from Nvidia's network hardware partners, including Asus, Cisco, Dell, Hewlett Packard Enterprise, and Lenovo.
In addition to L4, Nvidia has announced other AI-focused hardware solutions such as L40, H100 NVL, and Grace Hopper for Recommendation Models. The L40 is optimized for graphics and AI-enabled 2D, video, and 3D image generation, while the H100 NVL supports deploying large language models such as ChatGPT.
Grace Hopper for Recommendation Models is recommendation model-focused. L40 is available this week through Nvidia's hardware partners, and Grace Hopper and the H100 NVL are expected to ship in the second half of the year.
Nvidia is also launching its DGX Cloud platform, which gives companies access to infrastructure and software to train models for generative and other forms of AI. Each instance of DGX Cloud features eight Nvidia H100 or A100 80GB Tensor Core GPUs for a total of 640GB of GPU memory per node, paired with storage.
The service will be hosted by "leading" cloud service providers, starting with Oracle Cloud Infrastructure, with Microsoft Azure and Google Cloud to follow. These developments demonstrate Nvidia's aggressive push into AI compute as the company moves away from unprofitable investments in other areas like gaming and professional virtualization.
With the growth of its data center business, which includes chips for AI, Nvidia could continue to benefit from the generative AI boom. Stay tuned for more updates on these exciting developments!
By Arpit Dubey
Arpit is a dreamer, wanderer, and tech nerd who loves to jot down tech musings and updates. Armed with a Bachelor's in Business Administration and a knack for crafting compelling narratives and a sharp specialization in everything from Predictive Analytics to FinTech—and let’s not forget SaaS, healthcare, and more. Arpit crafts content that’s as strategic as it is compelling. With a Logician mind, he is always chasing sunrises and tech advancements while secretly preparing for the robot uprising.
Reddit Unveils AI-Powered Search Tool for Smarter Results
Reddit launched Reddit Answers, an AI-powered search tool that curates and summarizes discussions to enhance user experience and reduce reliance on Google.
OpenAI Scraps o3 Model, Pushes for Unified GPT-5 in a Major AI Overhaul
OpenAI is canceling its o3 AI model and merging it into GPT-5 for a simpler, more powerful system. A big move to stay ahead in the AI race.
Virtual Reality in Healthcare: Revolutionizing Patient Care
Experience the power of virtual reality in healthcare as it transforms medical training, patient care, and treatment methods with immersive technology for better accuracy, efficiency, and improved outcomes.
Google I/O 2025: Dates Announced for the Tech Giant’s Biggest Event of the Year
Google I/O 2025 is set for May 20-21! Expect big AI reveals, Android 16 updates, and more. Registrations are open for keynotes, demos, and game-changing tech innovations!