OpenAI LLMs ChatGPT GPU memory News A bootleg API, AI’s RAM needs, cat prompts and GPU shortages: Lessons from scaling ChatGPT "Cache misses have this weird massive non-linear effect into how much work the GPUs are doing, because we suddenly need to start recomputing all this stuff." Edward Targett November 02, 2023
AI Cloud News OVHcloud gets its happy mitts on NVIDIA GPUs, touts new AI instances NVIDIA A100 80GB powered GPU instances are “immediately available" and a range more coming very soon. Edward Targett September 28, 2023
AI LLMs News New open source NVIDIA toolkit aims to boost inference speed The chipmaker says it can double the speed of inference on its H100 GPUs The Stack September 11, 2023
artificial intelligence News Nvidia rides AI wave to massive financial quarter Nvidia says the booming AI market has helped it turn in record financial results as its datacenter business remains strong Shaun Nichols August 24, 2023
artificial intelligence News Nvidia launches AI offensive with Grace Hopper "superchip" platform Nvidia has announced a new "superchip" design dubbed Grace Hooper as well as a service called AI-Workbench as part of a renewed push into the artificial intelligence space Shaun Nichols August 09, 2023
Cloud networks Ethernet AI News Nvidia releases "Spectrum-X": Ethernet on steroids for cloud AI A fully standards-based Ethernet with support for open Ethernet stacks (SONiC, Linux Switch) at cloud scale... Edward Targett May 30, 2023
Enterprise IT Featured Read This GPU lithography semiconductors CEOs of TSMC, ASML cite “tremendous benefit” of shifting to GPUs Harder, faster, stronger... Edward Targett March 22, 2023
Cloud Enterprise IT Featured AI AI cloud NVIDIA to launch its own AI Cloud, baking DGX into hyperscalers NVIDIA's AI infrastructure, full stack, via the browser... The Stack February 23, 2023