Google's New AI Model Hits 1,000 Tokens Per Second On Nvidia GPUs
Google DeepMind released DiffusionGemma on June 10, 2026, a new text-generation model that produces text in parallel blocks rather than sequentially. The company says it reaches up to 1,000 tokens per second on Nvidia GPU hardware. According to a report, DeepMind's benchmarks show DiffusionGemma run