Google TPU v4

TPU v4 Architecture

Active

Launched May 2021

Core Specifications

VendorGoogle
ArchitectureTPU v4
Form FactorMezzanine
VRAM
Memory Bandwidth
TDP300 W

Compute Performance

PrecisionTFLOPs
BF16275

Performance Benchmarks

llm train

ConfigurationPrecisionPerformanceSource
Model FLOPs UtilizationBF1658 mfu_percentView
PaLM 540B, previous generationBF1611,000 tokens_per_secondView

llm inference

ConfigurationPrecisionPerformanceSource
Gemini models, batch inferenceBF166,500 tokens_per_secondView

Pricing

Cloud Rental (OPEX)

ProviderInstance TypePrice per HourRegionAs of
Google Cloud$3.00/hrus-centralDec 2025

Quick Stats

Peak Performance
275
TFLOPs (BF16)
Efficiency
0.92
TFLOPs per Watt

Similar XPUs

View other Google GPUs or compare across vendors