AWS Inferentia2
Inferentia Gen2 Architecture
Active
Launched November 2022
Core Specifications
| Specification | Value |
|---|---|
| Vendor | AWS |
| Architecture | Inferentia Gen2 |
| Form Factor | — |
| VRAM | 32 GB |
| Memory Bandwidth | — |
| TDP | 150 W |
Compute Performance
| Precision | TFLOPS |
|---|---|
| BF16 | 190 |
Performance Benchmarks
Image generation
| Configuration | Precision | Performance |
|---|---|---|
| Stable Diffusion 2.1, 512x512 | — | 1.2 images/s |
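The figures on this card can be combined into a few derived metrics. The sketch below is illustrative only: the helper names are hypothetical, the efficiency figure assumes the chip draws its full 150 W TDP at peak BF16 throughput, and the per-image time assumes images are produced one after another with no batching or pipelining. Neither assumption is stated by the card.

```python
# Figures copied from this spec card.
BF16_TFLOPS = 190.0            # peak BF16 compute
TDP_WATTS = 150.0              # thermal design power
SD21_IMAGES_PER_SECOND = 1.2   # Stable Diffusion 2.1, 512x512


def tflops_per_watt(tflops: float, watts: float) -> float:
    """Peak compute efficiency, assuming power draw equals TDP."""
    return tflops / watts


def seconds_per_image(images_per_second: float) -> float:
    """Average time per image, assuming strictly sequential generation."""
    return 1.0 / images_per_second


def images_per_hour(images_per_second: float) -> float:
    """Sustained hourly output at the benchmarked rate."""
    return images_per_second * 3600.0


if __name__ == "__main__":
    print(f"BF16 efficiency: {tflops_per_watt(BF16_TFLOPS, TDP_WATTS):.2f} TFLOPS/W")
    print(f"Time per image:  {seconds_per_image(SD21_IMAGES_PER_SECOND):.2f} s")
    print(f"Images per hour: {images_per_hour(SD21_IMAGES_PER_SECOND):.0f}")
```

These are upper-bound style estimates derived from peak and benchmark numbers; measured efficiency and latency depend on the workload and software stack.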