Compare XPUs
Select up to 5 XPUs to compare side-by-side
Select XPUs to Compare
Alibaba
Hanguang 800
AMD
MI100
23.1 TFLOPs
AMD
MI210
181 TFLOPs
AMD
MI250X
383 TFLOPs
AMD
MI300A
980.6 TFLOPs
AMD
MI300X
1,307 TFLOPs
AMD
MI325X
1,400 TFLOPs
AMD
MI350X
2,100 TFLOPs
AMD
Radeon PRO W7900
122 TFLOPs
AMD
Radeon RX 7900 XT
104 TFLOPs
AMD
Radeon RX 7900 XTX
122 TFLOPs
AWS
Inferentia2
190 TFLOPs
AWS
Trainium
190 TFLOPs
AWS
Trainium2
680 TFLOPs
Baidu
Kunlun II
Biren Technology
BR100
Cambricon
MLU370
256 TFLOPs
Cerebras
WSE-3
Enflame Technology
CloudBlazer T20
Etched
Sohu
10,000 TFLOPs
FuriosaAI
Warboy
TPU v4
275 TFLOPs
TPU v5e
197 TFLOPs
TPU v5p
459 TFLOPs
TPU v6e (Trillium)
918 TFLOPs
Graphcore
Bow IPU
Graphcore
IPU-M2000
Groq
LPU Inference Engine
Huawei
Ascend 910B
Iluvatar CoreX
BI-V150
300 TFLOPs
Intel
Data Center GPU Max 1100
177 TFLOPs
Intel
Data Center GPU Max 1550
419 TFLOPs
Intel Habana
Gaudi 2
432 TFLOPs
Intel Habana
Gaudi 3
1,835 TFLOPs
Meta
MTIA v1
Microsoft
Maia 100
700 TFLOPs
Moore Threads
MTT S80
NVIDIA
A10
125 TFLOPs
NVIDIA
A100 SXM
312 TFLOPs
NVIDIA
A40
150 TFLOPs
NVIDIA
B200
2,250 TFLOPs
NVIDIA
GB200 NVL72
360,000 TFLOPs
NVIDIA
GB200 Superchip
5,000 TFLOPs
NVIDIA
GeForce RTX 4060 Ti
44.2 TFLOPs
NVIDIA
GeForce RTX 4070
58.2 TFLOPs
NVIDIA
GeForce RTX 4070 Super
71 TFLOPs
NVIDIA
GeForce RTX 4070 Ti
80.2 TFLOPs
NVIDIA
GeForce RTX 4070 Ti Super
88.2 TFLOPs
NVIDIA
GeForce RTX 4080
97.5 TFLOPs
NVIDIA
GeForce RTX 4080 Super
104.4 TFLOPs
NVIDIA
GeForce RTX 4090
165.2 TFLOPs
NVIDIA
GeForce RTX 5070
61.6 TFLOPs
NVIDIA
GeForce RTX 5070 Ti
88 TFLOPs
NVIDIA
GeForce RTX 5080
112.6 TFLOPs
NVIDIA
GeForce RTX 5090
209.5 TFLOPs
NVIDIA
H100 PCIe
1,513 TFLOPs
NVIDIA
H100 SXM
1,979 TFLOPs
NVIDIA
H200 PCIe
1,513 TFLOPs
NVIDIA
H200 SXM
1,979 TFLOPs
NVIDIA
L4
121 TFLOPs
NVIDIA
L40S
733 TFLOPs
NVIDIA
RTX 4000 Ada Generation
53.4 TFLOPs
NVIDIA
RTX 5000 Ada Generation
130.6 TFLOPs
NVIDIA
RTX 6000 Ada Generation
182.2 TFLOPs
NVIDIA
RTX PRO 6000 Blackwell Max-Q
125 TFLOPs
NVIDIA
RTX PRO 6000 Blackwell Server Edition
250 TFLOPs
NVIDIA
RTX PRO 6000 Blackwell Workstation Edition
250 TFLOPs
Qualcomm
Cloud AI 100
50 TFLOPs
Rebellions
ATOM
SambaNova
SN40L
Tenstorrent
Grayskull
200 TFLOPs
Tenstorrent
Wormhole
364 TFLOPs
Maximum of 5 XPUs can be compared at once. Deselect one to add another.
Multi-Metric Comparison
Relative performance across 5 key metrics (normalized to 100 = best in comparison)
Compute Performance (BF16)
Memory Capacity
Power Consumption
Power Efficiency
Specifications
| Specification | AMD MI250X | AMD MI210 | Google TPU v4 | Alibaba Hanguang 800 | AWS Trainium |
|---|---|---|---|---|---|
| Architecture | CDNA 2 | CDNA 2 | TPU v4 | XuanTie | Inferentia/Trainium |
| Form Factor | OAM | PCIe | Mezzanine | — | — |
| VRAM | 128 GB | 64 GB | — | — | 32 GB |
| Memory Bandwidth | 3,277 GB/s | 1,638 GB/s | — | — | — |
| TFLOPs (FP32) | 47.9 | 45.3 | — | — | — |
| TFLOPs (FP16) | 383 | 181 | — | — | — |
| TFLOPs | 383 | 181 | 275 | — | 190 |
| TFLOPs (FP8) | — | — | — | — | — |
| TDP | 560 W | 300 W | 300 W | 150 W | 200 W |
| Launch Date | Nov 2021 | Jan 2022 | May 2021 | Sep 2019 | Nov 2021 |
Efficiency Metrics
| Metric | MI250X | MI210 | TPU v4 | Hanguang 800 | Trainium |
|---|---|---|---|---|---|
| TFLOPs per Watt (FP32-eq) | 0.34 | 0.30 | 0.46 | — | 0.47 |
| Memory Bandwidth per GB | 25.6 GB/s | 25.6 GB/s | — | — | — |
Performance Equivalence
How many units of each GPU are needed to match the performance of the others?
To match 1x AMD MI250X
To match 1x AMD MI210
To match 1x Google TPU v4
To match 1x AWS Trainium
Pricing
| Price Type | MI250X | MI210 | TPU v4 | Hanguang 800 | Trainium |
|---|---|---|---|---|---|
| CAPEX (Street Price) | $12,000 | $6,000 | — | — | — |
| OPEX (per hour) | $2.00/hr | — | $3.00/hr | — | — |
| Price per TFLOPs (FP32-eq) | $63 | $66 | — | — | — |