Cerebras WSE-3
Wafer Scale Engine Architecture
Active
Launched March 2024
Core Specifications
| Specification | Value |
|---|---|
| Vendor | Cerebras |
| Architecture | Wafer Scale Engine |
| Form Factor | — |
| VRAM | 44 GB |
| Memory Bandwidth | — |
| TDP | 23,000 W |
Compute Performance
| Precision | TFLOPS |
|---|---|
Performance Benchmarks
LLM Inference
| Configuration | Precision | Performance | Source |
|---|---|---|---|
| LLaMA 70B, ultra-low latency | — | 16,000 tokens/s | View |
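
To put the listed figures in perspective, the short sketch below converts the throughput and TDP values into per-token time and an energy-per-token estimate. It assumes, hypothetically, that the 16,000 tokens/s figure comes from a single device drawing its full 23,000 W TDP. The table does not state the hardware configuration or measured power, so treat the energy result as a rough estimate rather than a measured value. The function names are illustrative only.

```python
# Rough back-of-the-envelope figures derived from the spec sheet values.
# ASSUMPTION: one device at full TDP produces the listed throughput;
# the source does not confirm the system configuration or actual power draw.

TDP_WATTS = 23_000          # TDP listed under Core Specifications
TOKENS_PER_SECOND = 16_000  # LLaMA 70B throughput listed under Performance Benchmarks


def seconds_per_token(tokens_per_second: float) -> float:
    """Average time between generated tokens at a given throughput."""
    return 1.0 / tokens_per_second


def joules_per_token(power_watts: float, tokens_per_second: float) -> float:
    """Energy per token if the device drew `power_watts` continuously."""
    return power_watts / tokens_per_second


latency_us = seconds_per_token(TOKENS_PER_SECOND) * 1e6
energy_j = joules_per_token(TDP_WATTS, TOKENS_PER_SECOND)

print(f"Time per token:   {latency_us:.1f} microseconds")
print(f"Energy per token: {energy_j:.4f} J (estimate at full TDP)")
```

Because TDP is a design limit rather than a measured draw, the energy figure is best read as an upper-end estimate under the stated assumption.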