Cerebras WSE-3

Wafer Scale Engine Architecture

Active

Launched March 2024

Core Specifications

VendorCerebras
ArchitectureWafer Scale Engine
Form Factor
VRAM44 GB
Memory Bandwidth
TDP23000 W

Compute Performance

PrecisionTFLOPs

Performance Benchmarks

llm inference

ConfigurationPrecisionPerformanceSource
LLaMA 70B, ultra-low latency16,000 tokens_per_secondView

llm train

ConfigurationPrecisionPerformanceSource
GPT-3 175B, single CS-3 system120 hours_to_trainView
GPT-3 175B, weight streaming architecture25,000 tokens_per_secondView

Quick Stats

Similar XPUs

View other Cerebras GPUs or compare across vendors