BLACKWELL ARCHITECTURE

NVIDIA
B300
BLACKWELL GPU

The next generation of AI compute. 192GB HBM3e, 4,500 TFLOPS FP8, and 8.0 TB/s of memory bandwidth: 2.4× the VRAM and 2.3× the FP8 throughput of H100.

Starting From
$6.71
CAD / HR / GPU
≈ $4.90 USD at current rates
192GB HBM3e — largest in class
No contracts or minimums
Hourly billing in CAD
Deploy in under 60 seconds
99.9% uptime SLA
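Because billing is hourly and per-GPU, total cost is simple arithmetic. A minimal budgeting sketch, assuming the ≈0.73 CAD-to-USD rate implied by the figures above (exchange rates fluctuate, so check a live quote):

```python
# Rough cost estimator for B300 rentals billed hourly in CAD.
# CAD_TO_USD is an assumption implied by the $6.71 CAD / ~$4.90 USD
# figures above; it changes daily.

CAD_PER_HOUR = 6.71
CAD_TO_USD = 0.73  # assumption; check live rates

def cost(hours: float, gpus: int = 1) -> tuple[float, float]:
    """Return (CAD, USD) cost for a job of `hours` across `gpus` GPUs."""
    cad = CAD_PER_HOUR * hours * gpus
    return cad, cad * CAD_TO_USD

# Example: a 72-hour fine-tuning run on 8 GPUs.
cad, usd = cost(hours=72, gpus=8)
print(f"${cad:,.2f} CAD  (~${usd:,.2f} USD)")
# -> $3,864.96 CAD  (~$2,821.42 USD)
```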
// Generation Leap

B300 VS H100
THE UPGRADE

NVIDIA B300 (BLACKWELL)
192GB
HBM3e VRAM · 2.4× H100
4,500
TFLOPS FP8 · 2.3× H100
8.0
TB/s Memory Bandwidth · 2.4× H100
NVIDIA H100 (HOPPER)
80GB
HBM3 VRAM
1,979
TFLOPS FP8
3.35
TB/s Memory Bandwidth
// Technical Specifications

B300
FULL SPECS

SPECIFICATION             B300 BLACKWELL
Architecture              NVIDIA Blackwell
GPU Memory                192 GB HBM3e
Memory Bandwidth          8.0 TB/s
FP8 Performance           4,500 TFLOPS
FP16 / BF16 Performance   2,250 TFLOPS
FP4 Performance           9,000 TFLOPS
NVLink Bandwidth          1,800 GB/s
TDP (Power)               1,000W
Transformer Engine        Yes (4th Gen)
Second-Gen Sparsity       Yes
SScoreCompute Price       $6.71 CAD/hr
Cloud Platform            AWS + Azure
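The generation-leap multiples quoted earlier fall straight out of this table. A quick sanity check, using only the spec values on this page:

```python
# Derive the B300-vs-H100 multiples from the raw specs quoted above.
B300 = {"vram_gb": 192, "fp8_tflops": 4500, "bw_tbps": 8.0}
H100 = {"vram_gb": 80, "fp8_tflops": 1979, "bw_tbps": 3.35}

for key, label in [("vram_gb", "VRAM"), ("fp8_tflops", "FP8"), ("bw_tbps", "Bandwidth")]:
    print(f"{label}: {B300[key] / H100[key]:.1f}x H100")
# VRAM: 2.4x H100
# FP8: 2.3x H100
# Bandwidth: 2.4x H100
```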
// Ideal Workloads

WHAT THE B300
UNLOCKS

🧠
FRONTIER MODEL INFERENCE
Serve 405B+ parameter models with far fewer GPUs than H100 or H200 deployments require. 192GB of VRAM per GPU shrinks the cluster needed to run the largest open-source models at full precision (see the sizing sketch after this section).
⚡
HIGH-THROUGHPUT SERVING
4,500 TFLOPS FP8 delivers more than double the inference throughput of H100 for production LLM serving at scale.
🔬
RESEARCH & PRETRAINING
Train larger models faster with 4th-gen Transformer Engine and second-gen sparsity support. Ideal for teams pushing model capability boundaries.
🤖
AGENTIC AI AT SCALE
Run multiple large models simultaneously for complex multi-agent pipelines. 192GB VRAM + 1,800 GB/s NVLink enables agent orchestration at scale.
🎬
VIDEO & MULTIMODAL AI
Generate and process high-resolution video, large multimodal models, and scientific simulation workloads that demand maximum VRAM and bandwidth.
💊
SCIENTIFIC COMPUTING
Drug discovery, protein folding, genomics, and climate modelling — workloads that require sustained high-precision floating point at massive scale.
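To make the frontier-inference card above concrete, here is a back-of-envelope sizing sketch. It counts weight memory only (KV cache, activations, and framework overhead all add more), so treat the GPU counts as rough lower bounds, not a deployment plan; `min_gpus` is a hypothetical helper for illustration:

```python
import math

# Minimum GPUs needed just to hold model weights, ignoring KV cache,
# activations, and runtime overhead (real deployments need headroom).
def min_gpus(params_b: float, bytes_per_param: float, vram_gb: int) -> int:
    weights_gb = params_b * bytes_per_param  # 1B params at 1 byte = 1 GB
    return math.ceil(weights_gb / vram_gb)

for precision, bpp in [("BF16", 2.0), ("FP8", 1.0), ("FP4", 0.5)]:
    b300 = min_gpus(405, bpp, vram_gb=192)
    h100 = min_gpus(405, bpp, vram_gb=80)
    print(f"405B @ {precision}: {b300} x B300 vs {h100} x H100 (weights only)")
# 405B @ BF16: 5 x B300 vs 11 x H100 (weights only)
# 405B @ FP8:  3 x B300 vs  6 x H100 (weights only)
# 405B @ FP4:  2 x B300 vs  3 x H100 (weights only)
```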
// Performance

B300 BENCHMARK
RESULTS

LLM Inference: Output Tokens per Second (LLaMA-3 70B)
B300 (FP8) · 3,840 tok/s · Best
H200 (FP8) · 2,200 tok/s
H100 (FP8) · 1,680 tok/s

Time to First Token (TTFT): Lower Is Better
B300 · 17 ms · Best
H100 · 42 ms · Baseline
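Those throughput numbers translate directly into serving cost. A minimal sketch, assuming a single GPU sustains the benchmark rate around the clock, which is an optimistic ceiling since real traffic is burstier:

```python
# Cost per million output tokens at the quoted B300 rate, assuming the
# LLaMA-3 70B benchmark throughput above is sustained continuously
# (an optimistic ceiling; bursty real-world traffic costs more per token).
CAD_PER_HOUR = 6.71
TOKENS_PER_SECOND = 3840  # B300 FP8, LLaMA-3 70B benchmark above

tokens_per_hour = TOKENS_PER_SECOND * 3600  # 13,824,000 tok/hr
cad_per_million = CAD_PER_HOUR / tokens_per_hour * 1_000_000
print(f"~${cad_per_million:.3f} CAD per 1M output tokens")  # ~$0.485
```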

READY TO DEPLOY
B300 BLACKWELL?

The most powerful per-GPU compute available on SScoreCompute. CAD pricing, no contracts, live in 60 seconds.

Deploy B300 Now →
Get Enterprise Pricing