GETTING
STARTED.
Welcome to SScoreCompute. Deploy NVIDIA H100, H200, B300 and GB300 NVL72 GPUs on AWS and Azure in under 60 seconds. Billed per hour in CAD.
QUICK START
Create Your Account
Sign up at sscorecompute.com/signup. No credit card required. Free to create an account.
Choose Your GPU
Select H100, H200, B300 or GB300 NVL72. Pick your cloud (AWS or Azure) and region. All priced in CAD per hour.
Deploy in 60 Seconds
Click Deploy. Your GPU instance is ready in under 60 seconds. SSH in or connect via our web terminal.
Run Your Workload
Install your framework, upload your model, and start training or inference. Stop the instance when done — you only pay for what you use.
AVAILABLE GPUS
| GPU | ARCH | VRAM | BANDWIDTH | CAD/HR | BEST FOR |
|---|---|---|---|---|---|
| H100 | Hopper | 80GB HBM3 | 3.35 TB/s | $3.14 | Inference, RAG |
| H200 | Hopper | 141GB HBM3e | 4.8 TB/s | $4.10 | Large models |
| B300 | Blackwell | 192GB HBM3e | 8.0 TB/s | $6.71 | Agentic AI |
| GB300 NVL72 | Blackwell Ultra | 13.8TB pooled | 130 TB/s | $409 | Training |
PYTORCH QUICKSTART
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124
# Verify GPU is available
python -c "import torch; print(torch.cuda.get_device_name(0))"
# Output: NVIDIA H100 80GB HBM3
VLLM — LLM INFERENCE
Deploy LLaMA, Mistral, Qwen or any HuggingFace model with vLLM for high-throughput inference on SScoreCompute GPUs.
pip install vllm
# Serve LLaMA 3 70B on H200
python -m vllm.entrypoints.openai.api_server \
--model meta-llama/Meta-Llama-3-70B-Instruct \
--tensor-parallel-size 1 \
--max-model-len 8192 \
--port 8000
HUGGING FACE TRANSFORMERS
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
model_id = "meta-llama/Meta-Llama-3-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
model_id, torch_dtype=torch.bfloat16, device_map="cuda"
)
FINE-TUNING WITH LORA
from peft import LoraConfig, get_peft_model
from trl import SFTTrainer
lora_config = LoraConfig(
r=16, lora_alpha=32,
target_modules=["q_proj", "v_proj"],
lora_dropout=0.05, bias="none"
)
BILLING IN CAD
All SScoreCompute instances are billed per hour in Canadian dollars. Billing starts when an instance is provisioned and stops when it is terminated. There is no minimum spend and no contracts.
| GPU | CAD/HR | CAD/DAY (8HRS) | CAD/MONTH (8HRS/DAY) |
|---|---|---|---|
| H100 | $3.14 | $25.12 | $753.60 |
| H200 | $4.10 | $32.80 | $984.00 |
| B300 | $6.71 | $53.68 | $1,610.40 |
| GB300 NVL72 | $409 | $3,272 | $98,160 |