Skip to main content

Overview

Fluidstack Atlas provides high-performance compute instances designed to meet the needs of inference and fine-tuning workloads. All compute instances are either bare-metal servers or highly tuned kvm virtual machines.

Instance Types

Fluidstack Atlas supports a variety of CPU and GPU-enabled instance types.

HGX Instance Types

Accelerate multi-GPU inference and training with 8 NVIDIA GPUs connected by a high-bandwidth NVLink fabric. All HGX instances include balanced CPU, RAM, and front-end network bandwidth for fast data loading, pre-processing, and checkpointing. For more specific details about the compute capabilities of the NVIDIA HGX platform, check out the official docs here.

Instance TypeDescription
b300-hgx-288gb.8x8 NVIDIA B300 GPUs with 288 GB memory each.
b200-hgx-192gb.8x8 NVIDIA B200 GPUs with 192 GB memory each.
h200-hgx-141gb.8x8 NVIDIA H200 GPUs with 141 GB memory each.
h100-hgx-80gb.8x8 NVIDIA H100 GPUs with 80 GB memory each.
a100-hgx-80gb.8x8 NVIDIA A100 GPUs with 80 GB memory each.

CPU Instance Types

Use small CPU-only instance types as a bastion hosts or control plane node or application specific general purpose compute.

Instance TypeDescription
cpu.2x2 CPUs and 4 GiB RAM
cpu.4x4 CPUs and 8 GiB RAM
cpu.8x8 CPUs and 16 GiB RAM