Main GPU Hub / Hardware + Execution + Performance + Tools

For AI Developers

GPU HUB
START HERE

This is your main GPU page. From here you can jump into architecture, execution internals, performance analysis, and practical GPU tools. The UI is intentionally clean white while keeping the technical style.

GPU board illustration for GPU Hub

Learning Tracks

7 sections

Interactive Tools

15+ tools

Supported GPUs

50+ GPUs

GPU LEARNING PATH

How to use this hub

  1. 1. Learn the hardware and memory basics if you are still building intuition.
  2. 2. Use the VRAM and GPU tools once you have a model or workload in mind.
  3. 3. Validate performance bottlenecks with the roofline and occupancy tools when tuning kernels or inference.
  4. 4. Move between learning pages and tools instead of treating them as separate tracks.

Why this matters for AI teams

Model choice without hardware understanding leads to expensive mistakes. This hub exists to connect architecture concepts with the decisions teams actually make around VRAM, latency, throughput, and cost.

The goal is not to memorize GPU theory. It is to make better deployment calls with less guesswork.

PRECISION TOOLS

HARDWARE INDEX

NVIDIA H100 SXM5

Large-scale training

Arch

Hopper

VRAM

80GB HBM3

FP16

1,979

Mem BW

3.3 TB/s

Compute

9.0

Best For

Large-scale training

NVIDIA A100 SXM4

General DL workloads

Arch

Ampere

VRAM

80GB HBM2e

FP16

312

Mem BW

2.0 TB/s

Compute

8.0

Best For

General DL workloads

RTX 4090

Local inference

Arch

Ada Lovelace

VRAM

24GB GDDR6X

FP16

82.6

Mem BW

1.0 TB/s

Compute

8.9

Best For

Local inference

L40S

Multi-model serving

Arch

Ada Lovelace

VRAM

48GB GDDR6

FP16

183

Mem BW

0.8 TB/s

Compute

8.9

Best For

Multi-model serving

HardwareExecutionPerformanceTools

Start here if you are new

Follow Physical Hardware, Memory Hierarchy, then Execution Model before jumping into tuning tools. That sequence makes the later pages much easier to interpret.

Start here if you already deploy models

Open the VRAM calculator and GPU picker first, then use the learning pages only where you need more explanation about bottlenecks or architecture tradeoffs.