> ## Documentation Index > Fetch the complete documentation index at: https://docs.livepeer.org/llms.txt > Use this file to discover all available pages before exploring further. # GPU Support Matrix > NVIDIA GPU compatibility, NVENC/NVDEC session limits, driver requirements, and CUDA versions for Livepeer orchestrators. export const CustomDivider = ({color = "var(--lp-color-border-default)", middleText = "", spacing = "default", style = {}, className = "", ...rest}) => { const spacingPresets = { default: { margin: "24px 0" }, overlap: { margin: "-1rem 0 -1rem 0" }, tight: { margin: "0 0 -1rem 0" }, section: { margin: "0 0 -2rem 0" }, sectionOverlap: { margin: "-1rem 0 -2rem 0" }, deepOverlap: { margin: "-1rem 0 -1.5rem 0" } }; const spacingStyle = spacingPresets[spacing] || spacingPresets.default; return

{middleText && <> {middleText} }

; }; export const TableCell = ({children, align = "left", header = false, style = {}, className = "", ...rest}) => { const Component = header ? "th" : "td"; return {children} ; }; export const TableRow = ({children, header = false, hover = false, style = {}, className = "", ...rest}) => { const rowId = `table-row-${Math.random().toString(36).substr(2, 9)}`; return <> {hover && } {children} ; }; export const StyledTable = ({children, variant = "default", style = {}, className = "", ...rest}) => { const wrapperVariants = { default: { border: "1px solid var(--lp-color-border-default)", backgroundColor: "var(--lp-color-bg-card)", overflow: "hidden" }, bordered: { border: "2px solid var(--lp-color-accent)", backgroundColor: "var(--lp-color-bg-page)", overflow: "hidden" }, minimal: { border: "none", backgroundColor: "transparent", overflow: "visible" } }; return

{children}

; }; Livepeer orchestrators use NVIDIA GPUs for video transcoding (NVENC/NVDEC hardware encoders) and AI inference (CUDA cores / Tensor cores). GPU compatibility, session limits, and driver requirements. ## Supported GPU Families go-livepeer requires NVIDIA GPUs with NVENC and NVDEC support. AMD and Intel GPUs are not supported. GPU Family Transcoding AI Inference Notes **GeForce RTX 40xx** (Ada Lovelace) Yes Yes Best consumer option. AV1 encode support. **GeForce RTX 30xx** (Ampere) Yes Yes Widely used by orchestrators. Good price-performance. **GeForce RTX 20xx** (Turing) Yes Yes Supported but older. **GeForce GTX 16xx** (Turing) Yes Limited No Tensor cores – AI inference slower or unsupported for some pipelines. **GeForce GTX 10xx** (Pascal) Yes Limited Legacy. NVENC Gen 6. No Tensor cores. **Tesla T4** Yes Yes Data centre card. 16 GB VRAM. Common in cloud. **Tesla V100** Yes Yes Data centre. 16/32 GB VRAM. **A100** Yes Yes Data centre. 40/80 GB VRAM. Highest throughput. **A10 / A10G** Yes Yes Cloud-optimised (AWS G5, etc.). 24 GB VRAM. **L4** Yes Yes Ada Lovelace data centre. 24 GB VRAM. Good for AI. **L40 / L40S** Yes Yes 48 GB VRAM. High-end AI and transcoding. **H100** Transcoding works but overkill Yes 80 GB VRAM. Primarily for LLM and large model inference. ## NVENC Session Limits Consumer NVIDIA GPUs enforce a hard limit on concurrent NVENC encoding sessions. This directly limits how many simultaneous transcoding streams your orchestrator can handle per GPU. GPU Class Default NVENC Session Limit Notes GeForce GTX 10xx 2 Can be patched GeForce GTX 16xx 3 Can be patched GeForce RTX 20xx 3 Can be patched GeForce RTX 30xx 3–5 (varies by model) Can be patched GeForce RTX 40xx 3–8 (varies by model) Can be patched Tesla / Quadro / A-series Unlimited No session limit on professional and data centre cards ### Removing the Session Limit The community-maintained [nvidia-patch](https://github.com/keylase/nvidia-patch) removes the NVENC session limit on consumer GPUs. This is widely used by Livepeer orchestrators and pool operators (Titan Node uses this in their worker setup). ```bash icon="terminal" theme={"theme":{"light":"github-light","dark":"dark-plus"}} # Example (Linux) — always check the repo for current instructions git clone https://github.com/keylase/nvidia-patch.git cd nvidia-patch bash patch.sh ``` Patching the NVIDIA driver modifies a system binary. This is not officially supported by NVIDIA. After driver updates, you must re-apply the patch. Some cloud providers (AWS, GCP) may not allow driver patching on managed GPU instances. ## CUDA and Driver Requirements Component Minimum Version Notes NVIDIA Driver 525+ ` ` CUDA Toolkit 12.0+ ` ` NVIDIA Container Toolkit Latest Required for Docker-based deployments (AI Runner, containerised orchestrator) ### Checking Your Versions ```bash icon="terminal" theme={"theme":{"light":"github-light","dark":"dark-plus"}} # Driver version nvidia-smi # CUDA version nvcc --version # Docker GPU access docker run --gpus all nvidia/cuda:12.0-base nvidia-smi ``` ## VRAM Requirements by Workload Workload Minimum VRAM Recommended Notes Video transcoding only 4 GB 8 GB NVENC/NVDEC uses minimal VRAM Batch AI (single warm model) 8 GB 16 GB Depends on model size – SDXL needs \~7 GB Batch AI (multiple warm models) 16 GB 24 GB+ Each warm model consumes VRAM simultaneously LLM inference (quantised) 8 GB 16 GB Via Ollama runner with quantised weights LLM inference (full precision) 24 GB+ 48 GB+ Large language models at full precision Real-time AI (ComfyStream) 12 GB 16 GB+ Latency-sensitive – VRAM headroom improves stability For detailed per-pipeline VRAM planning, see the [Model and Demand Reference](/v2/orchestrators/guides/ai-and-job-workloads/model-demand-reference). ## GPU Selection Guidance Any supported NVIDIA GPU works. For cost efficiency, an RTX 3060 12GB or RTX 4060 Ti 16GB provides good transcoding throughput at low power draw. Patch the NVENC limit to handle more concurrent sessions. **Budget pick:** GTX 1660 Super (6 GB) – cheapest entry for transcoding-only. 16 GB VRAM minimum. RTX 4070 Ti Super (16 GB) or RTX 3090 (24 GB) are common choices. 24 GB allows running 2–3 warm AI models alongside transcoding. **Best value:** RTX 3090 24 GB – widely available used, high VRAM, strong community track record. 24 GB+ VRAM. For LLM inference at reasonable speed, RTX 4090 (24 GB) or data centre cards (A100, L40S). Multiple-GPU setups for serving many warm models. **Production pick:** A100 40/80 GB or L40S 48 GB in a data centre. ## See Also