Examples
Training
TRL
Fine-tune Llama 3.1 8B with SFT and QLoRA, single-node or distributed across multiple nodes.
Axolotl
Fine-tune Llama models with FSDP and QLoRA, single-node or distributed across multiple nodes.
Ray+RAGEN
Fine-tune an agent on multiple nodes with RAGEN, verl, and Ray.
Clusters
GCP
Set up GCP A4 and A3 clusters with optimized networking
AWS
Set up AWS EFA clusters with optimized networking
Lambda
Set up Lambda clusters with optimized networking
Crusoe
Set up Crusoe clusters with optimized networking
Nebius
Set up Nebius clusters with optimized networking
NCCL/RCCL tests
Run multi-node NCCL/RCCL tests with MPI
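The standard `all_reduce_perf` binary from nccl-tests (or its rccl-tests counterpart) is typically launched across nodes with `mpirun`, one MPI rank per GPU. A minimal sketch of composing that invocation, assuming a local nccl-tests build at `./build/all_reduce_perf` and placeholder host names:

```python
# Sketch: build a multi-node all_reduce_perf invocation run via MPI.
# Host names, GPU count, and the binary path are assumptions for illustration.
import shlex

def nccl_allreduce_cmd(hosts, gpus_per_node=8, max_bytes="8G"):
    """Compose an mpirun command for nccl-tests' all_reduce_perf."""
    nranks = len(hosts) * gpus_per_node  # one MPI rank per GPU
    return [
        "mpirun", "-np", str(nranks),
        "-H", ",".join(f"{h}:{gpus_per_node}" for h in hosts),
        "--bind-to", "none",
        "./build/all_reduce_perf",  # path into a local nccl-tests checkout
        "-b", "8", "-e", max_bytes,  # sweep message sizes from 8 B up to max
        "-f", "2",                   # double the message size each step
        "-g", "1",                   # one GPU per rank
    ]

cmd = nccl_allreduce_cmd(["node-0", "node-1"])
print(shlex.join(cmd))
```

The reported "busbw" column from the sweep is the usual figure of merit for validating inter-node networking after cluster setup.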
Inference
SGLang
Deploy Qwen3.6-27B with SGLang
vLLM
Deploy Qwen3.6-27B with vLLM
NIM
Deploy a DeepSeek distilled model with NIM
TensorRT-LLM
Deploy Qwen3 with TensorRT-LLM
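All four of these serving stacks can expose an OpenAI-compatible `/v1/chat/completions` route, so a single client works against whichever backend is deployed. A minimal sketch using only the standard library; the base URL and model name are placeholders, not values from any example above:

```python
# Sketch: build a chat-completions request for an OpenAI-compatible server
# (SGLang, vLLM, NIM, and TensorRT-LLM deployments commonly expose this API).
# The endpoint URL and model name below are assumptions for illustration.
import json
import urllib.request

def chat_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Return a POST request for the /v1/chat/completions endpoint."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 256,
    }).encode()
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = chat_request("http://localhost:8000", "placeholder-model", "Hello!")
# Send with urllib.request.urlopen(req) once a server is actually running.
print(req.full_url)
```

Because the request shape is identical across backends, switching from, say, vLLM to SGLang usually only requires changing the base URL and model name.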
Models
DeepSeek V4
Deploy DeepSeek V4 with SGLang on B200:8
Qwen 3.6
Deploy Qwen3.6-27B with SGLang on NVIDIA or AMD