Skip to content

TPU

GitHub Discord

dstack

dstackai/dstack

Home
Docs
Docs
- Getting started
  Getting started
- Concepts
  Concepts
  - Backends
  - Dev environments
  - Tasks
  - Services
  - Fleets
  - Volumes
  - Repos
  - Projects
  - Gateways
- Guides
  Guides
- Reference
  Reference
  - .dstack.yml
    .dstack.yml
    
    dev-environment
    
    task
    
    service
    
    fleet
    
    gateway
    
    volume
  - server/config.yml
  - CLI
    CLI
    
    dstack server
    
    dstack init
    
    dstack apply
    
    dstack delete
    
    dstack ps
    
    dstack stop
    
    dstack attach
    
    dstack logs
    
    dstack metrics
    
    dstack project
    
    dstack fleet
    
    dstack offer
    
    dstack volume
    
    dstack gateway
  - API
    API
    
    Python API
    
    REST API
  - Environment variables
  - Plugins
    Plugins
    
    REST Plugin API
Examples
Examples
- Examples
- Single-node training
  Single-node training
  - TRL
  - Axolotl
- Distributed training
  Distributed training
  - TRL
  - Axolotl
  - Ray+RAGEN
- Clusters
  Clusters
- Inference
  Inference
  - SGLang
  - vLLM
  - TGI
  - NIM
  - TensorRT-LLM
- Accelerators
  Accelerators
  - AMD
  - TPU
  - Intel Gaudi
  - Tenstorrent
- Misc
  Misc
  - Docker Compose
Blog
Blog
- Blog
- Archive
  Archive
  - 2025
  - 2024
- Categories
  Categories
  - AMD
  - ARM
  - Benchmarks
  - Case studies
  - Cloud fleets
  - Dev environments
  - Intel Gaudi
  - Metrics
  - NVIDIA
  - SSH fleets
  - TPU TPU
    Table of contents
    
    Using TPUs for fine-tuning and deploying LLMs
  - Volumes

TPU¶

September 10, 2024
in TPU
4 min read

Using TPUs for fine-tuning and deploying LLMs

If you’re using or planning to use TPUs with Google Cloud, you can now do so via dstack. Just specify the TPU version and the number of cores (separated by a dash), in the gpu property under resources.

Read below to find out how to use TPUs with dstack for fine-tuning and deploying LLMs, leveraging open-source tools like Hugging Face’s Optimum TPU and vLLM .