kubernetes/compute-classes/gpu-spot-capacity.yaml (28 lines of code) (raw):

# Copyright 2024 Google LLC # # Licensed under the Apache License, Version 2.0 (the "License"); # you may not use this file except in compliance with the License. # You may obtain a copy of the License at # # https://www.apache.org/licenses/LICENSE-2.0 # # Unless required by applicable law or agreed to in writing, software # distributed under the License is distributed on an "AS IS" BASIS, # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. # See the License for the specific language governing permissions and # limitations under the License. apiVersion: cloud.google.com/v1 kind: ComputeClass metadata: name: gpu-spot-capacity spec: autoscalingPolicy: consolidationDelayMinutes: 2 # Mark a node as eligible for scale down after 1 minute at 10% utilisation consolidationThreshold: 30 # Very low threshold to encourage node packing nodePoolAutoCreation: enabled: true priorities: - machineFamily: g2 minCores: 8 minMemoryGb: 32 spot: true gpu: type: nvidia-l4 count: 1 - machineFamily: n1 minCores: 8 minMemoryGb: 32 spot: true gpu: type: nvidia-tesla-t4 count: 1 whenUnsatisfiable: ScaleUpAnyway # Required for Autopilot activeMigration: optimizeRulePriority: false # Prioritize moving workloads to optimal instances