Configure workload rightsizing policies and attach them to clusters with Terraform.

Workload Policies

devzero_workload_policy defines how workloads should be rightsized — CPU and memory vertical scaling, horizontal scaling, and the triggers that activate them. devzero_workload_policy_target attaches a policy to one or more clusters with optional namespace, kind, and workload filters.

WorkloadPolicy

Example

resource "devzero_workload_policy" "cost_saving" {
  name                    = "cost-saving-policy"
  description             = "Rightsize non-critical workloads"
  action_triggers         = ["on_detection", "on_schedule"]
  cron_schedule           = "*/15 * * * *"
  detection_triggers      = ["pod_creation", "pod_update", "pod_evict"]
  loopback_period_seconds = 86400
  cooldown_minutes        = 300
  min_data_points         = 20
  min_change_percent      = 0.2

  cpu_vertical_scaling = {
    enabled                    = true
    target_percentile          = 0.75
    min_request                = 25
    max_scale_up_percent       = 1000
    max_scale_down_percent     = 1
    min_data_points            = 20
    adjust_req_even_if_not_set = true
    limits_removal_enabled     = true
  }

  memory_vertical_scaling = {
    enabled                    = true
    target_percentile          = 1
    min_request                = 134217728
    max_scale_up_percent       = 1000
    max_scale_down_percent     = 1
    overhead_multiplier        = 0.3
    limits_adjustment_enabled  = true
    limit_multiplier           = 1
    min_data_points            = 20
    adjust_req_even_if_not_set = true
    limits_removal_enabled     = false
  }

  enable_pmax_protection = true
  pmax_ratio_threshold   = 3
}

Required Arguments

Parameter	Type	Description
`name`	string	Human-friendly name for the policy

Optional Arguments

Parameter	Type	Description
`description`	string	Free-form description
`action_triggers`	list(string)	When to act: `"on_detection"`, `"on_schedule"`
`cron_schedule`	string	5-field cron expression — required when `"on_schedule"` is set
`detection_triggers`	list(string)	What triggers detection: `"pod_creation"`, `"pod_update"`, `"pod_evict"`
`loopback_period_seconds`	number	Historical data window in seconds
`startup_period_seconds`	number	Grace period after pod starts before scaling
`cooldown_minutes`	number	Minimum wait time between scaling applications
`min_data_points`	number	Global minimum data points required before any recommendation
`min_change_percent`	number	Global minimum change threshold for applying recommendations
`min_vpa_window_data_points`	number	Minimum data points in VPA analysis window
`drift_delta_percent`	number	Percentage drift from baseline that triggers VPA refresh
`stability_cv_max`	number	Maximum coefficient of variation to consider stable
`hysteresis_vs_target`	number	Hysteresis threshold vs target for HPA coordination
`live_migration_enabled`	bool	Allow live migration when applying recommendations
`scheduler_plugins`	list(string)	Kubernetes scheduler plugins to activate
`defragmentation_schedule`	string	Cron expression for background defragmentation
`enable_pmax_protection`	bool	Raise requests to cover observed peak usage when peak-to-recommendation ratio exceeds `pmax_ratio_threshold` (default: `false`)
`pmax_ratio_threshold`	number	Peak-to-recommendation ratio that activates pmax protection (default: `3.0`)
`cpu_vertical_scaling`	object	CPU vertical scaling configuration (see Vertical Scaling)
`memory_vertical_scaling`	object	Memory vertical scaling configuration (see Vertical Scaling)
`gpu_vertical_scaling`	object	GPU vertical scaling configuration (see Vertical Scaling)
`gpu_vram_vertical_scaling`	object	GPU VRAM vertical scaling configuration (see Vertical Scaling)
`horizontal_scaling`	object	Horizontal scaling configuration (see Horizontal Scaling)

Read-Only

Attribute	Type	Description
`id`	string	Unique identifier of the workload policy

Vertical Scaling

Used by cpu_vertical_scaling, memory_vertical_scaling, gpu_vertical_scaling, and gpu_vram_vertical_scaling.

Parameter	Type	Description
`enabled`	bool	Enable or disable vertical scaling for this resource
`target_percentile`	number	Usage percentile to target (e.g. `0.75` for P75, `1` for P100)
`min_request`	number	Lower bound for resource requests (millicores for CPU, bytes for memory)
`max_request`	number	Upper bound for resource requests
`overhead_multiplier`	number	Extra headroom added to recommendations as a fraction (e.g. `0.3` for 30%)
`limit_multiplier`	number	How much higher limits should be vs requests (e.g. `2.0` = 2× the request)
`limits_adjustment_enabled`	bool	Adjust container limits as well as requests
`limits_removal_enabled`	bool	Remove resource limits from workloads (CPU only — memory limits removal is not supported)
`max_scale_up_percent`	number	Maximum percent to scale up in one step
`max_scale_down_percent`	number	Maximum percent to scale down in one step
`min_data_points`	number	Minimum data points required before a recommendation
`adjust_req_even_if_not_set`	bool	Suggest resource requests even if the workload currently has none set (default: `false`)

Horizontal Scaling

Parameter	Type	Description
`enabled`	bool	Enable horizontal scaling
`min_replicas`	number	Minimum number of replicas
`max_replicas`	number	Maximum number of replicas
`primary_metric`	string	Primary metric for HPA decisions
`target_utilization`	number	Target utilization for primary metric (0.0–1.0)
`max_replica_change_percent`	number	Maximum percent replica change in one step
`min_data_points`	number	Minimum data points required for HPA decisions

Import

terraform import devzero_workload_policy.example <workload_policy_id>

WorkloadPolicyTarget

devzero_workload_policy_target attaches a devzero_workload_policy to one or more clusters. You can optionally filter by workload kind, namespace, and name patterns.

Example

resource "devzero_workload_policy_target" "production" {
  name        = "production-target"
  description = "Apply cost-saving policy to production deployments"
  policy_id   = devzero_workload_policy.cost_saving.id
  cluster_ids = [devzero_cluster.production.id]
  priority    = 1
  enabled     = true

  kind_filter = ["Deployment", "StatefulSet"]

  namespace_pattern = {
    pattern = "^prod-"
    flags   = "i"
  }

  workload_selector = {
    match_labels = {
      app = "my-service"
    }
  }
}

Arguments

Parameter	Type	Required	Description
`name`	string	Yes	Human-friendly name for the target
`policy_id`	string	Yes	ID of the `devzero_workload_policy` to attach
`cluster_ids`	list(string)	Yes	List of cluster IDs to apply the policy to
`description`	string	No	Free-form description
`enabled`	bool	No	Whether the target is active (default: `true`)
`priority`	number	No	Evaluation priority when multiple targets overlap — higher values take precedence
`workload_names`	list(string)	No	Explicit list of workload names to include
`node_group_names`	list(string)	No	Restrict matching to specific node groups
`kind_filter`	list(string)	No	Workload kinds to include (see below)
`name_pattern`	object	No	Regex-based workload name matching (`pattern`, `flags`)
`namespace_pattern`	object	No	Regex-based namespace name matching (`pattern`, `flags`)
`namespace_selector`	object	No	Label selector for namespaces (`match_labels`, `match_expressions`)
`workload_selector`	object	No	Label selector for workloads (`match_labels`, `match_expressions`)

Supported kind filter values: Pod, Deployment, StatefulSet, DaemonSet, Job, CronJob, ReplicaSet, ReplicationController, Rollout

name_pattern / namespace_pattern

Parameter	Type	Description
`pattern`	string	Regular expression (RE2 syntax). Example: `^api-(staging\|prod)-.*$`
`flags`	string	Regex flags: `"i"` (case-insensitive), `"m"` (multi-line)

namespace_selector / workload_selector

Parameter	Type	Description
`match_labels`	map(string)	Exact label key/value pairs that must match
`match_expressions`	list(object)	Advanced label selector requirements

Each match_expressions entry:

Parameter	Type	Description
`key`	string	Label key to evaluate
`operator`	string	`In`, `NotIn`, `Exists`, or `DoesNotExist`
`values`	list(string)	Values for `In`/`NotIn`; omit for `Exists`/`DoesNotExist`

Read-Only

Attribute	Type	Description
`id`	string	Unique identifier of the workload policy target

Import

terraform import devzero_workload_policy_target.example <workload_policy_target_id>

Workload Policies

On this page