Workload Rule

devzero_workload_rule configures vertical and horizontal scaling for a specific Kubernetes workload — CPU, memory, GPU, and HPA rules — along with emergency response and per-container overrides.

Example Usage

Auto-generate all fields

resource "devzero_workload_rule" "auto" {
  cluster_id    = "<YOUR_CLUSTER_ID>"
  namespace     = "production"
  kind          = "Deployment"
  name          = "my-api"
  auto_generate = true
}

Manual — full control

resource "devzero_workload_rule" "manual" {
  cluster_id = "<YOUR_CLUSTER_ID>"
  namespace  = "production"
  kind       = "Deployment"
  name       = "my-api"

  action_triggers    = ["on_schedule", "on_detection"]
  cron_schedule      = "0 2 * * *"
  detection_triggers = ["pod_creation", "pod_update", "pod_evict"]

  cpu_rule = {
    enabled                   = true
    min_request               = 10
    max_request               = 32000
    target_percentile         = 0.95
    limits_adjustment_enabled = true
    limit_multiplier          = 1.0
  }

  memory_rule = {
    enabled                   = true
    min_request               = 67108864    # 64Mi in bytes
    max_request               = 68719476736 # 64Gi in bytes
    target_percentile         = 0.95
    limits_adjustment_enabled = true
  }

  hpa_rule = {
    enabled            = true
    min_replicas       = 1
    max_replicas       = 10
    target_utilization = 0.70
    primary_metric     = "cpu"
  }

  emergency_response = {
    oom_enabled               = true
    oom_memory_multiplier     = 1.5
    cpu_throttling_enabled    = true
    cpu_throttling_threshold  = 0.20
    cpu_throttling_multiplier = 1.25
  }

  live_migration_enabled        = false
  use_in_place_vertical_scaling = false
}

Per-container rules

resource "devzero_workload_rule" "per_container" {
  cluster_id = "<YOUR_CLUSTER_ID>"
  namespace  = "production"
  kind       = "Deployment"
  name       = "my-multi-container-app"

  action_triggers    = ["on_detection"]
  detection_triggers = ["pod_creation"]

  containers = [
    {
      container_name = "app"
      cpu_rule = {
        enabled     = true
        min_request = 10
        max_request = 32000
      }
      memory_rule = {
        enabled     = true
        min_request = 67108864    # 64Mi
        max_request = 68719476736 # 64Gi
      }
    },
    {
      container_name = "sidecar"
      cpu_rule = {
        enabled     = true
        max_request = 500
      }
    }
  ]
}

Arguments

Required

Parameter	Type	Description
`cluster_id`	string	ID of the cluster this rule targets
`kind`	string	Kubernetes workload kind
`name`	string	Name of the Kubernetes workload
`namespace`	string	Kubernetes namespace of the workload

Optional

Parameter	Type	Description
`action_triggers`	list(string)	When to apply recommendations: `"on_detection"`, `"on_schedule"`
`auto_generate`	bool	When `true`, the engine generates all rule fields automatically; manual overrides are ignored
`containers`	list(object)	Per-container rule configurations (see Containers)
`cpu_rule`	object	CPU vertical scaling rule (see Resource Rule)
`cron_schedule`	string	5-field UTC cron expression for scheduled application
`defragmentation_schedule`	string	Cron expression for node defragmentation
`detection_triggers`	list(string)	Events that trigger a recommendation: `"pod_creation"`, `"pod_update"`, `"pod_evict"`
`emergency_response`	object	Emergency response for OOM and CPU throttle events (see Emergency Response)
`gpu_rule`	object	GPU vertical scaling rule (see Resource Rule)
`hpa_rule`	object	Horizontal scaling rule (see HPA Rule)
`live_migration_enabled`	bool	Allow live pod migration when applying recommendations
`memory_rule`	object	Memory vertical scaling rule (see Resource Rule)
`scheduler_plugins`	list(string)	Kubernetes scheduler plugins to activate
`use_in_place_vertical_scaling`	bool	Use in-place pod vertical scaling instead of pod restarts

Read-Only

Attribute	Type	Description
`id`	string	Unique identifier of the workload rule

Resource Rule

Used by cpu_rule, memory_rule, and gpu_rule at both workload and per-container level.

Parameter	Type	Description
`enabled`	bool	Enable this resource axis rule
`min_request`	number	Minimum resource request (millicores for CPU, bytes for memory/GPU)
`max_request`	number	Maximum resource request (millicores for CPU, bytes for memory/GPU)
`target_percentile`	number	Percentile of usage data used as the recommendation target (0–1)
`limits_adjustment_enabled`	bool	Whether to also adjust resource limits alongside requests
`limits_removal_enabled`	bool	Actively remove limits from workloads
`limit_multiplier`	number	Multiplier applied to the request to derive the resource limit
`max_scale_up_percent`	number	Maximum percentage increase allowed in a single cycle
`max_scale_down_percent`	number	Maximum percentage decrease allowed in a single cycle

max_scale_up_percent and max_scale_down_percent are available on workload-level cpu_rule, memory_rule, and gpu_rule but not on per-container rules.

HPA Rule

Parameter	Type	Description
`enabled`	bool	Enable horizontal (replica) scaling
`min_replicas`	number	Minimum number of replicas
`max_replicas`	number	Maximum number of replicas
`primary_metric`	string	Primary metric: `"cpu"`, `"memory"`, `"gpu"`, `"network_ingress"`, `"network_egress"`
`target_utilization`	number	Target CPU utilization ratio (0–1)
`target_memory_utilization`	number	Target memory utilization ratio (0–1), tuned independently of CPU
`max_replica_change_percent`	number	Maximum percentage change in replica count per cycle
`composite_formula`	string	Formula combining multiple metric weights. Example: `"0.6cpu + 0.4memory"`
`metrics`	list(object)	Additional metric triggers (see HPA Metrics)
`behavior`	object	Fine-grained scale-up/scale-down policies (see HPA Behavior)
`fallback`	object	Replica fallback when metrics are unavailable (see HPA Fallback)

HPA Metrics

CPU/Memory/Network triggers are auto-generated from primary_metric. Use metrics for additional sources such as Prometheus.

Parameter	Type	Description
`type`	string	Metric source type. Example: `"CPU"`, `"Memory"`, `"prometheus"`
`target_utilization`	string	Target utilization as a decimal string. Example: `"0.70"`
`target_value`	string	Absolute target value as a string. Example: `"50000000"`
`weight`	string	Weight for composite formula scaling (0–1 decimal string). Example: `"0.5"`
`server_address`	string	Prometheus server URL. Example: `"http://prometheus:9090"`
`query`	string	PromQL query string. Example: `"sum(rate(http_requests_total[2m]))"`
`metadata`	map(string)	Free-form key-value metadata for external scalers

HPA Behavior

Parameter	Type	Description
`scale_up`	object	Scale-up behavior (see Scaling Policy)
`scale_down`	object	Scale-down behavior (see Scaling Policy)

Scaling Policy

Parameter	Type	Description
`stabilization_window_seconds`	number	Seconds to wait before acting on a scaling signal to avoid flapping
`select_policy`	string	Which policy wins when multiple match: `"Max"`, `"Min"`, `"Disabled"`
`policies`	list(object)	List of step policies

Each policy object requires:

Parameter	Type	Description
`type`	string	Policy type: `"Pods"` or `"Percent"`
`value`	number	Policy value (pod count or percent)
`period_seconds`	number	Period over which the policy applies

HPA Fallback

Parameter	Type	Required	Description
`replicas`	number	Yes	Number of replicas to fall back to when metrics are unavailable
`behavior`	string	No	Fallback strategy: `"static"`, `"currentReplicas"`, `"currentReplicasIfHigher"`, `"currentReplicasIfLower"`
`failure_threshold`	number	No	Consecutive metric failures before activating fallback

Emergency Response

Parameter	Type	Description
`oom_enabled`	bool	React to OOM kills by increasing memory
`oom_memory_multiplier`	number	Multiplier applied to memory on OOM reaction
`cpu_throttling_enabled`	bool	React to CPU throttling by increasing CPU request
`cpu_throttling_threshold`	number	Throttle ratio threshold that triggers a reaction (0–1)
`cpu_throttling_multiplier`	number	Multiplier applied to CPU request on throttle reaction

Containers

When containers is set, the workload-level cpu_rule, memory_rule, and gpu_rule are ignored. Each container entry accepts:

Parameter	Type	Required	Description
`container_name`	string	Yes	Name of the container this config applies to
`cpu_rule`	object	No	CPU resource rule (see Resource Rule)
`memory_rule`	object	No	Memory resource rule (see Resource Rule)
`gpu_rule`	object	No	GPU resource rule (see Resource Rule)

Workload Rule

On this page