Skip to content

Latest commit

 

History

History
363 lines (254 loc) · 15.5 KB

opencost-specv01.md

File metadata and controls

363 lines (254 loc) · 15.5 KB

OpenCost Specification

The OpenCost Spec is a vendor-neutral specification for measuring and allocating infrastructure and container costs in Kubernetes environments.

Introduction

Kubernetes enables complex deployments of containerized workloads, which are often transient and consume variable amounts of cluster resources. While this enables teams to construct powerful solutions to a broad range of technical problems, it also creates complexities when measuring the resource utilization and costs of workloads and their associated infrastructure within the dynamics of shared Kubernetes environments.

As Kubernetes adoption increases within an organization, these complexities become a business-critical challenge to solve. In this document, we specify a vendor-agnostic methodology for accurately measuring and allocating the costs of a Kubernetes cluster to its hosted tenants. This community resource is maintained by Kubernetes practitioners and we welcome all contributions.

Foundational definitions

Total Cluster Costs represent all costs required to operate a Kubernetes cluster. Cluster Assets Costs are the portion of these costs that are related to directly observable entities within a cluster; these include expenses from nodes, persistent volumes, attached disks, load balancers, and network ingress/egress costs. From a financial accounting perspective, these are equivalent to the Cost of Goods Sold when measuring product costs. Cluster Overhead Costs measure the overhead required to operate all of the Assets of a cluster, e.g. Cluster Management Fees. These are the equivalent to Selling, General and Administrative Expenses (SG&A), or indirect costs, when viewed from a financial accounting perspective.

Total Cluster Costs = Cluster Asset Costs + Cluster Overhead Costs

Cluster Asset Costs can be further segmented into Resource Allocation Costs and Resource Usage Costs. Resource Allocation Costs are expenses that accumulate based on the amount of time provisioned irrespective of usage (e.g. CPU hourly rate) whereas Resource Usage Costs accumulate on a per-unit basis (e.g. cost per byte egressed). Costs for an individual Asset are the summation of it’s Resource Allocation and Usage Costs, e.g. a Node’s cost is equal to CPU cost + GPU cost + RAM cost + Node Network Costs

Total Cluster Costs = Resource Allocation Costs

(for all assets)

+ Resource Usage Costs

(for all assets)

+ Cluster Overhead Costs

(for cluster)

The following chart shows these relationships:

image4

While billing models can differ by environment, below are common examples of segmentation by Allocation, Usage and Overhead Costs.

image1

Once calculated, these Asset Costs can then be distributed to the tenants that consume them, where Workload Costs plus Idle Costs equals Asset Costs. Workload costs are expenses that can be directly attributed to a set of Kubernetes workloads, e.g. a container, pod, deployment, etc. Cluster Idle Costs are the portion of Resource Allocation Costs that are not allocated to any workload1.

Total Cluster Costs = Workload Costs + Cluster Idle Costs + Cluster Overhead Costs

The following chart shows these relationships:

image2

Cluster Asset Costs

Cluster Assets are observable entities within a Kubernetes cluster that directly incur costs related to their resources. Asset Costs consist of Resource Allocation Costs and Resource Usage Costs. Every Asset conforming to this specification MUST include at least one cost component with Amount, Unit and Rate attributes as well as a TotalCost value.

Attributes for measured Resource Allocation Costs:

  • [float] Amount - the amount of resource reserved by the asset, e.g. 2 CPU cores
  • [float] Duration - time between the start and end of the allocation period measured in hours, e.g. 24 hours
  • [string] Unit - the amount’s unit of measurement, e.g. CPU cores
  • [float] HourlyRate - cost per one unit hour, e.g. $0.2 per CPU hourly rate
  • [float] Total Cost - defined as Amount * Duration * HourlyRate

Attributes for measured Resource Usage Costs:

  • [float] Amount - the amount of resource used, e.g. 1GB of internet egress
  • [string] Unit - the amount’s unit of measurement, e.g. GB
  • [float] UnitRate - cost per unit, e.g $ per GB egressed
  • [float] TotalCost - defined as Amount * UnitRate

Below are example inputs when measuring asset costs over a designated time window (e.g. 24 hours) with common billing models:

  • Nodes
    • CPU allocation costs
      • cores = avg_over_time(cpu) by (node) [cores]
      • duration = end running- start running [hrs]
      • price = provider defined or custom pricing sheet [$/core-hr] (see Appendix A for more details)
      • total cost = cores * duration * price [$]
    • RAM allocation costs
      • ram bytes = avg_over_time(GB) by (node) [ram GBs]
      • duration = end running- start running [hrs]
      • price = provider defined or custom pricing sheet [$/GB-hr] (see Appendix A for more details)
      • total cost = ram bytes * duration * price [$]
  • Persistent Volumes
    • Disk Size = avg_over_time(GB) by (pv) [disk GBs]
    • Price = provider defined or custom pricing sheet [$/GB-hr] (see Appendix A for more details) typically a function of disk class, IOPS, backup size
    • Persistent storage attached to the pod-level
  • Attached disks
    • Disk Size = avg_over_time(GB) by (pv) [disk GBs]
    • Price = provider defined or custom pricing sheet [$/GB-hr] (see Appendix A for more details) typically a function of disk class, IOPS, backup size
    • Ephemeral storage used per pod on node
  • Load balancers
    • Usage costs
      • amount = bytes ingressed
      • price = $ per byte ingressed
    • Allocation costs
      • rules = # of forwarding rules defined
      • price = average $ per forwarding rule
  • Overhead Costs
    • Cluster management fees: provider fees typically charged on an hourly basis
    • Operator fees: potential DevOps team costs allocated to cluster operations

Workload Costs

Workloads are defined as entities to which Asset Costs are committed. Some resources solely have Usage Costs, but others have Allocation Costs independent of actual usage. Workload Costs should be understood as max(request, usage) when Assets have Resource Allocation Costs, e.g. CPU or GPU. This formula effectively assigns costs that have been directly reserved or allocated by kube-scheduler. Workload Costs should be calculated at the lowest level possible, i.e. container level2, and then they can be aggregated by any dimension.

Resource Type Measurement
CPU The greater of requested and used CPU resources measured in cores or millicores.
Memory The greater of requested resources and memory in use measured in bytes or gigabytes.
GPU The greater of requested resources and used resources measured in cores.
Storage Volume The storage capacity of Persistent Volume Claim (PVC) requests measured in bytes or gigabytes. Attached at the Kubernetes pod-level.
Network Amount of ingress or egress across zones, regions, or the wide internet measured in bytes or gigabytes (can vary by provider)
Load Balancer The number of load balancers used plus the volume of connections and bytes (can vary by provider).

The following workload cost aggregations are supported in a complete implementation in the OpenCost Spec:

  • container
  • pod
  • deployment
  • statefulset
  • job
  • controller name
  • controller kind
  • label
  • annotation
  • namespace
  • cluster

Shared Costs

Shared Workload Costs, Cluster Idle Costs, and Overhead Costs are common examples of costs that organizations can optionally distribute amongst tenants. A common example would be system workload costs, e.g. kube-system pods, that benefit all tenants. Common methods for distributing these costs include the following:

  1. Uniformly across other tenants
  2. Proportionate to a tenant's consumption of Cluster Asset costs
  3. Custom metric, e.g. bytes of network egress

A full implementation of the spec should support various methods of distributing shared costs.

Idle Costs

Idle Costs can be calculated at both the Asset/Resource level as well as the Workload level. Asset Idle Costs represent the cost-weighted difference between Cluster Asset Costs and costs of resources being allocated or consumed. Idle Costs and then Idle Percentage can be calculated as follows:

Cluster Idle Cost = ( Cluster Asset Costs - Workload Costs )
Cluster

Idle %

= Idle Cost / Resource Allocation Costs

The following chart shows these relationships: image3

Asset Idle Cost can be calculated by individual assets, groups of assets, cluster(s), and by individual resources, e.g. CPU. Resources that are strictly billed on usage can be viewed to have 100% efficiency but should not be included when measuring idle percentage of a cluster.

Workload Idle Costs is a cost-weighted measurement of requested resources that are unused. Workload Idle Costs can be calculated on any grouping of Kubernetes workloads, e.g. containers, pods, labels, annotations, namespaces, etc.

Pod States

The state of a pod will affect the ability to assign costs and whether a resource is considered allocated. The OpenCost model does not account for resources allocated to pods with ImagePullBackOff.

State Cost Allocation Status
Running Max (Usage, request) Implemented
ImagePullBackOff Request Currently no charge

Glossary

Cluster Assets – Observable entities within a Kubernetes cluster that directly incur costs related to their resources. Examples include nodes, persistent volumes, attached disks, load balancers.

Container - An instance of a container image. You may have multiple copies of the same image running at the same time. More info

Image - A template of a container which contains software (usually microservices) that needs to be run. More info

Server / Instance / Node / Node Pool - A machine (possibly cloud or on-prem, physical or virtual) in this context used by Kubernetes More info

Pod - A Kubernetes specific concept that consists of a group of containers. A pod is treated as a single block of resources that may be scheduled or scaled on a cluster. More info

Container Orchestration - Manages the cluster of server instances and maintains the lifecycle of containers and pods. Scheduling is a function of the container orchestrator which schedules pods/containers to run on a server instance.

Cluster - A group of server instances

Namespace - A Kubernetes concept which creates a ‘virtual’ cluster where pods/containers may be deployed and observed discreetly from other namespaces. More info

Pod Labels - Key / Value pairs which may be used to identify objects that are meaningful to the user. There is no semantic meaning to the core of the system. Labels are typically used where a grouping of multiple namespaces need to be associated with a workload. More info

Appendix A

Various cloud providers supply an hourly resource cost directly in their user billing model.The OpenCost model recommends utilizing the fully Amortized Net Cost for each resource as an input when this is the case. When explicit RAM, CPU or GPU prices are not provided by a cloud provider, the OpenCost model needs to derive these values. The recommendation is to use a scalable ratio of CPU, GPU, RAM and other price inputs. These default values should be based on the marginal resource rates of the provider by family.

One approach for calculating is to ensure the sum of each component is equal to the total price of the Asset (e.g. node) based on billing rates from your provider. When the sum of resources (e.g. RAM/CPU/GPU) cost is greater (or less) than the price of the node, then the ratio between the input prices is held constant but the total value is adjusted.

As an example, you have provisioned a node with 1 GPU, 1 CPU and 1 GB of RAM that costs $35/mo. If your base GPU price is $30, base CPU price is $30, and RAM GB price is $10, based on the average marginal costs across instances in this family class, then these inputs will be normalized to $15 for GPU, $15 for CPU and $5 for RAM so that the sum equals the cost of the node. Note that the price of a GPU, as well as the price of a CPU remain 3x the price of a Gb of RAM.

Appendix B

Sampling Kubernetes resources is recommended with the following metrics / datasources:

  • container_cpu_usage_seconds_total – sample from cAdvisor
  • container_memory_working_set_bytes – sampled from cAdvisor
  • gpu_usage – sampled via chipset specific metrics
  • cpu_requested – sampled from kube API
  • ram_requested – sampled from kube API
  • gpu_requested – sampled from kube API

Appendix C

Working examples of OpenCost data to come!

Notes

Footnotes

  1. Resource usage costs cannot be part of idle cost because they are always used, the corresponding resource never "sits idle."

  2. This is because containers are the smallest identifiable unit of "thing that uses resources." For example, the lowest level of reliable CPU usage information is usually a container.