Version: v1.8 🚧

Metrics Collection

Introduction

Koordinator's metrics collection system provides comprehensive monitoring for node/pod resource usage, performance characteristics, and system health. The metrics advisor framework collects, processes, and exports metrics from various system components and specialized devices. This document details the metrics collection architecture, extension interfaces, and Prometheus integration. The framework enables resource optimization through periodic collection of resource profiles and detection of container interference (CPU scheduling delays, memory allocation latency, PSI).

Section sources

Metrics Advisor Framework

The metrics advisor implements a plugin-based architecture for collecting and processing metrics from system components and specialized devices. The framework manages collector lifecycles through centralized orchestration using a shared state mechanism to coordinate between collectors, enabling derived metrics calculation (system resource usage = node usage - pod usage - host app usage).

Designed for easy extension with new collectors for additional metrics or device types. Integrates with statesinformer for pod metadata and metriccache for metric storage. The advisor coordinates collection at configurable intervals while managing collector dependencies.

Metrics Advisor Framework Class Structure:

Core classes and relationships:

MetricAdvisor (Metric advisor)
- Methods: Run(stopCh <-chan struct{}) error, HasSynced() bool
- Uses framework.Options
- Manages framework.Context
framework.Options (Framework options)
- Fields: Config *Config, StatesInformer StatesInformer, MetricCache MetricCache, CgroupReader CgroupReader, PodFilters map[string]PodFilter
framework.Context (Framework context)
- Fields: DeviceCollectors map[string]DeviceCollector, Collectors map[string]Collector, State *SharedState
- Contains Collector set
- Contains SharedState
framework.Collector (Collector interface)
- Methods: Enabled() bool, Setup(*Context), Run(<-chan struct{}), Started() bool
framework.SharedState (Shared state)
- Methods: GetNodeUsage() (*CPUQuantity, *MemoryQuantity), GetPodsUsageByCollector() (map[string]*CPUQuantity, map[string]*MemoryQuantity), GetHostAppUsage() (*CPUQuantity, *MemoryQuantity)

Diagram sources

Section sources

Collector Interfaces and Extension Points

The metrics advisor framework provides well-defined interfaces for gathering new metrics from nodes and pods. The core extension mechanism is the Collector interface defining lifecycle methods for all collectors. This interface serves as the foundation for pod-level and device-level metrics collection.

Supports two specialized types: PodCollector for pod-specific metrics and DeviceCollector for device-specific metrics. These extend the base Collector interface with additional tailored methods. The framework provides factory functions (CollectorFactory and DeviceFactory) for creating collectors based on configuration.

Collector Interfaces and Extension Points Class Structure:

Core classes and relationships:

Collector (Base collector interface)
- Methods: Enabled() bool, Setup(s *Context), Run(stopCh <-chan struct{}), Started() bool
- Note: Base interface for all metrics collectors, provides lifecycle management methods
PodCollector (Specialized Pod metrics collector)
- Inherits: Collector
- Additional methods: PodFilter, GetPodMetric(uid, podParentDir string, cs []corev1.ContainerStatus) []metriccache.MetricSample
- Note: Implements PodFilter interface
DeviceCollector (Specialized device metrics collector)
- Inherits: Collector
- Additional methods: Shutdown(), Infos() metriccache.Devices, GetNodeMetric() []metriccache.MetricSample, GetContainerMetric(containerID, podParentDir string, c *corev1.ContainerStatus) []metriccache.MetricSample
- Note: Provides device-specific information
CollectorFactory (Collector factory)
- Methods: Create(opt *Options) Collector
- Creates Collector
DeviceFactory (Device collector factory)
- Methods: Create(opt *Options) DeviceCollector
- Creates DeviceCollector

Diagram sources

plugin.go

Section sources

plugin.go

Existing Collector Implementations

Koordinator includes built-in collectors demonstrating the framework's capabilities for monitoring system resources.

Node Resource Collector

Gathers CPU and memory usage metrics from node level, collecting accumulated CPU ticks and memory usage to calculate usage rates over time. Integrates with device collectors for specialized hardware metrics. Uses shared state to update node resource usage for access by other collectors.

Performance Collector

Monitors CPU Performance Indicators (CPI) and Pressure Stall Information (PSI) for containers and pods. Uses perf events for CPI when Libpfm4 is enabled, or alternative methods otherwise. PSI collection monitors CPU, memory, and I/O pressure for insights into resource contention and bottlenecks. Feature-gated for enable/disable based on system capabilities.

System Resource Collector

Calculates system-level resource usage: system usage = node usage - pod usage - host app usage. Depends on other collectors for input metrics, demonstrating framework support for collector dependencies. Validates input metric freshness before calculations to ensure accurate derived metrics.

Node Resource Collector Interaction Flow:

Participants:
- NodeResourceCollector (node resource collector)
- StatesInformer (states informer)
- MetricCache (metric cache)
- DeviceCollector (device collector)
- SharedState (shared state)

Flow:

1. NodeResourceCollector → StatesInformer: GetAllPods()

2. NodeResourceCollector → MetricCache: Get(NodeCPUInfoKey)

3. NodeResourceCollector → DeviceCollector: GetNodeMetric()

4. NodeResourceCollector → MetricCache: Appender().Append()

5. NodeResourceCollector → MetricCache: Appender().Commit()

6. NodeResourceCollector → SharedState: UpdateNodeUsage()

Diagram sources

Section sources

Custom Collector Development

Developing custom collectors involves implementing the Collector interface and registering with the framework. Follow this standardized pattern:

Implement Lifecycle Methods: Enabled, Setup, Run, Started
- Enabled: Determine activation based on configuration/capabilities
- Setup: Initialize with shared resources (context, states informer, metric cache)
- Run: Main collection logic in loop at configurable interval
- Started: Indicate if collector successfully began
Follow Dependency Patterns: Validate input data freshness and handle errors gracefully
Use Shared State: Coordinate with other collectors and share intermediate results
Consider Performance: Minimize resource consumption, make intervals configurable

Custom collectors should ensure stability through proper error handling and avoid excessive overhead.

Section sources

Prometheus Integration

Koordinator integrates with Prometheus to expose collected metrics through standard endpoints. Integration follows Kubernetes best practices with components exposing metrics via HTTP endpoints scraped by Prometheus. Configuration managed through monitor.yaml defining ServiceMonitor resources for Prometheus operator.

Metrics registered using Prometheus client libraries and exposed through merged internal and external registries. This allows both detailed debugging metrics and high-level operational metrics with environment-based control. The koordlet exposes metrics through dedicated HTTP server:

/metrics: Merged metrics
/internal/metrics: Internal debugging metrics
/external/metrics: External operational metrics

Prometheus Integration Architecture:

Prometheus Server (monitoring server)
  ↓ (scrapes)
ServiceMonitor (service monitor)
  ↓ (targets)
  ├── koordlet
  │   └── Exposes /metrics
  ├── SLO Controller
  │   └── Exposes /metrics
  └── Descheduler
      └── Exposes /metrics

/metrics Endpoint
  └── Returns Metric Data

Diagram sources

Section sources

Configuration and Management

The metrics collection system is highly configurable with settings controlling collection intervals, enabled collectors, and operational parameters. Managed through MetricsAdvisorConfig structure.

Key parameters:

CollectResUsedInterval: Resource usage metric collection interval
CollectSysMetricOutdatedInterval: System metric data validity period
CollectNodeCPUInfoInterval: Node CPU information collection interval
CollectNodeStorageInfoInterval: Node storage information collection interval
CPICollectorInterval/TimeWindow: CPI metric collection settings
PSICollectorInterval: PSI metric collection interval
ColdPageCollectorInterval: Cold page collection interval
EnablePageCacheCollector: Enable page cache collection flag
EnableResctrlCollector: Enable resctrl collection flag

These options allow fine-tuning based on requirements, balancing monitoring granularity with system performance. Feature gates provide additional control over collector enablement based on capabilities.

Metrics Advisor Configuration Class Structure:

Configuration parameters:

MetricsAdvisorConfig (Metrics advisor configuration)
- CollectResUsedInterval (duration): Resource usage collection interval
- CollectSysMetricOutdatedInterval (duration): System metric outdated interval
- CollectNodeCPUInfoInterval (duration): Node CPU info collection interval
- CollectNodeStorageInfoInterval (duration): Node storage info collection interval
- CPICollectorInterval (duration): CPI metric collection interval
- PSICollectorInterval (duration): PSI metric collection interval
- CPICollectorTimeWindow (duration): CPI collector time window
- ColdPageCollectorInterval (duration): Cold page collection interval
- ResctrlCollectorInterval (duration): Resctrl collection interval
- EnablePageCacheCollector (bool): Enable page cache collector
- EnableResctrlCollector (bool): Enable resctrl collector

Diagram sources

pkg/koordlet/metricsadvisor/framework/config.go

Section sources

pkg/koordlet/metricsadvisor/framework/config.go

Conclusion

Koordinator's metrics collection framework provides a robust and extensible system for monitoring node/pod resource usage, performance characteristics, and system health. The plugin-based architecture enables comprehensive monitoring through built-in collectors while providing clear extension points for custom metrics collection. Prometheus integration ensures collected metrics are accessible to standard monitoring and alerting systems, enabling deep insights into cluster performance and resource utilization. By following patterns from existing collectors and leveraging the framework's shared state and dependency management, developers can create custom collectors that enhance Koordinator's monitoring capabilities for specific use cases and hardware configurations.

Metrics Collection

Introduction​

Metrics Advisor Framework​

Collector Interfaces and Extension Points​

Existing Collector Implementations​

Node Resource Collector​

Performance Collector​

System Resource Collector​

Custom Collector Development​

Prometheus Integration​

Configuration and Management​

Conclusion​

Introduction

Metrics Advisor Framework

Collector Interfaces and Extension Points

Existing Collector Implementations

Node Resource Collector

Performance Collector

System Resource Collector

Custom Collector Development

Prometheus Integration

Configuration and Management

Conclusion