openshift · rexagod · Oct 28, 2025 · Nov 3, 2025 · Nov 24, 2025 · Dec 16, 2025
diff --git a/...s/monitoring/assets/optional-monitoring-capability-cluster-settings-am-page.png b/...s/monitoring/assets/optional-monitoring-capability-cluster-settings-am-page.png
diff --git a/enhancements/monitoring/assets/optional-monitoring-capability-deployment-page.png b/enhancements/monitoring/assets/optional-monitoring-capability-deployment-page.png
diff --git a/...s/monitoring/assets/optional-monitoring-capability-devconsole-observer-page.png b/...s/monitoring/assets/optional-monitoring-capability-devconsole-observer-page.png
diff --git a/...s/monitoring/assets/optional-monitoring-capability-devconsole-topology-page.png b/...s/monitoring/assets/optional-monitoring-capability-devconsole-topology-page.png
diff --git a/enhancements/monitoring/assets/optional-monitoring-capability-doca.png b/enhancements/monitoring/assets/optional-monitoring-capability-doca.png
diff --git a/enhancements/monitoring/assets/optional-monitoring-capability-metrics-page.png b/enhancements/monitoring/assets/optional-monitoring-capability-metrics-page.png
diff --git a/enhancements/monitoring/assets/optional-monitoring-capability-overview-page.png b/enhancements/monitoring/assets/optional-monitoring-capability-overview-page.png
diff --git a/enhancements/monitoring/assets/optional-monitoring-capability-pod-page.png b/enhancements/monitoring/assets/optional-monitoring-capability-pod-page.png
diff --git a/enhancements/monitoring/metrics-collection-profiles.md b/enhancements/monitoring/metrics-collection-profiles.md
@@ -92,6 +92,10 @@ kube-state-metrics, kubelet and the network daemon.
 
 ### User Stories
 
+- As a user, I want to aggressively save on compute and space, so I only want
+  the bare-minimum set of metrics collected in my cluster, even if that breaks
+  built-in frontend features that rely on internal metrics, which the
+  `Minimal` collection profile keeps working.
 - As a user, I want to lower the amount of resources consumed by Prometheus in a
   supported way, so I can configure the clusters metrics collection profiles to
   `Minimal`.
@@ -109,6 +113,9 @@ kube-state-metrics, kubelet and the network daemon.
   all the profile metrics are present in the cluster, and which of the profile
   monitors are affected if not. Also, I want additional information to narrow
   down where these metrics are exactly being used.
+- As a developer of a component, I want to make it telemetry-aware, and reduce
+  the overall metrics exposition data from it when the user opts into telemetry
+  collection (by setting the current collection profile to `Telemetry`).
 
 ### Goals
 
@@ -185,11 +192,12 @@ all.
 Once set up, the implementation within the operator that defines its behavior
 for such profiles will decide how it reconciles under such conditions.
 
-The goal is to support 2 profiles:
+The goal is to support 3 profiles:
 
 - `Full` (same as today)
 - `Minimal` (only collect metrics necessary for recording rules, alerts,
   dashboards, HPA, VPA and telemetry)
+- `Telemetry` (only collect metrics that are required for telemetry)
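The three profiles can be modelled as a small string enum. The following is a minimal sketch, not CMO's actual code: the constant names and the `ParseCollectionProfile` and `LabelValue` helpers are hypothetical. It assumes that both PascalCase and camelCase spellings are accepted, that an unset value means `Full`, and that monitors carry the lowercase form as their label value.

```go
package main

import (
	"fmt"
	"strings"
)

// CollectionProfile mirrors the enum described in this proposal. The type and
// constant names are illustrative, not copied from CMO.
type CollectionProfile string

const (
	ProfileFull      CollectionProfile = "Full"
	ProfileMinimal   CollectionProfile = "Minimal"
	ProfileTelemetry CollectionProfile = "Telemetry"
)

// ParseCollectionProfile (hypothetical helper) maps a user-supplied value to a
// canonical profile. An empty value falls back to Full; both PascalCase and
// camelCase spellings are accepted.
func ParseCollectionProfile(raw string) (CollectionProfile, error) {
	switch raw {
	case "", "Full", "full":
		return ProfileFull, nil
	case "Minimal", "minimal":
		return ProfileMinimal, nil
	case "Telemetry", "telemetry":
		return ProfileTelemetry, nil
	}
	return "", fmt.Errorf("unsupported collection profile %q", raw)
}

// LabelValue returns the lowercase form used as the value of the
// monitoring.openshift.io/collection-profile label on monitors.
func (p CollectionProfile) LabelValue() string {
	return strings.ToLower(string(p))
}

func main() {
	for _, raw := range []string{"", "minimal", "Telemetry", "bogus"} {
		p, err := ParseCollectionProfile(raw)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Printf("%q -> %s (label value %q)\n", raw, p, p.LabelValue())
	}
}
```

Rejecting unknown values instead of silently defaulting them is a design choice of this sketch; it surfaces typos in the configuration early.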
 
 Note that the profile names are PascalCased in this KEP, but at this time,
 only camelCase is supported in CMO. However, since Kubernetes enums are
@@ -215,9 +223,9 @@ spec:
       - "full"
 ```
 
-An OpenShift team that wants to support the metrics collection profiles feature
-would need to provide 2 monitors for each profile (in this example 1
-ServiceMonitor per profile).
+An OpenShift team that wants to completely support the metrics collection
+profiles feature would need to provide 3 monitors, one for each profile (in
+this example, 1 ServiceMonitor per profile).
 
 ```yaml
 ---
@@ -250,6 +258,31 @@ metadata:
     monitoring.openshift.io/collection-profile: minimal
   name: telemeter-client-minimal
   namespace: openshift-monitoring
+spec:
+  endpoints:
+  - bearerTokenFile: /var/run/secrets/kubernetes.io/serviceaccount/token
+    interval: 30s
+    port: https
+    scheme: https
+    tlsConfig:
+      <...>
+    metricRelabelings:
+    - sourceLabels: [__name__]
+      action: keep
+      regex: "federate_samples|federate_filtered_samples"
+  jobLabel: k8s-app
+  selector:
+    matchLabels:
+      k8s-app: telemeter-client
+---
+apiVersion: monitoring.coreos.com/v1
+kind: ServiceMonitor
+metadata:
+  labels:
+    k8s-app: telemeter-client
+    monitoring.openshift.io/collection-profile: telemetry
+  name: telemeter-client-telemetry
+  namespace: openshift-monitoring
 spec:
   endpoints:
   - bearerTokenFile: /var/run/secrets/kubernetes.io/serviceaccount/token
@@ -312,14 +345,15 @@ NA
 ```go
 type PrometheusK8sConfig struct {
     // Defines the metrics collection profile that Prometheus uses to collect
-    // metrics from the platform components. Supported values are `Full` or
-    // `Minimal`. In the `Full` profile (default), Prometheus collects all
-    // metrics that are exposed by the platform components (same behavior as
-    // before) . In the `Minimal` profile, Prometheus only collects metrics
-    // necessary for the default platform alerts, recording rules, telemetry
-    // and console dashboards. When unset, the default value is `Full`.
-    // Note that while PascalCase and camelCase values are supported, the
-    // former is preferred for consistency with the Kubernetes API. There are
+    // metrics from the platform components. Supported values are `Full`,
+    // `Minimal` and `Telemetry`. In the `Full` profile (default), Prometheus
+    // collects all metrics that are exposed by the platform components (same
+    // behavior as before). In the `Minimal` profile, Prometheus only collects
+    // metrics necessary for the default platform alerts, recording rules,
+    // telemetry and console dashboards. In the `Telemetry` profile, only metrics
+    // necessary for telemetry are collected. When unset, the default value is
+    // `Full`. Note that while PascalCase and camelCase values are supported,
+    // the former is preferred for consistency with the Kubernetes API. There are
     // no plans to drop camelCase support, as it may break existing workloads.
     CollectionProfile CollectionProfile `json:"collectionProfile,omitempty"`
 }
@@ -383,13 +417,21 @@ not. To aid teams with this effort the monitoring team will provide:
 
 - Unit tests in CMO to validate that the correct monitors are being selected.
 - E2E tests in CMO to validate that everything works correctly.
+- For the `Telemetry` profile, add tests similar to [those that exist for the `Minimal` profile](https://github.com/openshift/origin/pull/28889/changes#diff-00da964b40cc78eccb31c5bd15423de5364fa3dfa65c08a09e089c651cb28281).
+
+#### API team's suggestions
+
 - For the `Minimal` profile, origin/CI test to validate that every metric used
-in a resource (Alerts/PrometheusRules/Dashboards) exists in the `keep`
-expression of a minimal monitors.
+  in a resource (Alerts/PrometheusRules/Dashboards) exists in the `keep`
+  expression of a minimal monitor.
+  - Introduce profile-based recording rules that target such expressions (so
+    it's easier for us to track them as well)?
 - E2E test that ensures that for every monitor that is labelled as `Full`
-collection profile, there also exists one for `Minimal`, and vice versa, using
-[rexagod/cpv], a CLI tool that can be used to validate the implementation of
-metrics collection profiles in OpenShift components.
+  collection profile, there also exists one for `Minimal`, and vice versa, using
+  [rexagod/cpv], a CLI tool that can be used to validate the implementation of
+  metrics collection profiles in OpenShift components.
+  - Keep a historical record in place, so that any profile being added or
+    dropped is identified.
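The two checks suggested above could be sketched as follows. This is a simplified illustration under stated assumptions, not how [rexagod/cpv] or the origin tests are implemented: `Monitor`, its `Component` grouping field, `MissingProfiles` and `MissingFromKeep` are hypothetical names, and the metric `example_unlisted_metric` is made up. The `keep` regex is wrapped in `^(?:...)$` because Prometheus anchors relabeling regexes on both ends.

```go
package main

import (
	"fmt"
	"regexp"
)

const profileLabel = "monitoring.openshift.io/collection-profile"

// Monitor is a simplified stand-in for a ServiceMonitor or PodMonitor.
// Component is an assumed grouping key (for example the k8s-app label).
type Monitor struct {
	Component string
	Labels    map[string]string
}

// MissingProfiles reports, per component, which of the expected profiles have
// no monitor carrying the matching collection-profile label.
func MissingProfiles(monitors []Monitor, profiles []string) map[string][]string {
	seen := map[string]map[string]bool{}
	for _, m := range monitors {
		if seen[m.Component] == nil {
			seen[m.Component] = map[string]bool{}
		}
		seen[m.Component][m.Labels[profileLabel]] = true
	}
	gaps := map[string][]string{}
	for component, have := range seen {
		for _, p := range profiles {
			if !have[p] {
				gaps[component] = append(gaps[component], p)
			}
		}
	}
	return gaps
}

// MissingFromKeep returns the metric names in used that a monitor's `keep`
// regex would drop. The regex is fully anchored, as Prometheus does.
func MissingFromKeep(keep string, used []string) ([]string, error) {
	re, err := regexp.Compile("^(?:" + keep + ")$")
	if err != nil {
		return nil, err
	}
	var missing []string
	for _, name := range used {
		if !re.MatchString(name) {
			missing = append(missing, name)
		}
	}
	return missing, nil
}

func main() {
	monitors := []Monitor{
		{Component: "telemeter-client", Labels: map[string]string{profileLabel: "full"}},
		{Component: "telemeter-client", Labels: map[string]string{profileLabel: "minimal"}},
	}
	fmt.Println("missing profiles:", MissingProfiles(monitors, []string{"full", "minimal", "telemetry"}))

	// example_unlisted_metric is a made-up name used to trigger a finding.
	used := []string{"federate_samples", "example_unlisted_metric"}
	missing, err := MissingFromKeep("federate_samples|federate_filtered_samples", used)
	if err != nil {
		fmt.Println("invalid keep regex:", err)
		return
	}
	fmt.Println("metrics not kept:", missing)
}
```

In a real test, the monitors would be read from the component manifests and the list of used metrics would be extracted from the alerting rules, recording rules and dashboards.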
 
 [rexagod/cpv]: https://github.com/rexagod/cpv#status
 
@@ -404,6 +446,11 @@ TechPreview gate. PTAL at the section below for more details.
 
 - GA'd in 4.19: https://github.com/openshift/api/pull/2286
 
+- Profile addition: The `Telemetry` collection profile is currently being
+  developed at [CMO#2694].
+
+[CMO#2694]: https://github.com/openshift/cluster-monitoring-operator/pull/2694
+
 ### Dev Preview -> Tech Preview
 
 - [Design scrape profiles in CMO](https://issues.redhat.com/browse/MON-2483)
@@ -446,6 +493,7 @@ all.
 - https://github.com/openshift/cluster-monitoring-operator/pull/2030
 - https://github.com/openshift/cluster-monitoring-operator/pull/2047
 - https://github.com/openshift/origin/pull/28889
+- https://github.com/openshift/cluster-monitoring-operator/pull/2694
 
 ## Alternatives (Not Implemented)
 
@@ -507,6 +555,8 @@ and implementation status. Possible implementation status:
 - implementation in progress
 - implemented
 
+#### `Minimal` collection profile
+
 | Team            | Component          | Implementation Status      |
 |-----------------|--------------------|----------------------------|
 | Monitoring Team | kubelet            | Implemented                |
@@ -515,6 +565,26 @@ and implementation status. Possible implementation status:
 | Monitoring Team | node-exporter      | Implemented                |
 | Monitoring Team | prometheus-adapter | Implemented                |
 
+#### `Telemetry` collection profile
+
+| Team            | Component                    | Implementation Status      |
+|-----------------|------------------------------|----------------------------|
+| Monitoring Team | alertmanager                 | Implemented                |
+| Monitoring Team | cluster-monitoring-operator  | Implemented                |
+| Monitoring Team | control-plane                | Implemented                |
+| Monitoring Team | kube-state-metrics           | Implemented                |
+| Monitoring Team | metrics-server               | Implemented                |
+| Monitoring Team | node-exporter                | Implemented                |
+| Monitoring Team | openshift-state-metrics      | Implemented                |
+| Monitoring Team | prometheus-k8s               | Implemented                |
+| Monitoring Team | prometheus-operator          | Implemented                |
+| Monitoring Team | telemeter-client             | Implemented                |
+| Monitoring Team | thanos-querier               | Implemented                |
+
 ### Topology Considerations
 
 Supported on all topologies that deploy CMO.