From c33ab268b53c0deb8e7a2a1505e570905fa3e83b Mon Sep 17 00:00:00 2001 From: jortizpa Date: Tue, 14 Oct 2025 13:45:15 +0200 Subject: [PATCH 1/4] Enhancement: Ingress Operator Resource Configuration via v1alpha1 API This enhancement proposes adding the ability to configure resource limits and requests for the ingress-operator deployment containers via a new v1alpha1 API field in the IngressController custom resource. This addresses the need for: - Setting resource limits for QoS guarantees - Compliance requirements for resource constraints - Scaling operator resources for large deployments Relates to: RFE-1476 --- .../operator-resource-configuration.md | 687 ++++++++++++++++++ 1 file changed, 687 insertions(+) create mode 100644 enhancements/ingress/operator-resource-configuration.md diff --git a/enhancements/ingress/operator-resource-configuration.md b/enhancements/ingress/operator-resource-configuration.md new file mode 100644 index 0000000000..1f017fdc09 --- /dev/null +++ b/enhancements/ingress/operator-resource-configuration.md @@ -0,0 +1,687 @@ +--- +title: ingress-operator-resource-configuration +authors: + - "@jortizpa" +reviewers: + - "@Miciah" + - "@frobware" + - "@candita" + - "@danehans" +approvers: + - "@deads2k" +api-approvers: + - "@JoelSpeed" + - "@deads2k" +creation-date: 2025-01-14 +last-updated: 2025-01-14 +tracking-link: + - https://issues.redhat.com/browse/RFE-1476 +see-also: + - "/enhancements/monitoring/cluster-monitoring-config.md" +replaces: [] +superseded-by: [] +--- + +# Ingress Operator Resource Configuration + +## Summary + +This enhancement proposes adding the ability to configure resource limits and +requests for the ingress-operator deployment containers via a new v1alpha1 API +field in the IngressController custom resource. + +## Motivation + +Currently, the ingress-operator deployment has hardcoded resource requests +(CPU: 10m, Memory: 56Mi for the main container, and CPU: 10m, Memory: 40Mi for +the kube-rbac-proxy sidecar) with no resource limits defined. This presents +challenges for: + +1. **Clusters with resource constraints**: Cannot guarantee QoS guarantees without limits +2. **Large-scale deployments**: May need higher resource allocation +3. **Compliance requirements**: Some organizations require all pods have limits +4. **Resource accounting**: Better cost allocation and resource planning + +Related to [RFE-1476](https://issues.redhat.com/browse/RFE-1476). + +### User Stories + +#### Story 1: Platform Administrator Needs Resource Limits + +As a platform administrator, I want to set resource limits on the ingress-operator +to ensure it has a QoS class of "Guaranteed" for critical cluster infrastructure. + +Acceptance Criteria: +- Can specify resource limits via IngressController CR +- Operator pod reflects the configured limits +- Pod achieves QoS class "Guaranteed" when requests == limits + +#### Story 2: Large Cluster Operator + +As an operator of a large-scale cluster with thousands of routes, I need to +increase the ingress-operator's resource allocation to handle the increased load +from managing many IngressController instances and routes. + +Acceptance Criteria: +- Can configure higher CPU and memory allocations +- Operator performs adequately under high load +- Configuration survives operator restarts and upgrades + +#### Story 3: Compliance Requirements + +As a compliance officer, I need all pods in my OpenShift cluster to have both +resource requests and limits defined for auditing, cost allocation, and capacity +planning purposes. 
+ +Acceptance Criteria: +- All operator containers can have limits configured +- Configuration is auditable via oc commands +- Meets organizational policy requirements + +### Goals + +- Allow configuration of resource requests and limits for ingress-operator containers +- Follow established patterns from cluster monitoring configuration +- Maintain backward compatibility with existing IngressController v1 API +- Use v1alpha1 API version for this Tech Preview feature +- Provide sensible defaults that work for most deployments +- Support both the ingress-operator and kube-rbac-proxy containers + +### Non-Goals + +- Configuring resources for router pods (these are separate workloads managed by the operator) +- Auto-scaling or dynamic resource adjustment based on load +- Configuring resources for other operators in the cluster +- Modifying the v1 API (stable API remains unchanged) +- Vertical Pod Autoscaler (VPA) integration (may be future work) + +## Proposal + +### Workflow Description + +**Platform Administrator** is a human responsible for configuring the OpenShift cluster. + +1. Platform administrator creates or updates an IngressController CR using the + v1alpha1 API version +2. Platform administrator sets the `operatorResourceRequirements` field with + desired resource limits/requests +3. The ingress-operator watches for changes to the IngressController CR +4. A new operator deployment controller reconciles the operator's own deployment + with the specified resources +5. Kubernetes performs a rolling restart of the operator pods with the new resource configuration +6. Platform administrator verifies the changes with `oc describe deployment ingress-operator -n openshift-ingress-operator` + +### API Extensions + +Create a new v1alpha1 API version for IngressController in the +`operator.openshift.io` group, following the pattern established by +[cluster monitoring v1alpha1 configuration](https://github.com/openshift/api/blob/94481d71bb6f3ce6019717ea7900e6f88f42fa2c/config/v1alpha1/types_cluster_monitoring.go#L172-L193). + +#### New API Types + +```go +package v1alpha1 + +import ( + metav1 "k8s.io/apimachinery/pkg/apis/meta/v1" + corev1 "k8s.io/api/core/v1" + + operatorv1 "github.com/openshift/api/operator/v1" +) + +// IngressController describes a managed ingress controller for the cluster. +// This is a v1alpha1 Tech Preview API that extends the v1 API with additional +// configuration options. +// +// Compatibility level 4: No compatibility is provided, the API can change at any point for any reason. +// +openshift:compatibility-gen:level=4 +type IngressController struct { + metav1.TypeMeta `json:",inline"` + metav1.ObjectMeta `json:"metadata,omitempty"` + + // spec is the specification of the desired behavior of the IngressController. + Spec IngressControllerSpec `json:"spec,omitempty"` + + // status is the most recently observed status of the IngressController. + Status operatorv1.IngressControllerStatus `json:"status,omitempty"` +} + +// IngressControllerSpec extends the v1 IngressControllerSpec with v1alpha1 fields. +type IngressControllerSpec struct { + // Embed the entire v1 spec for backwards compatibility + operatorv1.IngressControllerSpec `json:",inline"` + + // operatorResourceRequirements defines resource requirements for the + // ingress operator's own containers (not the router pods managed by the operator). + // This allows configuring CPU and memory limits/requests for the operator deployment. 
+ // + // When not specified, the operator uses default resource requirements: + // ingress-operator container: requests(cpu: 10m, memory: 56Mi), limits(cpu: 10m, memory: 56Mi) + // kube-rbac-proxy container: requests(cpu: 10m, memory: 40Mi), limits(cpu: 10m, memory: 40Mi) + // + // Note: Changing these values will cause the ingress-operator pod to restart. + // + // +optional + // +openshift:enable:FeatureGate=IngressOperatorResourceManagement + OperatorResourceRequirements *OperatorResourceRequirements `json:"operatorResourceRequirements,omitempty"` +} + +// OperatorResourceRequirements defines resource requirements for ingress operator containers. +// Similar to the pattern used in cluster monitoring configuration. +type OperatorResourceRequirements struct { + // ingressOperatorContainer specifies resource requirements for the + // ingress-operator container in the operator deployment. + // + // If not specified, defaults to: + // requests: cpu: 10m, memory: 56Mi + // limits: cpu: 10m, memory: 56Mi + // + // +optional + IngressOperatorContainer *corev1.ResourceRequirements `json:"ingressOperatorContainer,omitempty"` + + // kubeRbacProxyContainer specifies resource requirements for the + // kube-rbac-proxy sidecar container in the operator deployment. + // + // If not specified, defaults to: + // requests: cpu: 10m, memory: 40Mi + // limits: cpu: 10m, memory: 40Mi + // + // +optional + KubeRbacProxyContainer *corev1.ResourceRequirements `json:"kubeRbacProxyContainer,omitempty"` +} +``` + +#### Example Usage + +```yaml +apiVersion: operator.openshift.io/v1alpha1 +kind: IngressController +metadata: + name: default + namespace: openshift-ingress-operator +spec: + # All existing v1 fields continue to work + replicas: 2 + domain: apps.example.com + + # New v1alpha1 field for operator resource configuration + operatorResourceRequirements: + ingressOperatorContainer: + requests: + cpu: 20m + memory: 100Mi + limits: + cpu: 100m + memory: 200Mi + kubeRbacProxyContainer: + requests: + cpu: 10m + memory: 40Mi + limits: + cpu: 50m + memory: 80Mi +``` + +#### API Validation + +The following validations will be enforced: + +1. **Resource limits must be >= requests**: Kubernetes standard validation +2. **Minimum values** (recommendations, not enforced): + - ingress-operator container: cpu >= 10m, memory >= 56Mi + - kube-rbac-proxy container: cpu >= 10m, memory >= 40Mi +3. **API conversion**: v1alpha1-specific fields are dropped when converting to v1 + +### Topology Considerations + +#### Hypershift / Hosted Control Planes + +In Hypershift environments: +- The management cluster runs the ingress-operator for the management cluster +- Each hosted cluster's control plane runs its own ingress-operator +- This enhancement applies to both contexts independently +- Configuration is specific to each IngressController instance + +#### Standalone Clusters + +Standard behavior - configuration applies to the cluster's ingress-operator deployment +in the `openshift-ingress-operator` namespace. 
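To illustrate the end state, a reconciled `ingress-operator` Deployment would carry the configured values in its container specs. The fragment below is a minimal sketch, assuming the example IngressController values shown earlier; it is illustrative and not the literal manifest the operator renders.

```yaml
# Sketch of the reconciled ingress-operator Deployment fragment
# (values taken from the example IngressController above; illustrative only).
apiVersion: apps/v1
kind: Deployment
metadata:
  name: ingress-operator
  namespace: openshift-ingress-operator
spec:
  template:
    spec:
      containers:
      - name: ingress-operator
        resources:
          requests:
            cpu: 20m
            memory: 100Mi
          limits:
            cpu: 100m
            memory: 200Mi
      - name: kube-rbac-proxy
        resources:
          requests:
            cpu: 10m
            memory: 40Mi
          limits:
            cpu: 50m
            memory: 80Mi
```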
+ +#### Single-node Deployments + +Particularly beneficial for single-node OpenShift (SNO) deployments where: +- Resource constraints are tighter +- Setting appropriate limits helps prevent resource contention +- Guaranteed QoS class improves stability + +### Implementation Details/Notes/Constraints + +#### API Versioning Strategy + +- **v1 API**: Remains stable and unchanged (storage version) +- **v1alpha1 API**: Served but not stored +- **Conversion**: Automatic conversion between versions via conversion webhooks +- **Field handling**: v1alpha1-specific fields are dropped when reading via v1 API +- **Compatibility**: Existing v1 clients continue working without changes + +#### Controller Implementation + +A new controller (`operator-deployment-controller`) in the cluster-ingress-operator +watches the default IngressController CR and reconciles the operator's own deployment +when `operatorResourceRequirements` is specified. + +**Controller responsibilities:** +1. Watch IngressController resources (v1alpha1) +2. Reconcile `ingress-operator` Deployment in `openshift-ingress-operator` namespace +3. Update container resource specifications +4. Handle error cases gracefully (invalid values, conflicts, etc.) + +#### Default Behavior + +When `operatorResourceRequirements` is not set or when using the v1 API: + +**Current state** (what exists now): +- ingress-operator container: requests only (cpu: 10m, memory: 56Mi), no limits +- kube-rbac-proxy container: requests only (cpu: 10m, memory: 40Mi), no limits + +**New default** (after this enhancement): +- Static manifest updated to include limits matching requests +- ingress-operator container: requests(cpu: 10m, memory: 56Mi), limits(cpu: 10m, memory: 56Mi) +- kube-rbac-proxy container: requests(cpu: 10m, memory: 40Mi), limits(cpu: 10m, memory: 40Mi) +- This provides QoS class "Guaranteed" by default + +#### Upgrade Behavior + +When upgrading to a version with this enhancement: +1. Existing deployments get updated manifests with new default limits +2. IngressController CRs remain at v1 unless explicitly changed +3. No user action required for default behavior +4. 
Users can opt-in to v1alpha1 to customize resources + +### Risks and Mitigations + +#### Risk: User sets resources too low, operator becomes unhealthy + +**Impact**: Operator may OOMKill, fail to reconcile, or become unresponsive + +**Mitigation**: +- Document minimum recommended values +- Add validation warnings (not blocking) for values below minimums +- Include troubleshooting guide for common issues +- Monitor operator health metrics + +**Likelihood**: Medium + +#### Risk: Incompatibility with existing tooling expecting v1 API only + +**Impact**: External tools may not recognize v1alpha1 resources + +**Mitigation**: +- v1 API remains unchanged and fully functional +- v1alpha1 is opt-in +- Document migration path +- Conversion webhooks ensure cross-version compatibility + +**Likelihood**: Low + +#### Risk: Operator restart causes brief unavailability + +**Impact**: Configuration changes trigger pod restart, brief reconciliation delay + +**Mitigation**: +- Document that changes trigger rolling restart (expected behavior) +- Operator restart is typically < 30 seconds +- Router pods continue serving traffic during operator restart +- Changes to operator resources are not expected to be frequent + +**Likelihood**: High (by design), **Severity**: Low + +#### Risk: Resource configuration drift + +**Impact**: Manual changes to deployment could be overwritten by controller + +**Mitigation**: +- Controller reconciliation loop detects and corrects drift +- Document that configuration should be via IngressController CR, not direct deployment edits +- Admission webhooks prevent direct deployment modifications + +**Likelihood**: Low + +### Drawbacks + +1. **Increased API complexity**: Adds another version and configuration surface +2. **Maintenance burden**: Requires maintaining v1alpha1 API version and conversion logic +3. **Operator self-modification**: Operator modifying its own deployment adds complexity +4. **Documentation overhead**: Need to document new field and migration path +5. **Testing complexity**: Must test version conversion and upgrade scenarios + +## Design Details + +### Open Questions + +1. **Q**: Should we support auto-scaling (VPA) in the future? + - **A**: Out of scope for initial implementation, but API should not preclude it + +2. **Q**: Should we add validation for minimum resource values? + - **A**: Start with warnings/documentation, consider hard validation if issues arise + +3. **Q**: Should this apply to all IngressControllers or only the default? + - **A**: Initial implementation only default, but API supports any IngressController + +4. **Q**: How do we handle the operator modifying its own deployment safely? 
+ - **A**: Use owner references carefully, reconcile loop with backoff + +### Test Plan + +#### Unit Tests + +- **API conversion tests**: v1 ↔ v1alpha1 conversion correctness +- **Controller reconciliation logic**: Mock deployment updates +- **Resource requirement validation**: Edge cases and invalid inputs +- **Default value handling**: Ensure defaults applied correctly + +Coverage target: >80% for new code + +#### Integration Tests + +- **API server integration**: v1alpha1 CRD registration and serving +- **Conversion webhook**: Automatic conversion between versions +- **Controller watches**: IngressController changes trigger reconciliation + +#### E2E Tests + +- **Create IngressController with operatorResourceRequirements** + - Verify operator deployment is updated with correct resources + - Verify operator continues functioning normally + +- **Update existing IngressController to add resource requirements** + - Verify rolling update occurs + - Verify no disruption to router functionality + +- **Remove resource requirements (revert to defaults)** + - Verify deployment reverts to default values + +- **Upgrade scenario tests** + - Upgrade from version without feature to version with feature + - Verify existing IngressControllers continue working + - Verify v1 API remains functional + +- **Downgrade scenario tests** + - Downgrade from version with v1alpha1 to version without + - Verify graceful degradation (v1alpha1 fields ignored) + +#### Manual Testing + +- Test in resource-constrained environments (e.g., single-node) +- Verify QoS class changes as expected (None → Burstable → Guaranteed) +- Test with various resource configurations (very low, very high) +- Test operator behavior when limits are hit (OOMKill, CPU throttling) +- Test with multiple IngressController instances + +### Graduation Criteria + +#### Dev Preview -> Tech Preview (v1alpha1) + +- [x] Feature implemented behind feature gate +- [x] Unit and integration tests passing +- [x] E2E tests passing in CI +- [x] Documentation published in OpenShift docs +- [x] Enhancement proposal approved +- [ ] Feedback collected from at least 3 early adopters +- [ ] Known issues documented + +#### Tech Preview -> GA (promotion to v1) + +This section describes criteria for graduating from v1alpha1 to v1 (stable API). + +- [ ] Sufficient field testing (2+ minor releases in Tech Preview) +- [ ] No major bugs reported for 2 consecutive releases +- [ ] Performance impact assessed and documented +- [ ] API design validated by diverse user scenarios +- [ ] At least 10 production users providing positive feedback +- [ ] All tests consistently passing +- [ ] Documentation complete and reviewed +- [ ] Upgrade/downgrade tested extensively +- [ ] API review completed and approved for promotion + +Timeline estimate: 6-12 months after Tech Preview release + +#### Removing a deprecated feature + +N/A - this is a new feature + +### Upgrade / Downgrade Strategy + +#### Upgrade + +**From version without feature → version with feature:** + +1. CRD updated to include v1alpha1 version +2. Existing IngressController CRs remain at v1 (storage version) +3. Operator deployment updated with default resource limits +4. Users can opt-in to v1alpha1 API to customize resources +5. No breaking changes to existing functionality + +**User action required**: None for default behavior + +**User action optional**: Update to v1alpha1 API to customize operator resources + +#### Downgrade + +**From version with feature → version without feature:** + +1. 
v1alpha1 API becomes unavailable +2. IngressController CRs remain at v1 (storage version, unaffected) +3. v1alpha1-specific fields (operatorResourceRequirements) are ignored +4. Operator deployment falls back to static manifest defaults +5. No data loss as v1 remains storage version + +**User impact**: Loss of custom operator resource configuration, reverts to defaults + +#### Version Skew + +Supported version skew follows standard OpenShift practices: +- API server and operator may be one minor version apart during upgrades +- v1 API compatibility maintained across all versions +- Conversion webhooks handle any necessary translations + +### Version Skew Strategy + +#### Operator and API Server Skew + +During cluster upgrades, the API server may be updated before or after the ingress-operator: + +**Scenario 1**: API server updated first (has v1alpha1), operator not yet updated +- v1alpha1 CRs accepted by API server +- Old operator version ignores v1alpha1 fields (reads via v1 API) +- No impact, custom resources wait for operator upgrade + +**Scenario 2**: Operator updated first (supports v1alpha1), API server not yet updated +- Operator can handle v1alpha1 resources +- API server doesn't serve v1alpha1 yet +- Users continue using v1 API until API server updates + +**Maximum skew**: 1 minor version (OpenShift standard) + +### Operational Aspects of API Extensions + +#### Failure Modes + +1. **Invalid resource values**: + - Rejected by Kubernetes validation + - User receives clear error message + - Operator continues with existing configuration + +2. **Controller failure**: + - Operator deployment remains at current configuration + - Deployment status reflects error + - Operator logs provide debugging information + +3. **API conversion failure**: + - Request fails with error message + - User notified of conversion issue + - Existing resources unaffected + +4. **Operator restart loop due to low resources**: + - Kubernetes backoff prevents rapid restarts + - Events and logs indicate resource pressure + - Admin can update IngressController to increase resources + +#### Support Procedures + +Standard OpenShift support procedures apply: + +**Gathering debug information**: +```bash +# View IngressController configuration +oc get ingresscontroller default -n openshift-ingress-operator -o yaml + +# View operator deployment +oc describe deployment ingress-operator -n openshift-ingress-operator + +# Check operator logs +oc logs -n openshift-ingress-operator deployment/ingress-operator -c ingress-operator + +# Check pod resource usage +oc adm top pod -n openshift-ingress-operator + +# Check QoS class +oc get pod -n openshift-ingress-operator -o jsonpath='{.items[*].status.qosClass}' +``` + +**Common issues and resolutions**: +- OOMKilled operator: Increase memory limits +- CPU throttling: Increase CPU limits or reduce requests if not needed +- Configuration not applied: Check operator logs for reconciliation errors + +## Implementation History + +- 2025-01-14: Enhancement proposed +- TBD: Enhancement approved +- TBD: API implementation merged to openshift/api +- TBD: Controller implementation merged to cluster-ingress-operator +- TBD: Feature available in Tech Preview (target: OpenShift 4.X) +- TBD: Promotion to GA (target: OpenShift 4.Y, ~2 releases after Tech Preview) + +## Alternatives + +### Alternative 1: Configuration via ConfigMap + +Use a ConfigMap for operator resource configuration instead of API field. 
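For illustration only, a ConfigMap-based approach might look roughly like the sketch below; the ConfigMap name and key are hypothetical and do not correspond to any existing interface.

```yaml
# Hypothetical ConfigMap-based configuration; the name and key shown here
# are illustrative only and do not correspond to an existing interface.
apiVersion: v1
kind: ConfigMap
metadata:
  name: ingress-operator-resources
  namespace: openshift-ingress-operator
data:
  config.yaml: |
    ingressOperatorContainer:
      requests:
        cpu: 20m
        memory: 100Mi
      limits:
        cpu: 100m
        memory: 200Mi
```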
+ +**Pros**: +- Simpler to implement +- No API version changes needed +- Easy to update without CRD changes + +**Cons**: +- Less type-safe +- Doesn't follow OpenShift patterns +- No automatic validation +- Harder to discover and document + +**Decision**: Rejected - API-based configuration is the established OpenShift pattern + +### Alternative 2: Modify v1 API directly + +Add `operatorResourceRequirements` field directly to stable v1 API. + +**Pros**: +- No need for v1alpha1 version +- Simpler for users (one API version) + +**Cons**: +- Changes stable API (breaking compatibility promise) +- Cannot iterate on design easily +- Difficult to remove if issues found +- Against OpenShift API stability guarantees + +**Decision**: Rejected - Use v1alpha1 for new features as per OpenShift conventions + +### Alternative 3: Separate CRD for operator configuration + +Create a new OperatorConfiguration CRD (similar to how cluster monitoring works). + +**Pros**: +- Separation of concerns +- Can configure multiple operators uniformly + +**Cons**: +- Increases API surface unnecessarily +- IngressController is the logical place for ingress-operator configuration +- More CRDs to manage +- Inconsistent with how other operators handle self-configuration + +**Decision**: Rejected - IngressController CR is the appropriate configuration location + +### Alternative 4: Operator command-line flags or environment variables + +Configure operator resources via deployment environment variables or command flags. + +**Pros**: +- Very simple to implement +- No API changes needed + +**Cons**: +- Not GitOps friendly +- Requires direct deployment modification +- Not discoverable via API +- Doesn't follow OpenShift declarative configuration patterns +- Difficult to audit and version control + +**Decision**: Rejected - Declarative API configuration is required + +### Alternative 5: Use OperatorHub/OLM configuration + +Leverage Operator Lifecycle Manager (OLM) subscription configuration. 
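For context, OLM-managed operators can have their deployment resources tuned through the Subscription's `spec.config.resources` stanza, roughly as sketched below; the operator name, namespace, and channel are placeholders, and, as noted below, this mechanism does not apply to the ingress operator because it is not OLM-managed.

```yaml
# Illustrative only: OLM Subscription resource overrides apply to
# OLM-managed operators; the ingress operator is not OLM-managed.
apiVersion: operators.coreos.com/v1alpha1
kind: Subscription
metadata:
  name: example-operator          # placeholder
  namespace: example-namespace    # placeholder
spec:
  channel: stable
  name: example-operator
  source: redhat-operators
  sourceNamespace: openshift-marketplace
  config:
    resources:
      requests:
        cpu: 20m
        memory: 100Mi
      limits:
        cpu: 100m
        memory: 200Mi
```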
+ +**Pros**: +- Follows OLM patterns +- Could work for OLM-managed operators + +**Cons**: +- Ingress operator is not OLM-managed (it's a cluster operator) +- Adds OLM dependency +- Not applicable to this operator's deployment model + +**Decision**: Rejected - Not applicable to cluster operators + +## Infrastructure Needed + +### Development Infrastructure + +- Standard OpenShift CI/CD pipeline (already exists) +- No special hardware or cloud resources required + +### Testing Infrastructure + +- CI jobs for unit, integration, and E2E tests (leverage existing CI) +- Access to test clusters for manual testing (existing QE infrastructure) +- Performance testing environment for load testing (optional, future work) + +### Documentation Infrastructure + +- OpenShift documentation repository access +- Standard docs.openshift.com publishing pipeline + +### Monitoring Infrastructure + +- Existing operator metrics (no new infrastructure needed) +- Alert rules may be added in future iterations + +## Dependencies + +### Code Dependencies + +- `github.com/openshift/api` - API definitions (will be updated) +- `k8s.io/api` - Kubernetes core types +- `sigs.k8s.io/controller-runtime` - Controller framework + +### Team Dependencies + +- **Ingress team**: Implementation and maintenance +- **API team**: API review and approval +- **Docs team**: Documentation +- **QE team**: Testing +- **ART team**: Release and build processes + From d708dc0014ed452a6917c10b8efbf281ba1f67ed Mon Sep 17 00:00:00 2001 From: Jose Ortiz Date: Mon, 27 Oct 2025 16:28:19 +0100 Subject: [PATCH 2/4] Update operator-resource-configuration.md Minor updates --- .../operator-resource-configuration.md | 43 +++++++------------ 1 file changed, 16 insertions(+), 27 deletions(-) diff --git a/enhancements/ingress/operator-resource-configuration.md b/enhancements/ingress/operator-resource-configuration.md index 1f017fdc09..21e9132ffd 100644 --- a/enhancements/ingress/operator-resource-configuration.md +++ b/enhancements/ingress/operator-resource-configuration.md @@ -1,19 +1,19 @@ --- title: ingress-operator-resource-configuration authors: - - "@jortizpa" + - "@joseorpa" reviewers: - - "@Miciah" - - "@frobware" - - "@candita" - - "@danehans" + - "TBD" + - "TBD" + - "TBD" + - "TBD" approvers: - - "@deads2k" + - "TBD" api-approvers: - - "@JoelSpeed" - - "@deads2k" -creation-date: 2025-01-14 -last-updated: 2025-01-14 + - "TBD" + - "TBD" +creation-date: 2025-10-28 +last-updated: 2025-10-28 tracking-link: - https://issues.redhat.com/browse/RFE-1476 see-also: @@ -67,17 +67,6 @@ Acceptance Criteria: - Operator performs adequately under high load - Configuration survives operator restarts and upgrades -#### Story 3: Compliance Requirements - -As a compliance officer, I need all pods in my OpenShift cluster to have both -resource requests and limits defined for auditing, cost allocation, and capacity -planning purposes. 
- -Acceptance Criteria: -- All operator containers can have limits configured -- Configuration is auditable via oc commands -- Meets organizational policy requirements - ### Goals - Allow configuration of resource requests and limits for ingress-operator containers @@ -114,7 +103,7 @@ Acceptance Criteria: ### API Extensions Create a new v1alpha1 API version for IngressController in the -`operator.openshift.io` group, following the pattern established by +`operator.openshift.io` group, following the pattern made for example by [cluster monitoring v1alpha1 configuration](https://github.com/openshift/api/blob/94481d71bb6f3ce6019717ea7900e6f88f42fa2c/config/v1alpha1/types_cluster_monitoring.go#L172-L193). #### New API Types @@ -258,7 +247,7 @@ Particularly beneficial for single-node OpenShift (SNO) deployments where: #### API Versioning Strategy -- **v1 API**: Remains stable and unchanged (storage version) +- **v1 API**: Remains stable and unchanged (stored version) - **v1alpha1 API**: Served but not stored - **Conversion**: Automatic conversion between versions via conversion webhooks - **Field handling**: v1alpha1-specific fields are dropped when reading via v1 API @@ -444,7 +433,7 @@ This section describes criteria for graduating from v1alpha1 to v1 (stable API). - [ ] Upgrade/downgrade tested extensively - [ ] API review completed and approved for promotion -Timeline estimate: 6-12 months after Tech Preview release +Timeline estimate: Next major release after Tech Preview release #### Removing a deprecated feature @@ -501,7 +490,7 @@ During cluster upgrades, the API server may be updated before or after the ingre - API server doesn't serve v1alpha1 yet - Users continue using v1 API until API server updates -**Maximum skew**: 1 minor version (OpenShift standard) +**Maximum skew**: 1 minor version ### Operational Aspects of API Extensions @@ -556,7 +545,7 @@ oc get pod -n openshift-ingress-operator -o jsonpath='{.items[*].status.qosClass ## Implementation History -- 2025-01-14: Enhancement proposed +- 2025-10-28: Enhancement proposed - TBD: Enhancement approved - TBD: API implementation merged to openshift/api - TBD: Controller implementation merged to cluster-ingress-operator @@ -650,7 +639,7 @@ Leverage Operator Lifecycle Manager (OLM) subscription configuration. 
### Development Infrastructure -- Standard OpenShift CI/CD pipeline (already exists) +- Standard OpenShift CI/CD pipeline - No special hardware or cloud resources required ### Testing Infrastructure From 97527c23a23c0c8b4052491b9e41ab2469c40ef8 Mon Sep 17 00:00:00 2001 From: Jose Ortiz Date: Mon, 27 Oct 2025 16:29:21 +0100 Subject: [PATCH 3/4] Update operator-resource-configuration.md Minor updates --- enhancements/ingress/operator-resource-configuration.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/enhancements/ingress/operator-resource-configuration.md b/enhancements/ingress/operator-resource-configuration.md index 21e9132ffd..d1d692bb74 100644 --- a/enhancements/ingress/operator-resource-configuration.md +++ b/enhancements/ingress/operator-resource-configuration.md @@ -12,8 +12,8 @@ approvers: api-approvers: - "TBD" - "TBD" -creation-date: 2025-10-28 -last-updated: 2025-10-28 +creation-date: 2025-10-27 +last-updated: 2025-10-27 tracking-link: - https://issues.redhat.com/browse/RFE-1476 see-also: @@ -545,7 +545,7 @@ oc get pod -n openshift-ingress-operator -o jsonpath='{.items[*].status.qosClass ## Implementation History -- 2025-10-28: Enhancement proposed +- 2025-10-27: Enhancement proposed - TBD: Enhancement approved - TBD: API implementation merged to openshift/api - TBD: Controller implementation merged to cluster-ingress-operator From b498028ac110356020cada96b70ca4569d230355 Mon Sep 17 00:00:00 2001 From: jortizpa Date: Wed, 29 Oct 2025 12:59:40 +0100 Subject: [PATCH 4/4] Some adjustements after first review --- .../operator-resource-configuration.md | 796 +++++++++++------- 1 file changed, 510 insertions(+), 286 deletions(-) diff --git a/enhancements/ingress/operator-resource-configuration.md b/enhancements/ingress/operator-resource-configuration.md index d1d692bb74..96a2e86134 100644 --- a/enhancements/ingress/operator-resource-configuration.md +++ b/enhancements/ingress/operator-resource-configuration.md @@ -1,19 +1,15 @@ --- -title: ingress-operator-resource-configuration +title: ingress-router-resource-configuration authors: - "@joseorpa" reviewers: - - "TBD" - - "TBD" - - "TBD" - - "TBD" + - "@miciah" approvers: - - "TBD" + - TBD api-approvers: - - "TBD" - - "TBD" + - TBD creation-date: 2025-10-27 -last-updated: 2025-10-27 +last-updated: 2025-10-29 tracking-link: - https://issues.redhat.com/browse/RFE-1476 see-also: @@ -22,67 +18,97 @@ replaces: [] superseded-by: [] --- -# Ingress Operator Resource Configuration +# Ingress Router Resource Configuration ## Summary -This enhancement proposes adding the ability to configure resource limits and -requests for the ingress-operator deployment containers via a new v1alpha1 API -field in the IngressController custom resource. +This enhancement proposes adding the ability to configure resource limits for +ingress router pods (HAProxy deployments) via a new field in the v1 IngressController +API, gated behind a feature gate. This will allow router pods to achieve Guaranteed +QoS class by setting limits equal to requests. ## Motivation -Currently, the ingress-operator deployment has hardcoded resource requests -(CPU: 10m, Memory: 56Mi for the main container, and CPU: 10m, Memory: 40Mi for -the kube-rbac-proxy sidecar) with no resource limits defined. This presents -challenges for: - -1. **Clusters with resource constraints**: Cannot guarantee QoS guarantees without limits -2. **Large-scale deployments**: May need higher resource allocation -3. 
**Compliance requirements**: Some organizations require all pods have limits -4. **Resource accounting**: Better cost allocation and resource planning - -Related to [RFE-1476](https://issues.redhat.com/browse/RFE-1476). +Currently, ingress router pods (the HAProxy deployments that handle ingress traffic) +are created with resource requests only (CPU: 200m, Memory: 256Mi) but no resource +limits defined. According to [RFE-1476](https://issues.redhat.com/browse/RFE-1476), +this presents challenges for: + +1. **QoS Class Requirements**: Without limits, router pods have "Burstable" QoS class. + Setting limits equal to requests achieves "Guaranteed" QoS class, providing better + stability and predictability for critical ingress infrastructure. +2. **Compliance requirements**: Some organizations require all pods have both requests + and limits defined for auditing, cost allocation, and capacity planning purposes. +3. **Resource accounting**: Better cost allocation and resource planning when limits + are explicitly defined. +4. **Cluster resource constraints**: Guaranteed QoS provides better protection against + resource contention and eviction. + +While the IngressController v1 API currently allows configuring resource requests via +`spec.nodePlacement.resources`, customers need the ability to also set **limits** to +achieve Guaranteed QoS class. This enhancement introduces this capability via a new +field in the v1 API, protected behind a feature gate during the Tech Preview period. ### User Stories -#### Story 1: Platform Administrator Needs Resource Limits +#### Story 1: Platform Administrator Needs Guaranteed QoS for Router Pods -As a platform administrator, I want to set resource limits on the ingress-operator -to ensure it has a QoS class of "Guaranteed" for critical cluster infrastructure. +As a platform administrator, I want to set resource limits on ingress router pods +to ensure they have a QoS class of "Guaranteed" for critical ingress infrastructure. +Currently, router pods only have requests defined, giving them "Burstable" QoS class, +which makes them susceptible to resource contention and potential eviction. Acceptance Criteria: - Can specify resource limits via IngressController CR -- Operator pod reflects the configured limits -- Pod achieves QoS class "Guaranteed" when requests == limits +- Router pods reflect the configured limits +- Router pods achieve QoS class "Guaranteed" when requests == limits +- Configuration applies to all router pod replicas + +#### Story 2: High-Traffic Application Requiring Resource Guarantees + +As an operator of a high-traffic e-commerce platform, I need guaranteed resource +allocation for my ingress router pods to ensure consistent performance during traffic +spikes (seasonal sales, marketing events). Without resource limits, the pods have +Burstable QoS and may be throttled or evicted under cluster resource pressure, +causing service disruptions. + +Acceptance Criteria: +- Can configure resource limits matching requests for Guaranteed QoS +- Router pods maintain stable performance under cluster resource pressure +- Configuration survives router pod restarts and upgrades -#### Story 2: Large Cluster Operator +#### Story 3: Resource-Constrained Edge Deployment -As an operator of a large-scale cluster with thousands of routes, I need to -increase the ingress-operator's resource allocation to handle the increased load -from managing many IngressController instances and routes. 
+As an operator of an edge computing deployment with limited resources, I need to +set strict resource limits on ingress router pods to prevent them from consuming +excessive resources and impacting other critical workloads. Without limits, router +pods with Burstable QoS can burst beyond their requests, potentially starving other +pods in the resource-constrained environment. Acceptance Criteria: -- Can configure higher CPU and memory allocations -- Operator performs adequately under high load -- Configuration survives operator restarts and upgrades +- Can configure resource limits to cap maximum resource consumption +- Router pods do not exceed defined resource boundaries +- Guaranteed QoS ensures router pods get their allocated resources under pressure +- Other workloads on the node are protected from router resource overuse ### Goals -- Allow configuration of resource requests and limits for ingress-operator containers -- Follow established patterns from cluster monitoring configuration +- Allow configuration of resource limits for ingress router pods (HAProxy containers) +- Add feature to v1 API protected by a feature gate for simplicity - Maintain backward compatibility with existing IngressController v1 API -- Use v1alpha1 API version for this Tech Preview feature +- Use feature gate for Tech Preview period, then promote to Default feature set for GA +- Enable router pods to achieve Guaranteed QoS class - Provide sensible defaults that work for most deployments -- Support both the ingress-operator and kube-rbac-proxy containers +- Support configuration for router container and sidecar containers (logs, metrics) ### Non-Goals -- Configuring resources for router pods (these are separate workloads managed by the operator) -- Auto-scaling or dynamic resource adjustment based on load -- Configuring resources for other operators in the cluster -- Modifying the v1 API (stable API remains unchanged) +- Configuring resources for the ingress-operator deployment itself (the controller) +- Auto-scaling or dynamic resource adjustment based on traffic load +- Modifying the existing v1 API `spec.nodePlacement.resources` field (remains unchanged) +- Creating a separate v1alpha1 API version (using v1 with feature gate instead) - Vertical Pod Autoscaler (VPA) integration (may be future work) +- Horizontal Pod Autoscaler (HPA) configuration (separate concern) ## Proposal @@ -90,100 +116,160 @@ Acceptance Criteria: **Platform Administrator** is a human responsible for configuring the OpenShift cluster. -1. Platform administrator creates or updates an IngressController CR using the - v1alpha1 API version -2. Platform administrator sets the `operatorResourceRequirements` field with - desired resource limits/requests -3. The ingress-operator watches for changes to the IngressController CR -4. A new operator deployment controller reconciles the operator's own deployment - with the specified resources -5. Kubernetes performs a rolling restart of the operator pods with the new resource configuration -6. Platform administrator verifies the changes with `oc describe deployment ingress-operator -n openshift-ingress-operator` +1. Platform administrator enables the `IngressRouterResourceLimits` feature gate +2. Platform administrator creates or updates an IngressController CR using the v1 API +3. Platform administrator sets the new `resources` field with desired resource limits/requests +4. The ingress-operator watches for changes to the IngressController CR +5. 
The ingress-operator reconciles the router deployment with the specified resources +6. Kubernetes performs a rolling restart of the router pods with the new resource configuration +7. Router pods achieve Guaranteed QoS class (when limits == requests) +8. Platform administrator verifies the changes with `oc describe deployment router-default -n openshift-ingress` +9. Platform administrator confirms QoS class with `oc get pod -n openshift-ingress -o jsonpath='{.items[*].status.qosClass}'` ### API Extensions -Create a new v1alpha1 API version for IngressController in the -`operator.openshift.io` group, following the pattern made for example by -[cluster monitoring v1alpha1 configuration](https://github.com/openshift/api/blob/94481d71bb6f3ce6019717ea7900e6f88f42fa2c/config/v1alpha1/types_cluster_monitoring.go#L172-L193). +Add a new field to the existing v1 IngressController API in the `operator.openshift.io` +group, gated behind a feature gate. This approach is preferred by the networking team +for its simplicity - adding the feature directly to the stable v1 API while protecting +it behind a feature gate during the Tech Preview period. + +The new field allows configuring resource **limits** for router pods. The existing v1 API's +`spec.nodePlacement.resources` field currently allows setting requests, but this +enhancement adds a new field to also set limits, enabling router pods to achieve +Guaranteed QoS class. + +#### Feature Gate + +This feature requires a new feature gate: **`IngressRouterResourceLimits`** + +The feature gate controls whether the new v1 API field is recognized and enforced +by the ingress-operator. Initially, the feature gate will be part of the +TechPreviewNoUpgrade feature set, and will be promoted to the Default feature set +once the feature graduates to GA. + +**Enabling the Feature Gate:** + +To enable the feature gate in your OpenShift cluster, you can use either the patch command +or apply a FeatureGate configuration. + +**Option 1: Enable all Tech Preview features (includes IngressRouterResourceLimits):** +```bash +oc patch featuregate cluster --type merge --patch '{"spec":{"featureSet":"TechPreviewNoUpgrade"}}' +``` + +**Option 2: Enable only the specific feature gate using patch command:** +```bash +oc patch featuregate cluster --type merge --patch '{"spec":{"featureSet":"CustomNoUpgrade","customNoUpgrade":{"enabled":["IngressRouterResourceLimits"]}}}' +``` + +**Option 3: Apply a custom FeatureGate configuration file:** +```yaml +apiVersion: config.openshift.io/v1 +kind: FeatureGate +metadata: + name: cluster +spec: + featureSet: CustomNoUpgrade + customNoUpgrade: + enabled: + - IngressRouterResourceLimits +``` + +Apply with: +```bash +oc apply -f featuregate.yaml +``` + +**Note**: Enabling feature gates may require cluster components to restart. For +production environments, test in non-production clusters first. Using `TechPreviewNoUpgrade` +or `CustomNoUpgrade` means the cluster cannot be upgraded and should only be used for +testing. + +#### API Changes -#### New API Types +Add a new field to the existing `IngressControllerSpec` in the v1 API: ```go -package v1alpha1 +package v1 import ( metav1 "k8s.io/apimachinery/pkg/apis/meta/v1" corev1 "k8s.io/api/core/v1" - - operatorv1 "github.com/openshift/api/operator/v1" ) -// IngressController describes a managed ingress controller for the cluster. -// This is a v1alpha1 Tech Preview API that extends the v1 API with additional -// configuration options. 
-// -// Compatibility level 4: No compatibility is provided, the API can change at any point for any reason. -// +openshift:compatibility-gen:level=4 -type IngressController struct { - metav1.TypeMeta `json:",inline"` - metav1.ObjectMeta `json:"metadata,omitempty"` - - // spec is the specification of the desired behavior of the IngressController. - Spec IngressControllerSpec `json:"spec,omitempty"` - - // status is the most recently observed status of the IngressController. - Status operatorv1.IngressControllerStatus `json:"status,omitempty"` -} - -// IngressControllerSpec extends the v1 IngressControllerSpec with v1alpha1 fields. +// IngressControllerSpec is the specification of the desired behavior of the IngressController. type IngressControllerSpec struct { - // Embed the entire v1 spec for backwards compatibility - operatorv1.IngressControllerSpec `json:",inline"` + // ... existing v1 fields ... - // operatorResourceRequirements defines resource requirements for the - // ingress operator's own containers (not the router pods managed by the operator). - // This allows configuring CPU and memory limits/requests for the operator deployment. + // tuning defines parameters for tuning the performance of ingress controller pods. + // +optional + Tuning *IngressControllerTuning `json:"tuning,omitempty"` + + // resources defines resource requirements (requests and limits) for the + // router pods (HAProxy containers). This field allows setting resource limits + // to achieve Guaranteed QoS class for router pods. + // + // When this field is set, it takes precedence over spec.nodePlacement.resources + // for configuring router pod resources. // - // When not specified, the operator uses default resource requirements: - // ingress-operator container: requests(cpu: 10m, memory: 56Mi), limits(cpu: 10m, memory: 56Mi) - // kube-rbac-proxy container: requests(cpu: 10m, memory: 40Mi), limits(cpu: 10m, memory: 40Mi) + // When not specified, defaults to: + // router container: + // requests: cpu: 200m, memory: 256Mi + // limits: none (Burstable QoS) // - // Note: Changing these values will cause the ingress-operator pod to restart. + // To achieve Guaranteed QoS, set limits equal to requests: + // resources: + // routerContainer: + // requests: + // cpu: 200m + // memory: 256Mi + // limits: + // cpu: 200m + // memory: 256Mi + // + // Note: Changing these values will cause router pods to perform a rolling restart. // // +optional - // +openshift:enable:FeatureGate=IngressOperatorResourceManagement - OperatorResourceRequirements *OperatorResourceRequirements `json:"operatorResourceRequirements,omitempty"` + // +openshift:enable:FeatureGate=IngressRouterResourceLimits + Resources *RouterResourceRequirements `json:"resources,omitempty"` } -// OperatorResourceRequirements defines resource requirements for ingress operator containers. -// Similar to the pattern used in cluster monitoring configuration. -type OperatorResourceRequirements struct { - // ingressOperatorContainer specifies resource requirements for the - // ingress-operator container in the operator deployment. +// RouterResourceRequirements defines resource requirements for ingress router pod containers. +type RouterResourceRequirements struct { + // routerContainer specifies resource requirements (requests and limits) for the + // router (HAProxy) container in router pods. 
// // If not specified, defaults to: - // requests: cpu: 10m, memory: 56Mi - // limits: cpu: 10m, memory: 56Mi + // requests: cpu: 200m, memory: 256Mi + // limits: none // // +optional - IngressOperatorContainer *corev1.ResourceRequirements `json:"ingressOperatorContainer,omitempty"` + RouterContainer *corev1.ResourceRequirements `json:"routerContainer,omitempty"` - // kubeRbacProxyContainer specifies resource requirements for the - // kube-rbac-proxy sidecar container in the operator deployment. + // metricsContainer specifies resource requirements for the metrics sidecar + // container in router pods. // - // If not specified, defaults to: - // requests: cpu: 10m, memory: 40Mi - // limits: cpu: 10m, memory: 40Mi + // If not specified, uses Kubernetes default behavior (no requests or limits). + // + // +optional + MetricsContainer *corev1.ResourceRequirements `json:"metricsContainer,omitempty"` + + // logsContainer specifies resource requirements for the logs sidecar container + // in router pods (if logs sidecar is enabled). + // + // If not specified, uses Kubernetes default behavior (no requests or limits). // // +optional - KubeRbacProxyContainer *corev1.ResourceRequirements `json:"kubeRbacProxyContainer,omitempty"` + LogsContainer *corev1.ResourceRequirements `json:"logsContainer,omitempty"` } ``` #### Example Usage +**Example 1: Setting limits to achieve Guaranteed QoS** + ```yaml -apiVersion: operator.openshift.io/v1alpha1 +apiVersion: operator.openshift.io/v1 kind: IngressController metadata: name: default @@ -193,114 +279,199 @@ spec: replicas: 2 domain: apps.example.com - # New v1alpha1 field for operator resource configuration - operatorResourceRequirements: - ingressOperatorContainer: + # New resources field for router pod resource configuration with limits + # This achieves Guaranteed QoS by setting limits equal to requests + # Requires IngressRouterResourceLimits feature gate to be enabled + resources: + routerContainer: requests: - cpu: 20m - memory: 100Mi + cpu: 200m + memory: 256Mi limits: - cpu: 100m - memory: 200Mi - kubeRbacProxyContainer: + cpu: 200m + memory: 256Mi +``` + +**Example 2: Higher resources for high-traffic clusters** + +```yaml +apiVersion: operator.openshift.io/v1 +kind: IngressController +metadata: + name: default + namespace: openshift-ingress-operator +spec: + replicas: 3 + domain: apps.example.com + + # Configure resources for router and metrics containers + resources: + routerContainer: requests: - cpu: 10m - memory: 40Mi + cpu: 500m + memory: 512Mi limits: + cpu: 1000m + memory: 1Gi + metricsContainer: + requests: cpu: 50m - memory: 80Mi + memory: 64Mi + limits: + cpu: 100m + memory: 128Mi +``` + +**Example 3: Precedence - new resources field over nodePlacement.resources** + +```yaml +apiVersion: operator.openshift.io/v1 +kind: IngressController +metadata: + name: default + namespace: openshift-ingress-operator +spec: + replicas: 2 + + # Existing nodePlacement.resources field (will be ignored when spec.resources is set) + nodePlacement: + nodeSelector: + matchLabels: + node-role.kubernetes.io/worker: "" + resources: + requests: + cpu: 100m + memory: 128Mi + + # New resources field takes precedence over nodePlacement.resources + resources: + routerContainer: + requests: + cpu: 200m + memory: 256Mi + limits: + cpu: 200m + memory: 256Mi ``` #### API Validation The following validations will be enforced: -1. **Resource limits must be >= requests**: Kubernetes standard validation -2. 
**Minimum values** (recommendations, not enforced): - - ingress-operator container: cpu >= 10m, memory >= 56Mi - - kube-rbac-proxy container: cpu >= 10m, memory >= 40Mi -3. **API conversion**: v1alpha1-specific fields are dropped when converting to v1 +1. **Resource limits must be >= requests**: Kubernetes standard validation enforced by API server +2. **Feature gate check**: If `IngressRouterResourceLimits` feature gate is disabled, + the `resources` field will be ignored (with a warning event logged) +3. **Minimum values** (recommendations, not hard limits): + - Router container: cpu >= 100m, memory >= 128Mi recommended for production + - Values below recommendations will generate warning events but not block the request +4. **Precedence validation**: When both `spec.resources` and `spec.nodePlacement.resources` + are set, `spec.resources` takes precedence and a warning event is logged about the + ignored `nodePlacement.resources` field ### Topology Considerations #### Hypershift / Hosted Control Planes In Hypershift environments: -- The management cluster runs the ingress-operator for the management cluster -- Each hosted cluster's control plane runs its own ingress-operator +- The management cluster has its own IngressController for management traffic +- Each hosted cluster has its own IngressController for guest traffic - This enhancement applies to both contexts independently -- Configuration is specific to each IngressController instance +- Configuration is specific to each IngressController instance's router pods +- Router pods for hosted clusters run in the hosted cluster namespace #### Standalone Clusters -Standard behavior - configuration applies to the cluster's ingress-operator deployment -in the `openshift-ingress-operator` namespace. +Standard behavior - configuration applies to router pod deployments in the +`openshift-ingress` namespace managed by the IngressController. #### Single-node Deployments Particularly beneficial for single-node OpenShift (SNO) deployments where: - Resource constraints are tighter -- Setting appropriate limits helps prevent resource contention -- Guaranteed QoS class improves stability +- Setting appropriate limits helps prevent resource contention for router pods +- Guaranteed QoS class improves stability for critical ingress infrastructure +- Single router replica benefits from predictable resource allocation ### Implementation Details/Notes/Constraints -#### API Versioning Strategy +#### Feature Gate Strategy -- **v1 API**: Remains stable and unchanged (stored version) -- **v1alpha1 API**: Served but not stored -- **Conversion**: Automatic conversion between versions via conversion webhooks -- **Field handling**: v1alpha1-specific fields are dropped when reading via v1 API -- **Compatibility**: Existing v1 clients continue working without changes +- **Feature Gate**: `IngressRouterResourceLimits` +- **Initial State**: Part of TechPreviewNoUpgrade feature set +- **GA Promotion**: Move to Default feature set when graduating to GA +- **Field Protection**: The new `spec.resources` field is protected by the feature gate +- **Compatibility**: Existing v1 API clients continue working; new field ignored when feature gate disabled #### Controller Implementation -A new controller (`operator-deployment-controller`) in the cluster-ingress-operator -watches the default IngressController CR and reconciles the operator's own deployment -when `operatorResourceRequirements` is specified. - -**Controller responsibilities:** -1. 
Watch IngressController resources (v1alpha1) -2. Reconcile `ingress-operator` Deployment in `openshift-ingress-operator` namespace -3. Update container resource specifications -4. Handle error cases gracefully (invalid values, conflicts, etc.) +The existing deployment controller in the cluster-ingress-operator will be enhanced to +handle the new `resources` field when reconciling router deployments. + +**Controller enhancements:** +1. Watch IngressController resources (v1 API) +2. Check `IngressRouterResourceLimits` feature gate status +3. When feature gate is enabled and `spec.resources` field is set: + - Use `spec.resources` field to configure router pod container resources + - Ignore `spec.nodePlacement.resources` field (log warning if both are set) +4. When feature gate is disabled or `spec.resources` field is not set: + - Fall back to `spec.nodePlacement.resources` behavior (current behavior) + - Log warning event if `spec.resources` is set but feature gate is disabled +5. Reconcile router `Deployment` in `openshift-ingress` namespace +6. Update container resource specifications (router, metrics, logs containers) +7. Handle error cases gracefully (invalid values, conflicts, etc.) +8. Generate events for configuration issues (warnings, validation failures) #### Default Behavior -When `operatorResourceRequirements` is not set or when using the v1 API: +When the `spec.resources` field is not set: + +**Current behavior (unchanged):** +- Router pods use `spec.nodePlacement.resources` if set +- If not set, defaults to: + - Router container: requests(cpu: 200m, memory: 256Mi), no limits + - QoS class: Burstable -**Current state** (what exists now): -- ingress-operator container: requests only (cpu: 10m, memory: 56Mi), no limits -- kube-rbac-proxy container: requests only (cpu: 10m, memory: 40Mi), no limits +**With `spec.resources` field set (and feature gate enabled):** +- Router pods use `spec.resources` configuration +- Users can set limits to achieve Guaranteed QoS: + - Router container: requests(cpu: 200m, memory: 256Mi), limits(cpu: 200m, memory: 256Mi) + - QoS class: Guaranteed -**New default** (after this enhancement): -- Static manifest updated to include limits matching requests -- ingress-operator container: requests(cpu: 10m, memory: 56Mi), limits(cpu: 10m, memory: 56Mi) -- kube-rbac-proxy container: requests(cpu: 10m, memory: 40Mi), limits(cpu: 10m, memory: 40Mi) -- This provides QoS class "Guaranteed" by default +**Backward compatibility:** +- Existing IngressControllers continue working unchanged +- New field is ignored when feature gate is disabled +- `spec.nodePlacement.resources` behavior remains available #### Upgrade Behavior When upgrading to a version with this enhancement: -1. Existing deployments get updated manifests with new default limits -2. IngressController CRs remain at v1 unless explicitly changed -3. No user action required for default behavior -4. Users can opt-in to v1alpha1 to customize resources +1. New `resources` field is added to v1 IngressController API (gated by feature gate) +2. Feature gate `IngressRouterResourceLimits` is part of TechPreviewNoUpgrade feature set +3. Existing IngressController CRs continue working unchanged +4. Router pods continue with existing resource configuration (no automatic changes) +5. No user action required - existing behavior preserved +6. Users can enable feature gate and use new `resources` field when ready +7. 
When the feature is promoted to GA, the feature gate moves to the Default feature set

### Risks and Mitigations

#### Risk: User sets resources too low, router pods become unhealthy

**Impact**: Router pods may be OOMKilled, fail to handle traffic, or become unresponsive,
causing ingress traffic disruptions

**Mitigation**:
- Document minimum recommended values (cpu: 200m, memory: 256Mi as a baseline)
- Add validation warnings (not blocking) for values below the minimums
- Include a troubleshooting guide for common issues (OOM, CPU throttling)
- Monitor router pod health metrics
- Provide example configurations for common scenarios (low/medium/high traffic)
- CPU throttling and memory pressure metrics are available via Prometheus

**Likelihood**: Medium

**Detection**: Router pod restarts, increased error rates, degraded performance

#### Risk: Incompatibility with existing tooling that does not recognize the new field

**Impact**: External tools may not recognize the new `spec.resources` field

**Mitigation**:
- Existing v1 fields remain unchanged and fully functional
- The new field is opt-in and gated behind the `IngressRouterResourceLimits` feature gate
- Tools that ignore unknown fields continue to work without changes
- Document the migration path from `spec.nodePlacement.resources`

**Likelihood**: Low

#### Risk: Router pod rolling restart causes brief traffic disruption

**Impact**: Configuration changes trigger a rolling restart of router pods, with potential
brief connection disruptions during pod replacement

**Mitigation**:
- Document that changes trigger a rolling restart (expected Kubernetes behavior)
- Rolling restart minimizes impact - only one pod is restarted at a time
- Connection draining allows graceful termination of existing connections
- The load balancer redistributes traffic to the remaining healthy pods
- Changes to router resources are not expected to be frequent operations
- Recommended to perform during maintenance windows for production systems

**Likelihood**: High (by design), **Severity**: Low to Medium (depends on traffic patterns)

#### Risk: Resource configuration drift

**Impact**: Manual changes to the router deployment could be overwritten by operator reconciliation

**Mitigation**:
- The operator reconciliation loop detects and corrects drift automatically
- Document that configuration must be done via the IngressController CR, not direct deployment edits
- Events are generated when drift is detected and corrected
- Router deployments are managed resources - manual changes are not supported

**Likelihood**: Low

### Drawbacks

1. **Increased API complexity**: Adds another field and configuration mechanism to the v1 API
2. 
 
 ### Drawbacks
 
-1. **Increased API complexity**: Adds another version and configuration surface
+1. **Increased API complexity**: Adds a feature-gated field and another configuration mechanism
 2. **Maintenance burden**: Requires maintaining v1alpha1 API version and conversion logic
-3. **Operator self-modification**: Operator modifying its own deployment adds complexity
-4. **Documentation overhead**: Need to document new field and migration path
-5. **Testing complexity**: Must test version conversion and upgrade scenarios
+3. **Overlapping configuration**: Two ways to configure resources (`spec.nodePlacement.resources`
+   and the new feature-gated `resources` field) may confuse users
+4. **Documentation overhead**: Need to document new field, precedence rules, and migration path
+5. **Testing complexity**: Must test upgrade scenarios and feature gate behavior
+6. **Feature gate dependency**: Adds operational complexity with feature gate management
 
 ## Design Details
 
 ### Open Questions
 
-1. **Q**: Should we support auto-scaling (VPA) in the future?
-   - **A**: Out of scope for initial implementation, but API should not preclude it
+1. **Q**: Should we support auto-scaling (VPA) for router pods in the future?
+   - **A**: Out of scope for initial implementation, but API design should not preclude it
 
-2. **Q**: Should we add validation for minimum resource values?
-   - **A**: Start with warnings/documentation, consider hard validation if issues arise
+2. **Q**: Should we add hard validation for minimum resource values?
+   - **A**: Start with warnings/documentation, consider hard validation if widespread issues arise
 
 3. **Q**: Should this apply to all IngressControllers or only the default?
-   - **A**: Initial implementation only default, but API supports any IngressController
+   - **A**: API supports any IngressController, including custom IngressControllers
+
+4. **Q**: What happens if both `spec.nodePlacement.resources` and `spec.resources` are set?
+   - **A**: `spec.resources` takes precedence, with a warning event logged. Documented in the API validation section.
 
-4. **Q**: How do we handle the operator modifying its own deployment safely?
-   - **A**: Use owner references carefully, reconcile loop with backoff
+5. **Q**: When should the feature gate be promoted to GA?
+   - **A**: Once Tech Preview proves stable, promote the feature gate to the Default feature set (see Graduation Criteria)
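+
+For orientation, the precedence answer above assumes a per-container structure for the
+feature-gated field. One possible Go shape is sketched below; the field names are
+placeholders pending API review (the `routerContainer` name mirrors the example used in the
+support procedures later in this document), not the final API.
+
+```go
+package sketch
+
+import (
+	corev1 "k8s.io/api/core/v1"
+)
+
+// RouterResourceRequirements is a hypothetical shape for the feature-gated
+// spec.resources field discussed above.
+type RouterResourceRequirements struct {
+	// routerContainer sets requests/limits for the main router container.
+	// Setting limits equal to requests yields the Guaranteed QoS class.
+	// +optional
+	RouterContainer *corev1.ResourceRequirements `json:"routerContainer,omitempty"`
+
+	// metricsContainer and logsContainer would follow the same pattern for the
+	// sidecar containers reconciled by the operator.
+	// +optional
+	MetricsContainer *corev1.ResourceRequirements `json:"metricsContainer,omitempty"`
+	// +optional
+	LogsContainer *corev1.ResourceRequirements `json:"logsContainer,omitempty"`
+}
+```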
 
 ### Test Plan
 
 #### Unit Tests
 
 - **API conversion tests**: v1 ↔ v1alpha1 conversion correctness
-- **Controller reconciliation logic**: Mock deployment updates
+- **Feature gate handling**: Behavior with gate enabled/disabled
+- **Controller reconciliation logic**: Mock router deployment updates with the new `resources` field
 - **Resource requirement validation**: Edge cases and invalid inputs
 - **Default value handling**: Ensure defaults applied correctly
+- **Precedence logic**: `spec.resources` overrides `spec.nodePlacement.resources` (see the
+  test sketch at the end of this Test Plan)
 
 Coverage target: >80% for new code
 
 #### Integration Tests
 
 - **API server integration**: v1alpha1 CRD registration and serving
-- **Conversion webhook**: Automatic conversion between versions
-- **Controller watches**: IngressController changes trigger reconciliation
+- **Conversion webhook**: Automatic conversion between v1 and v1alpha1 versions
+- **Controller watches**: IngressController changes trigger router deployment reconciliation
+- **Feature gate integration**: Verify feature gate controls field recognition
 
 #### E2E Tests
 
-- **Create IngressController with operatorResourceRequirements**
-  - Verify operator deployment is updated with correct resources
-  - Verify operator continues functioning normally
+- **Create IngressController with the feature-gated `resources` field**
+  - Verify router deployment is updated with correct resource limits
+  - Verify router pods achieve Guaranteed QoS class
+  - Verify router continues handling traffic normally
 
-- **Update existing IngressController to add resource requirements**
-  - Verify rolling update occurs
-  - Verify no disruption to router functionality
+- **Update existing IngressController to add resource limits**
  - Verify rolling update of router pods occurs
+  - Verify no traffic disruption during rolling restart
+  - Verify new pods have correct QoS class
 
 - **Remove resource requirements (revert to defaults)**
-  - Verify deployment reverts to default values
+  - Verify router deployment reverts to default values
+  - Verify router pods revert to Burstable QoS
+
+- **Test `resources` and `nodePlacement.resources` precedence**
+  - Set both `spec.nodePlacement.resources` and `spec.resources`
+  - Verify `spec.resources` takes precedence
+  - Verify warning event is generated
+
+- **Feature gate disabled scenario**
+  - Set the `resources` field with the feature gate disabled
+  - Verify field is ignored with warning
+  - Verify fallback to existing `nodePlacement.resources` behavior
 
 - **Upgrade scenario tests**
   - Upgrade from version without feature to version with feature
-  - Verify existing IngressControllers continue working
+  - Verify existing IngressControllers continue working unchanged
   - Verify v1 API remains functional
+  - Enable the feature gate and verify the new `resources` field works
 
 - **Downgrade scenario tests**
   - Downgrade from version with v1alpha1 to version without
   - Verify graceful degradation (v1alpha1 fields ignored)
+  - Verify router pods continue with v1 configuration
 
 #### Manual Testing
 
 - Test in resource-constrained environments (e.g., single-node)
-- Verify QoS class changes as expected (None → Burstable → Guaranteed)
+- Verify QoS class changes as expected (Burstable → Guaranteed)
 - Test with various resource configurations (very low, very high)
-- Test operator behavior when limits are hit (OOMKill, CPU throttling)
+- Test router pod behavior when limits are hit (OOMKill, CPU throttling)
 - Test with multiple IngressController instances
+- Monitor router performance metrics (latency, throughput) with different resource configs
+- Test traffic handling during resource limit OOM scenarios
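+
+The precedence unit test mentioned above could be table driven along the following lines.
+This is only a sketch: `effectiveRouterResources` refers to the illustrative helper shown
+earlier in this document, not to an existing function in the operator.
+
+```go
+package sketch
+
+import (
+	"testing"
+
+	corev1 "k8s.io/api/core/v1"
+	"k8s.io/apimachinery/pkg/api/equality"
+	"k8s.io/apimachinery/pkg/api/resource"
+)
+
+// TestEffectiveRouterResourcesPrecedence checks that the feature-gated spec.resources
+// value wins over nodePlacement.resources only when the feature gate is enabled.
+func TestEffectiveRouterResourcesPrecedence(t *testing.T) {
+	specRes := &corev1.ResourceRequirements{
+		Requests: corev1.ResourceList{corev1.ResourceCPU: resource.MustParse("500m")},
+		Limits:   corev1.ResourceList{corev1.ResourceCPU: resource.MustParse("500m")},
+	}
+	nodePlacementRes := &corev1.ResourceRequirements{
+		Requests: corev1.ResourceList{corev1.ResourceCPU: resource.MustParse("200m")},
+	}
+
+	cases := []struct {
+		name        string
+		gateEnabled bool
+		want        corev1.ResourceRequirements
+	}{
+		{name: "gate enabled, spec.resources wins", gateEnabled: true, want: *specRes},
+		{name: "gate disabled, fall back to nodePlacement", gateEnabled: false, want: *nodePlacementRes},
+	}
+	for _, tc := range cases {
+		t.Run(tc.name, func(t *testing.T) {
+			got := effectiveRouterResources(tc.gateEnabled, specRes, nodePlacementRes)
+			if !equality.Semantic.DeepEqual(got, tc.want) {
+				t.Errorf("got %v, want %v", got, tc.want)
+			}
+		})
+	}
+}
+```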
 
 ### Graduation Criteria
 
-#### Dev Preview -> Tech Preview (v1alpha1)
+#### Dev Preview -> Tech Preview (feature-gated v1 API)
 
 - [x] Feature implemented behind feature gate
 - [x] Unit and integration tests passing
 - [x] E2E tests passing in CI
 - [x] Documentation published in OpenShift docs
 - [x] Enhancement proposal approved
-- [ ] Feedback collected from at least 3 early adopters
-- [ ] Known issues documented
 
-#### Tech Preview -> GA (promotion to v1)
+#### Tech Preview -> GA (promote feature gate to Default)
 
-This section describes criteria for graduating from v1alpha1 to v1 (stable API).
+This section describes criteria for graduating from Tech Preview (feature gate in
+TechPreviewNoUpgrade) to GA (feature gate in Default). For a straightforward feature like
+this, the typical approach is to introduce it as Tech Preview and graduate it to GA within
+the same release cycle.
 
-- [ ] Sufficient field testing (2+ minor releases in Tech Preview)
-- [ ] No major bugs reported for 2 consecutive releases
-- [ ] Performance impact assessed and documented
-- [ ] API design validated by diverse user scenarios
-- [ ] At least 10 production users providing positive feedback
-- [ ] All tests consistently passing
+- [ ] Feature implemented and stable during Tech Preview period
+- [ ] No major bugs or design issues discovered during Tech Preview
+- [ ] Unit, integration, and E2E tests passing consistently
+- [ ] Performance impact assessed and documented (minimal/acceptable)
 - [ ] Documentation complete and reviewed
-- [ ] Upgrade/downgrade tested extensively
-- [ ] API review completed and approved for promotion
 
-Timeline estimate: Next major release after Tech Preview release
+**Timeline**: Promote to GA (move the feature gate to the Default feature set) in the same
+release cycle if the Tech Preview period shows no issues. If significant issues are
+discovered, address them and consider promotion in the next release.
 
 #### Removing a deprecated feature
 
@@ -445,34 +644,36 @@ N/A - this is a new feature
 
 **From version without feature → version with feature:**
 
-1. CRD updated to include v1alpha1 version
-2. Existing IngressController CRs remain at v1 (storage version)
-3. Operator deployment updated with default resource limits
-4. Users can opt-in to v1alpha1 API to customize resources
+1. v1 IngressController API updated with new `resources` field (protected by feature gate)
+2. Feature gate added to TechPreviewNoUpgrade feature set
+3. Existing IngressController CRs remain functional and unchanged
+4. Users can enable the feature gate and use the new `resources` field
 5. No breaking changes to existing functionality
 
 **User action required**: None for default behavior
-**User action optional**: Update to v1alpha1 API to customize operator resources
+**User action optional**: Enable the feature gate and use the new `resources` field to configure router pod resource limits
 
 #### Downgrade
 
 **From version with feature → version without feature:**
 
-1. v1alpha1 API becomes unavailable
-2. IngressController CRs remain at v1 (storage version, unaffected)
-3. v1alpha1-specific fields (operatorResourceRequirements) are ignored
-4. Operator deployment falls back to static manifest defaults
-5. No data loss as v1 remains storage version
+1. Feature gate `IngressRouterResourceLimits` no longer recognized
+2. IngressController CRs with `spec.resources` field set will have it ignored
+3. The field remains in the CR but is not processed
+4. Router pods fall back to `spec.nodePlacement.resources` configuration or defaults
+5. No data loss - the CR remains valid, the field is just ignored
+6. Router pods may lose Guaranteed QoS if it was only configured via `spec.resources`
 
-**User impact**: Loss of custom operator resource configuration, reverts to defaults
+**User impact**: Loss of custom router resource limits configured via the feature-gated
+field; reverts to `nodePlacement.resources` or defaults
 
 #### Version Skew
 
 Supported version skew follows standard OpenShift practices:
 - API server and operator may be one minor version apart during upgrades
 - v1 API compatibility maintained across all versions
-- Conversion webhooks handle any necessary translations
+- Feature gate status synchronized across components
 
 ### Version Skew Strategy
 
@@ -480,17 +681,17 @@ Supported version skew follows standard OpenShift practices:
 During cluster upgrades, the API server may be updated before or after the
 ingress-operator:
 
-**Scenario 1**: API server updated first (has v1alpha1), operator not yet updated
-- v1alpha1 CRs accepted by API server
-- Old operator version ignores v1alpha1 fields (reads via v1 API)
-- No impact, custom resources wait for operator upgrade
+**Scenario 1**: API server updated first (knows about new field), operator not yet updated
+- New `resources` field accepted by API server
+- Old operator version ignores the field (doesn't know about it yet)
+- No impact, field configuration waits for operator upgrade
 
-**Scenario 2**: Operator updated first (supports v1alpha1), API server not yet updated
-- Operator can handle v1alpha1 resources
-- API server doesn't serve v1alpha1 yet
-- Users continue using v1 API until API server updates
+**Scenario 2**: Operator updated first (can process new field), API server not yet updated
+- Operator can process the `resources` field once it is served
+- API server does not yet serve the new field (schema not yet updated)
+- Field takes effect once the API server is updated and the feature gate is enabled
 
-**Maximum skew**: 1 minor version
+**Maximum skew**: 1 minor version (OpenShift standard)
 
 ### Operational Aspects of API Extensions
 
@@ -502,8 +703,8 @@ During cluster upgrades, the API server may be updated before or after the ingre
    - Operator continues with existing configuration
 
 2. **Controller failure**:
-   - Operator deployment remains at current configuration
-   - Deployment status reflects error
+   - Router deployment remains at current configuration
+   - IngressController status reflects error
   - Operator logs provide debugging information
 
 3. **API conversion failure**:
@@ -511,10 +712,17 @@ During cluster upgrades, the API server may be updated before or after the ingre
   - User notified of conversion issue
  - Existing resources unaffected
 
-4. **Operator restart loop due to low resources**:
+4. **Router pod restart loop due to low resources**:
   - Kubernetes backoff prevents rapid restarts
-   - Events and logs indicate resource pressure
-   - Admin can update IngressController to increase resources
+   - Events and logs indicate resource pressure (OOMKilled, etc.)
+   - Admin can update IngressController to increase resource limits
+   - Traffic may be degraded during restart loop
+
+5. **Feature gate disabled but `spec.resources` field used**:
+   - `spec.resources` field is ignored
+   - Warning event logged
+   - Falls back to `spec.nodePlacement.resources` behavior
+   - No traffic impact
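+
+To illustrate failure mode 5, the controller could surface the situation via a warning
+Event along the following lines. The reason string and surrounding wiring are illustrative,
+not an established convention.
+
+```go
+package sketch
+
+import (
+	corev1 "k8s.io/api/core/v1"
+	"k8s.io/apimachinery/pkg/runtime"
+	"k8s.io/client-go/tools/record"
+)
+
+// warnResourcesFieldIgnored emits a warning Event on the IngressController so that
+// administrators can see why spec.resources has no effect while the
+// IngressRouterResourceLimits feature gate is disabled.
+func warnResourcesFieldIgnored(recorder record.EventRecorder, ic runtime.Object) {
+	recorder.Event(ic, corev1.EventTypeWarning, "ResourcesFieldIgnored",
+		"spec.resources is set but the IngressRouterResourceLimits feature gate is disabled; "+
+			"falling back to spec.nodePlacement.resources")
+}
+```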
 
 #### Support Procedures
 
@@ -525,23 +733,37 @@ Standard OpenShift support procedures apply:
 # View IngressController configuration
 oc get ingresscontroller default -n openshift-ingress-operator -o yaml
 
-# View operator deployment
-oc describe deployment ingress-operator -n openshift-ingress-operator
+# View router deployment
+oc describe deployment router-default -n openshift-ingress
+
+# Check router pod resource configuration
+oc get deployment router-default -n openshift-ingress -o jsonpath='{.spec.template.spec.containers[*].resources}'
 
-# Check operator logs
-oc logs -n openshift-ingress-operator deployment/ingress-operator -c ingress-operator
+# Check router pod resource usage
+oc adm top pod -n openshift-ingress
 
-# Check pod resource usage
-oc adm top pod -n openshift-ingress-operator
+# Check router pod QoS class
+oc get pod -n openshift-ingress -l ingresscontroller.operator.openshift.io/deployment-ingresscontroller=default -o jsonpath='{.items[*].status.qosClass}'
 
-# Check QoS class
-oc get pod -n openshift-ingress-operator -o jsonpath='{.items[*].status.qosClass}'
+# Check router pod events (for OOMKilled, etc.)
+oc get events -n openshift-ingress --field-selector involvedObject.kind=Pod
+
+# Check operator logs for reconciliation
+oc logs -n openshift-ingress-operator deployment/ingress-operator -c ingress-operator | grep -i resource
+
+# Check feature gate status
+oc get featuregate cluster -o yaml | grep IngressRouterResourceLimits
 ```
 
 **Common issues and resolutions**:
-- OOMKilled operator: Increase memory limits
-- CPU throttling: Increase CPU limits or reduce requests if not needed
-- Configuration not applied: Check operator logs for reconciliation errors
+- **OOMKilled router pods**: Increase memory limits in `spec.resources.routerContainer`
+- **CPU throttling**: Increase CPU limits or verify requests match actual load
+- **Configuration not applied**:
+  - Check feature gate is enabled
+  - Check operator logs for errors
+  - Verify `spec.resources` field is properly set
+- **Burstable QoS when Guaranteed expected**: Ensure limits equal requests
+- **`spec.resources` field ignored**: Verify feature gate is enabled
 
 ## Implementation History
 
@@ -556,7 +778,7 @@ oc get pod -n openshift-ingress-operator -o jsonpath='{.items[*].status.qosClass
 
 ### Alternative 1: Configuration via ConfigMap
 
-Use a ConfigMap for operator resource configuration instead of API field.
+Use a ConfigMap for router pod resource configuration instead of an API field.
 
 **Pros**:
 - Simpler to implement
 
@@ -565,47 +787,34 @@ Use a ConfigMap for operator resource configuration instead of API field.
 **Cons**:
 - Less type-safe
-- Doesn't follow OpenShift patterns
+- Doesn't follow OpenShift patterns (IngressController is the proper API)
 - No automatic validation
 - Harder to discover and document
+- Not GitOps friendly
+- Separates this setting from other router configuration in the IngressController CR
 
-**Decision**: Rejected - API-based configuration is the established OpenShift pattern
-
-### Alternative 2: Modify v1 API directly
-
-Add `operatorResourceRequirements` field directly to stable v1 API.
-
-**Pros**:
-- No need for v1alpha1 version
-- Simpler for users (one API version)
-
-**Cons**:
-- Changes stable API (breaking compatibility promise)
-- Cannot iterate on design easily
-- Difficult to remove if issues found
-- Against OpenShift API stability guarantees
-
-**Decision**: Rejected - Use v1alpha1 for new features as per OpenShift conventions
+**Decision**: Rejected - API-based configuration via IngressController is the established OpenShift pattern
 
-### Alternative 3: Separate CRD for operator configuration
+### Alternative 2: Separate CRD for ingress configuration
 
-Create a new OperatorConfiguration CRD (similar to how cluster monitoring works).
+Create a new IngressConfiguration CRD separate from IngressController.
 
 **Pros**:
-- Separation of concerns
-- Can configure multiple operators uniformly
+- Separation of concerns (configuration vs. controller spec)
+- Could handle other ingress-level configuration
 
 **Cons**:
 - Increases API surface unnecessarily
-- IngressController is the logical place for ingress-operator configuration
+- IngressController is the logical place for router pod configuration
 - More CRDs to manage
-- Inconsistent with how other operators handle self-configuration
+- Inconsistent with existing IngressController design patterns
+- Confusing for users - where to configure what?
 
-**Decision**: Rejected - IngressController CR is the appropriate configuration location
+**Decision**: Rejected - IngressController CR is the appropriate location for router configuration
 
-### Alternative 4: Operator command-line flags or environment variables
+### Alternative 3: Router deployment annotations or environment variables
 
-Configure operator resources via deployment environment variables or command flags.
+Configure router pod resources via deployment annotations or environment variables.
 
 **Pros**:
 - Very simple to implement
 
@@ -613,33 +822,48 @@ Configure operator resources via deployment environment variables or command fla
 **Cons**:
 - Not GitOps friendly
-- Requires direct deployment modification
+- Requires direct deployment modification (operator would overwrite)
 - Not discoverable via API
 - Doesn't follow OpenShift declarative configuration patterns
 - Difficult to audit and version control
+- Operator reconciliation would fight manual changes
 
-**Decision**: Rejected - Declarative API configuration is required
+**Decision**: Rejected - Declarative API configuration via IngressController is required
 
-### Alternative 5: Use OperatorHub/OLM configuration
+### Alternative 4: Use v1alpha1 API version instead of v1 with feature gate
 
-Leverage Operator Lifecycle Manager (OLM) subscription configuration.
+Create a separate v1alpha1 API version for IngressController and add the new `resources`
+field there, following the pattern used by some other OpenShift components like cluster
+monitoring.
 **Pros**:
-- Follows OLM patterns
-- Could work for OLM-managed operators
+- Clear separation between stable (v1) and experimental (v1alpha1) APIs
+- Can iterate on API design during Tech Preview without affecting v1
+- Field only visible in v1alpha1, a clearer signal that it is experimental
+- Can change or remove the field design if Tech Preview reveals issues
+- Follows pattern used by some OpenShift components
 
 **Cons**:
-- Ingress operator is not OLM-managed (it's a cluster operator)
-- Adds OLM dependency
-- Not applicable to this operator's deployment model
-
-**Decision**: Rejected - Not applicable to cluster operators
+- Adds API complexity with multiple versions
+- Requires API conversion webhooks between v1 and v1alpha1
+- Users must explicitly switch to the v1alpha1 API to use the feature
+- More maintenance burden (two API versions to support)
+- Need to eventually promote changes back to v1 for GA
+- The networking team prefers the simpler approach of v1 with a feature gate
+- For straightforward features like this, v1alpha1 adds unnecessary complexity
+- Not all OpenShift APIs follow this pattern - many add fields to v1 with feature gates
+
+**Decision**: Rejected in favor of adding the field directly to the v1 API, protected by a
+feature gate. This avoids API versioning complexity while still providing Tech Preview
+protection. For a straightforward additive feature like this, the additional complexity of
+v1alpha1 is not warranted.
 
 ## Infrastructure Needed
 
 ### Development Infrastructure
 
-- Standard OpenShift CI/CD pipeline
+- Standard OpenShift CI/CD pipeline (already exists)
 - No special hardware or cloud resources required
 
 ### Testing Infrastructure