
Commit 9eac095

moving damping constant to class global
1 parent 7cf8d25 commit 9eac095

2 files changed: +13 -13


docs/source/overview/imitation-learning/teleop_imitation.rst

Lines changed: 9 additions & 9 deletions
@@ -293,12 +293,12 @@ Visualizing results
 ^^^^^^^^^^^^^^^^^^^
 
 .. tip::
-
+
    **Important: Testing Multiple Checkpoint Epochs**
-
-   When evaluating policy performance, it is common for different training epochs to yield significantly different results.
-   If you don't see the expected performance, **always test policies from various epochs** (not just the final checkpoint)
-   to find the best-performing model. Model performance can vary substantially across training, and the final epoch
+
+   When evaluating policy performance, it is common for different training epochs to yield significantly different results.
+   If you don't see the expected performance, **always test policies from various epochs** (not just the final checkpoint)
+   to find the best-performing model. Model performance can vary substantially across training, and the final epoch
    is not always optimal.
 
 By inferencing using the generated model, we can visualize the results of the policy:
@@ -326,7 +326,7 @@ By inferencing using the generated model, we can visualize the results of the po
 
 .. tip::
 
-   **If you don't see expected performance results:** Test policies from multiple checkpoint epochs, not just the final one.
+   **If you don't see expected performance results:** Test policies from multiple checkpoint epochs, not just the final one.
    Policy performance can vary significantly across training epochs, and intermediate checkpoints often outperform the final model.
 
 .. note::
@@ -530,7 +530,7 @@ Visualize the results of the trained policy by running the following command, us
 
 .. tip::
 
-   **If you don't see expected performance results:** It is critical to test policies from various checkpoint epochs.
+   **If you don't see expected performance results:** It is critical to test policies from various checkpoint epochs.
    Performance can vary significantly between epochs, and the best-performing checkpoint is often not the final one.
 
 .. figure:: https://download.isaacsim.omniverse.nvidia.com/isaaclab/images/gr-1_steering_wheel_pick_place_policy.gif
@@ -664,7 +664,7 @@ Visualize the trained policy performance:
 
 .. tip::
 
-   **If you don't see expected performance results:** Always test policies from various checkpoint epochs.
+   **If you don't see expected performance results:** Always test policies from various checkpoint epochs.
    Different epochs can produce significantly different results, so evaluate multiple checkpoints to find the optimal model.
 
 .. figure:: https://download.isaacsim.omniverse.nvidia.com/isaaclab/images/locomanipulation-g-1_steering_wheel_pick_place.gif
@@ -878,7 +878,7 @@ Visualize the results of the trained policy by running the following command, us
 
 .. tip::
 
-   **If you don't see expected performance results:** Test policies from various checkpoint epochs, not just the final one.
+   **If you don't see expected performance results:** Test policies from various checkpoint epochs, not just the final one.
    Policy performance can vary substantially across training, and intermediate checkpoints often yield better results.
 
 .. figure:: https://download.isaacsim.omniverse.nvidia.com/isaaclab/images/gr-1_nut_pouring_policy.gif
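
The tip repeated throughout this page recommends evaluating checkpoints from several epochs rather than only the final one. A minimal sketch of such a sweep is shown below; the model_epoch_<N>.pth naming and the evaluate_fn callable are assumptions made for illustration and are not part of the documented workflow.

# Hypothetical sketch: sweep saved checkpoints and keep the best-scoring one.
# The model_epoch_<N>.pth naming and the evaluate_fn callable are assumptions;
# substitute your project's own evaluation entry point.
import glob
import os
import re


def sweep_checkpoints(checkpoint_dir: str, evaluate_fn) -> tuple[str, float]:
    """Evaluate every checkpoint in a directory and return the best-scoring one.

    Args:
        checkpoint_dir: Directory containing model_epoch_<N>.pth files.
        evaluate_fn: Callable mapping a checkpoint path to a scalar score,
            e.g. success rate over a fixed batch of rollouts.
    """
    pattern = re.compile(r"model_epoch_(\d+)\.pth$")
    checkpoints = sorted(
        (p for p in glob.glob(os.path.join(checkpoint_dir, "*.pth")) if pattern.search(p)),
        key=lambda p: int(pattern.search(p).group(1)),
    )
    best_path, best_score = "", float("-inf")
    for path in checkpoints:
        score = evaluate_fn(path)
        print(f"{os.path.basename(path)}: score={score:.3f}")
        if score > best_score:
            best_path, best_score = path, score
    return best_path, best_score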

source/isaaclab/isaaclab/controllers/pink_ik/null_space_posture_task.py

Lines changed: 4 additions & 4 deletions
@@ -77,6 +77,9 @@ class NullSpacePostureTask(Task):
 
     """
 
+    # Regularization factor for pseudoinverse computation to ensure numerical stability
+    PSEUDOINVERSE_DAMPING_FACTOR: float = 1e-9
+
     def __init__(
         self,
         cost: float,
@@ -242,16 +245,13 @@ def compute_jacobian(self, configuration: Configuration) -> np.ndarray:
         # Use fast pseudoinverse computation with direct LAPACK/BLAS calls
         m, n = J_combined.shape
 
-        # Determine damping factor for numerical stability
-        damping = 1e-9
-
         # Wide matrix (typical for robotics): use left pseudoinverse
         # J^+ = J^T @ inv(J @ J^T + λ²I)
         # This is faster because we invert an m×m matrix instead of n×n
 
         # Compute J @ J^T using BLAS (faster than numpy)
         JJT = blas.dgemm(1.0, J_combined, J_combined.T)
-        np.fill_diagonal(JJT, JJT.diagonal() + damping**2)
+        np.fill_diagonal(JJT, JJT.diagonal() + self.PSEUDOINVERSE_DAMPING_FACTOR**2)
 
         # Use LAPACK's Cholesky factorization (dpotrf = Positive definite TRiangular Factorization)
         L, info = lapack.dpotrf(JJT, lower=1, clean=False, overwrite_a=True)
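
For reference, the damped left pseudoinverse this hunk computes, J^+ = J^T (J J^T + λ²I)^-1, can be sketched end to end as follows. This is a standalone illustration built on SciPy's low-level wrappers (scipy.linalg.blas, scipy.linalg.lapack); the dpotri-based inversion and the NumPy fallback are assumptions, since the remainder of compute_jacobian after the Cholesky factorization is not shown in this diff.

# Standalone sketch of the damped left pseudoinverse described above:
#   J^+ = J^T @ inv(J @ J^T + damping**2 * I)
# Only the steps up to dpotrf appear in this diff; the dpotri inversion and
# the NumPy fallback below are illustrative assumptions.
import numpy as np
from scipy.linalg import blas, lapack

PSEUDOINVERSE_DAMPING_FACTOR = 1e-9  # mirrors the new class constant


def damped_left_pinv(J: np.ndarray, damping: float = PSEUDOINVERSE_DAMPING_FACTOR) -> np.ndarray:
    """Damped left pseudoinverse for a wide matrix J (more columns than rows)."""
    # J @ J^T via BLAS: an m x m matrix, cheaper to invert than n x n.
    JJT = blas.dgemm(1.0, J, J.T)
    np.fill_diagonal(JJT, JJT.diagonal() + damping**2)
    # Cholesky factorization of the symmetric positive-definite matrix.
    L, info = lapack.dpotrf(JJT, lower=1)
    if info != 0:
        # Factorization failed; fall back to a plain NumPy inverse.
        return J.T @ np.linalg.inv(JJT)
    # Invert from the Cholesky factor; dpotri fills only the lower triangle.
    JJT_inv, _ = lapack.dpotri(L, lower=1)
    JJT_inv = np.tril(JJT_inv) + np.tril(JJT_inv, -1).T
    return J.T @ JJT_inv


# Usage: a wide Jacobian (6 task rows, 20 joints), typical in robotics.
J = np.random.default_rng(0).standard_normal((6, 20))
assert np.allclose(damped_left_pinv(J), np.linalg.pinv(J), atol=1e-6)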
