Conversation

@hlee9212

Hi @nielsleadholm, here are the two files for the 2D Sensor Module that can currently extract edges. Let me know if you'd also like to see any experimental configs. :)

@nielsleadholm

Great, thanks Hojae! Just FYI, @vkakerbeck has offered to help review this, given my earlier-than-expected paternity leave.

@nielsleadholm nielsleadholm requested review from vkakerbeck and removed request for nielsleadholm October 23, 2025 09:54

@vkakerbeck vkakerbeck left a comment


Nice, this is very useful to read through. I left a couple of comments and questions, mostly around readability and variable names. On a higher level, it would be nice if we don't have to introduce opencv as a new dependency, especially if it is only used for things we are already doing with other libraries in Monty (like Gaussian smoothing and convolution).

Some things that would be nice to test, though they don't have to be part of this PR: keeping the color information instead of throwing it away, and comparing Gabor filters for more precise angle detection.
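On the opencv point, here is a minimal sketch of how the three `cv2.GaussianBlur` calls could be replaced with `scipy.ndimage.gaussian_filter`, assuming scipy is already available in Monty (the function name `smooth_tensor_components` is my own; `mode="mirror"` matches cv2's default reflect-101 border handling):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def smooth_tensor_components(Jxx, Jyy, Jxy, win_sigma):
    """Smooth the structure-tensor components without an opencv dependency.

    gaussian_filter truncates its kernel at `truncate * sigma` (default
    truncate=4.0), so an explicit kernel size can be reproduced by setting
    truncate = (ksize // 2) / sigma if exact parity with cv2 is needed.
    """
    return (
        gaussian_filter(Jxx, win_sigma, mode="mirror"),
        gaussian_filter(Jyy, win_sigma, mode="mirror"),
        gaussian_filter(Jxy, win_sigma, mode="mirror"),
    )
```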

(default: DEFAULT_WINDOW_SIGMA from edge_detection_utils)
- kernel_size: Kernel size for Gaussian blur
(default: DEFAULT_KERNEL_SIZE from edge_detection_utils)
- edge_threshold: Minimum edge strength threshold (default: 0.1)

What is the range for this value? What does 0.1 mean?

- kernel_size: Kernel size for Gaussian blur
(default: DEFAULT_KERNEL_SIZE from edge_detection_utils)
- edge_threshold: Minimum edge strength threshold (default: 0.1)
- coherence_threshold: Minimum coherence threshold (default: 0.05)

Can you elaborate a bit more here on what the coherence_threshold is? From the description I'm not getting much more information than the variable name already gives.

observed_state, telemetry = self._habitat_observation_processor.process(data)

if observed_state.use_state and observed_state.get_on_object():
observed_state = self.extract_2d_edge(

Why would this not be part of the _habitat_observation_processor? I am not sure if you are changing anything else in the SM, but if not, you could just implement a new observation processor instead of a new SM.


state.morphological_features["pose_vectors"] = np.vstack(
[
surface_normal,

If we get the pose from the surface texture, we don't want to include the surface normal here. The point of the 2D SM is that it sends the same information up for the logo on a flat surface as for a logo on a cylinder. This would not be the case if we include the surface normal. Surface normals encode information about the 3D structure of the object.
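To illustrate, here is a sketch of building the pose purely from the in-plane edge direction, with a fixed patch viewing axis in place of the surface normal (the function and argument names are hypothetical, and Monty's actual pose-vector convention may differ):

```python
import numpy as np

def texture_pose_vectors(edge_direction: np.ndarray) -> np.ndarray:
    """Build a 3x3 pose from the 2D edge direction alone (no surface normal).

    `edge_direction` is assumed to be a unit vector in the patch's image
    plane, e.g. (cos(theta), sin(theta), 0). The third axis is the fixed
    patch viewing axis, so the pose carries no information about the 3D
    structure of the surface the texture sits on.
    """
    view_axis = np.array([0.0, 0.0, 1.0])        # patch axis, not the surface normal
    ortho = np.cross(view_axis, edge_direction)  # in-plane vector orthogonal to the edge
    return np.vstack([edge_direction, ortho, view_axis])
```

With this, a logo on a flat surface and the same logo on a cylinder would send up identical pose vectors, which is the invariance the 2D SM is after.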

if "edge_strength" in self.features:
state.non_morphological_features["edge_strength"] = edge_strength
if "coherence" in self.features:
state.non_morphological_features["coherence"] = coherence

What is coherence? Maybe add some more info on that. It could also help to give it a more informative name like "2d_edge_coherence"

Comment on lines +113 to +115
Jxx = cv2.GaussianBlur(Jxx, (ksize, ksize), win_sigma) # noqa: N806
Jyy = cv2.GaussianBlur(Jyy, (ksize, ksize), win_sigma) # noqa: N806
Jxy = cv2.GaussianBlur(Jxy, (ksize, ksize), win_sigma) # noqa: N806

If we are only interested in the center pixel, do we need to Gaussian-blur the entire image? Couldn't we just look at a window around the center pixel and extract the most consistent edge direction there?
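To sketch what I mean (a box-weighted average over a window around the center, in place of blurring the full patch; Gaussian weights could be applied to the slice instead if needed, and the names here are my own):

```python
import numpy as np

def center_tensor_from_window(Ix, Iy, win_radius=7):
    """Average the structure tensor over a window around the patch center,
    instead of smoothing the full image.

    Ix, Iy: image gradients for the whole patch.
    Returns the smoothed (jxx, jyy, jxy) at the center pixel only.
    """
    r, c = Ix.shape[0] // 2, Ix.shape[1] // 2
    window = (slice(r - win_radius, r + win_radius + 1),
              slice(c - win_radius, c + win_radius + 1))
    wx, wy = Ix[window], Iy[window]
    return float((wx * wx).mean()), float((wy * wy).mean()), float((wx * wy).mean())
```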

r, c = get_patch_center(*gray.shape)
jxx, jyy, jxy = float(Jxx[r, c]), float(Jyy[r, c]), float(Jxy[r, c])

disc = np.sqrt((jxx - jyy) ** 2 + 4.0 * (jxy**2))

What does `disc` stand for? What is happening here?
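For reference, `disc` appears to be the discriminant of the structure tensor's characteristic polynomial; a sketch of the closed-form eigenvalue computation it feeds into (variable names beyond the PR's `disc`, `lam1`, `lam2` are my own):

```python
import numpy as np

def structure_tensor_eigenvalues(jxx, jyy, jxy):
    """Closed-form eigenvalues of the 2x2 symmetric structure tensor
    [[jxx, jxy], [jxy, jyy]].

    `disc` is the discriminant of its characteristic polynomial; a name
    like `eigval_discriminant` would make the intent clearer.
    """
    disc = np.sqrt((jxx - jyy) ** 2 + 4.0 * jxy ** 2)
    lam1 = 0.5 * (jxx + jyy + disc)  # gradient energy across the edge
    lam2 = 0.5 * (jxx + jyy - disc)  # gradient energy along the edge
    return lam1, lam2
```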


edge_strength = np.sqrt(max(lam1, 0.0))

coherence = (lam1 - lam2) / (lam1 + lam2 + EPSILON)

I still don't quite understand what coherence is after reading this. Can you try more expressive variable names, or add comments explaining what is being calculated here?
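To make the suggestion concrete, a sketch with more expressive documentation, assuming `lam1`/`lam2` are the larger/smaller structure-tensor eigenvalues and `EPSILON` a small constant as in the PR:

```python
EPSILON = 1e-12  # guards against division by zero on flat patches

def edge_coherence(lam1, lam2):
    """Anisotropy of the local gradient field, in [0, 1].

    lam1, lam2: larger and smaller structure-tensor eigenvalues.
    ~1: gradients all share one orientation (a clean, unambiguous edge).
    ~0: gradients point in many directions (texture, a corner, or noise),
        so the estimated edge direction is unreliable.
    """
    return (lam1 - lam2) / (lam1 + lam2 + EPSILON)
```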

Patch with annotations drawn on it.
"""
patch_with_pose = patch.copy()
center_y, center_x = patch.shape[0] // 2, patch.shape[1] // 2

If you want to add the get_patch_center function, you should use it here too.
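For reference, a hypothetical re-implementation consistent with how get_patch_center is used above (assuming it returns `(rows // 2, cols // 2)`; the real helper lives in the PR's edge_detection_utils):

```python
import numpy as np

# Hypothetical stand-in for the PR's helper, assumed to return the
# integer center of a patch of shape (rows, cols, ...).
def get_patch_center(rows: int, cols: int) -> tuple:
    return rows // 2, cols // 2

patch = np.zeros((64, 64, 3))
center_y, center_x = get_patch_center(*patch.shape[:2])
```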


from typing import Tuple

import cv2
