Specific improvements by offx-zinth · Pull Request #33 · offx-zinth/SMP

offx-zinth · 2026-04-25T11:32:13Z

No description provided.

gemini-code-assist

Code Review

This pull request implements the core infrastructure for the SMP Graph Engine, an ingest-free, memory-mapped graph database for code analysis. The changes include low-level file management with a Write-Ahead Log (WAL), indexing structures (Crit-bit and Radix trees), a deduplicated string pool, and a tree-sitter-based parsing engine for Python. While the architectural foundation is solid, the current implementation contains several critical bugs: serialized node and edge data are packed but never actually written to the memory-mapped file, and the indexing logic uses dummy pointers instead of real offsets. Additionally, there are logic errors regarding tree-sitter sibling traversal for decorators, case-sensitivity issues with NodeType mapping, and unsafe handling of header CRC mismatches.

gemini-code-assist · 2026-04-25T11:37:35Z

+            graph_nodes.append(
+                GraphNode(
+                    id=pnode.node_id,
+                    type=NodeType(pnode.type.upper()),


The NodeType enum in smp.core.models uses TitleCase values (e.g., "Function", "Class"). Calling .upper() on pnode.type (which is already "Function" or "Class") will result in "FUNCTION" or "CLASS", causing NodeType() to raise a ValueError because it is a StrEnum with case-sensitive matching.

Suggested change

type=NodeType(pnode.type.upper()),

type=NodeType(pnode.type),

gemini-code-assist · 2026-04-25T11:37:35Z

+    def _extract_decorators(self, node: Any, content: bytes) -> list[str]:
+        """Extract decorators."""
+        decorators: list[str] = []
+        for child in node.prev_sibling:


The node.prev_sibling property in tree-sitter returns a single Node object or None; it is not an iterable. Attempting to iterate over it will raise a TypeError. You should use a while loop to traverse previous siblings if you are looking for multiple decorators.

Suggested change

for child in node.prev_sibling:

curr = node.prev_sibling

while curr and curr.type == "decorator":

text = content[curr.start_byte : curr.end_byte].decode("utf-8", errors="replace")

decorators.append(text.strip())

curr = curr.prev_sibling

gemini-code-assist · 2026-04-25T11:37:35Z

+        if actual_crc != stored_crc:
+            pass


Ignoring a CRC mismatch in the file header is dangerous for a database as it indicates potential data corruption. This should raise an exception to prevent further operations on a corrupted file.

Suggested change

if actual_crc != stored_crc:

pass

if actual_crc != stored_crc:

raise ValueError("Header CRC mismatch: file may be corrupted")

gemini-code-assist · 2026-04-25T11:37:35Z

+        struct.pack(
+            "<BIII III I",
+            1,
+            name_off,
+            sig_off,
+            file_off,
+            node.structural.start_line,
+            node.structural.end_line or 0,
+            0,
+            0,
+        )


The result of struct.pack is being discarded. This method is intended to serialize the node data into the memory-mapped file, but it currently does nothing with the packed bytes.

gemini-code-assist · 2026-04-25T11:37:35Z

+        payload = struct.pack("<I", count)
+        for target_off, etype in targets:
+            payload += struct.pack("<II", target_off, etype)


The payload containing the edge data is constructed but never written to the underlying mmap file. This results in edges not being persisted.

gemini-code-assist · 2026-04-25T11:37:35Z

+            self.index.insert(node.id, 0)
+        if self.radix:
+            self.radix.insert(node.file_path, 0)


Using a dummy pointer of 0 for index insertions makes the Crit-bit and Radix indices non-functional for data retrieval from the mmap file. The actual offset returned by the NodeStore should be used here.

gemini-code-assist · 2026-04-25T11:37:35Z

+                if name and name[0].islower():
+                    # Likely a module name, not function
+                    continue


This heuristic incorrectly skips module-level imports in Python, as most module names (e.g., os, sys, requests) start with a lowercase letter. This will cause the engine to miss a significant number of valid IMPORTS relationships.

Suggested change

if name and name[0].islower():

# Likely a module name, not function

continue

if not name:

continue

Specific improvements

11e1701

offx-zinth merged commit 28cd564 into main Apr 25, 2026
1 check failed

gemini-code-assist Bot reviewed Apr 25, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specific improvements#33

Specific improvements#33
offx-zinth merged 1 commit intomainfrom
master

offx-zinth commented Apr 25, 2026

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	type=NodeType(pnode.type.upper()),
	type=NodeType(pnode.type),

-        for child in node.prev_sibling:
+        curr = node.prev_sibling
+        while curr and curr.type == "decorator":
+            text = content[curr.start_byte : curr.end_byte].decode("utf-8", errors="replace")
+            decorators.append(text.strip())
+            curr = curr.prev_sibling

Conversation

offx-zinth commented Apr 25, 2026

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant