feat(security): implement strict upstream validation for URIs and pagination to mitigate SPARQL injection by kaiprodevops · Pull Request #37 · eclipse-thingweb/domus-tdd-api

kaiprodevops · 2026-03-28T11:29:33Z

Overview

As part of an ongoing architectural review to enhance TD Directories with MCP-driven capabilities, I conducted a security audit of the TDD API. While this audit was motivated by the need to build robust guardrails for future AI agents (which can unpredictably hallucinate malformed parameters), the vulnerabilities discovered are critical for the existing REST API security.

This PR implements a Zero-Trust upstream validation layer, effectively mitigating SPARQL Injection across multiple attack vectors, and resolves a sophisticated streaming architecture bypass.

Vulnerabilities Discovered

During Red Team penetration testing, I identified two critical injection vectors, along with two reliability bugs:

URI & Identifier Injection: Unsanitized URIs were being directly interpolated into SPARQL templates. An attacker could craft a URI with structural characters (e.g., > } ;) to break the Abstract Syntax Tree (AST) and execute unauthorized administrative commands like DROP GRAPH.
Pagination Injection & Generator Bypass: The sort_order parameter was vulnerable to injection. More critically, I discovered an architectural flaw: because the GET /things endpoint uses a Python streaming generator (yield), validation occurring inside the database layer was executed after Flask had sent the initial HTTP headers. This bypassed the global error handler entirely, causing the WSGI server to crash mid-stream and leak raw HTML 500 Internal Server Error traces.
Race Condition in Concurrent TD Retrieval: The get_paginated_tds() function used ThreadPoolExecutor to fetch TDs concurrently but failed to wait for all tasks to complete before returning results.
UTF-8 Encoding Issue on Windows: The subprocess calling Node.js for JSON-LD framing used the system default encoding (cp1252 on Windows), causing UnicodeDecodeError when processing TDs with international characters. This resulted in silent failures for TDs containing UTF-8 characters.
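The generator bypass comes down to Python's lazy evaluation: the body of a generator function does not run until the first item is requested, so a check placed inside it fires only once the consumer has started iterating (in a streaming response, after the headers have gone out). A minimal sketch of that behavior, using hypothetical names (`stream_things`, `ALLOWED_ORDERS`) rather than the project's actual code:

```python
ALLOWED_ORDERS = {"ASC", "DESC"}  # hypothetical allowlist, for illustration only


def stream_things(sort_order):
    # This check sits inside the generator body, so it is deferred until
    # the first next() call, not executed when stream_things() is called.
    if sort_order.upper() not in ALLOWED_ORDERS:
        raise ValueError("invalid sort_order")
    yield "td-1"
    yield "td-2"


# Creating the generator does not raise, even with a bad value.
gen = stream_things("ASC; DROP ALL")
created_without_error = True

# The error only surfaces once the consumer starts iterating.
try:
    next(gen)
    raised_on_iteration = False
except ValueError:
    raised_on_iteration = True
```

Validating in the route handler, before the generator object is created, moves the failure to a point where the framework can still turn it into a normal error response.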

The Fix: "Shift-Left" Validation + Concurrency Hardening

To address this without disrupting the core business logic, I implemented the following:

Centralized Sanitization (tdd/validators.py): Created a dedicated validation module using strict RFC 3986-compliant Regex for URIs and explicit allowlists (ASC/DESC) for pagination.
Controller-Level Interception: Shifted the pagination validation upstream directly into the routing layer (tdd/__init__.py). By validating parameters before the generator is instantiated, we completely eliminated the lazy evaluation bypass.
Structured Error Handling (tdd/errors.py): To keep error reporting consistent, I implemented a dedicated SecurityValidationError class and wired the new validators to raise it. Malicious inputs are caught and converted into structured JSON-LD 400 Bad Request responses, which preserves the API's error contract.
Concurrency Fix (tdd/td.py): Added explicit task completion waiting using concurrent.futures.as_completed() to ensure all concurrent TD retrieval tasks finish before returning results. This maintains the parallel execution performance while guaranteeing data integrity.
Encoding Fix (tdd/common.py): Added explicit encoding='utf-8' parameter to the subprocess call, ensuring consistent UTF-8 handling across all platforms, particularly Windows.
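As a rough illustration of the allowlist approach described in this list, here is a minimal sketch. It is not the contents of tdd/validators.py: the regex only approximates the RFC 3986 character set (it does not check the full grammar), and the function bodies and the exception's base class are assumptions.

```python
import re


class SecurityValidationError(ValueError):
    """Hypothetical stand-in for the exception class added in tdd/errors.py."""


# Scheme, then only characters RFC 3986 permits in a URI
# (unreserved, gen-delims, sub-delims) or percent-encoded octets.
_URI_RE = re.compile(
    r"[A-Za-z][A-Za-z0-9+.\-]*:"
    r"(?:[A-Za-z0-9\-._~:/?#\[\]@!$&'()*+,;=]|%[0-9A-Fa-f]{2})*"
)

_SORT_ORDERS = ("ASC", "DESC")


def validate_uri(uri):
    # fullmatch() requires the whole string to match, so a trailing
    # "> { ... }" fragment cannot slip through after a valid prefix.
    if not isinstance(uri, str) or not _URI_RE.fullmatch(uri):
        raise SecurityValidationError("Malformed or unsafe URI")
    return uri


def validate_sort_order(value):
    # Normalize case, then compare against an explicit allowlist.
    normalized = str(value).strip().upper()
    if normalized not in _SORT_ORDERS:
        raise SecurityValidationError("sort_order must be ASC or DESC")
    return normalized
```

Because `<`, `>`, `{`, `}` and whitespace are absent from the allowed set, a value that passes `validate_uri` cannot close the enclosing `<...>` of a SPARQL IRI reference.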

Red Team Test Results (Proof of Concept)

I tested the endpoints using a custom-crafted AST breakout payload (resulting query: CONSTRUCT { ?s ?p ?o } WHERE { GRAPH <urn:test> { ?s ?p ?o } } ; DROP SILENT GRAPH <ALL> ; #> { ?s ?p ?o } }):
urn:test%3E%20%7B%20%3Fs%20%3Fp%20%3Fo%20%7D%20%7D%20%3B%20DROP%20SILENT%20GRAPH%20%3CALL%3E%20%3B%20%23
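To see what the encoded payload does, it can be decoded with the standard library and dropped into a query template. The template in `build_query` is a hypothetical reconstruction for illustration, not the project's actual SPARQL template:

```python
from urllib.parse import unquote

ENCODED = (
    "urn:test%3E%20%7B%20%3Fs%20%3Fp%20%3Fo%20%7D%20%7D%20%3B%20"
    "DROP%20SILENT%20GRAPH%20%3CALL%3E%20%3B%20%23"
)


def build_query(graph_uri):
    # Hypothetical template: the URI is concatenated straight into the
    # query text, which is the unsafe pattern the PR removes.
    return "CONSTRUCT { ?s ?p ?o } WHERE { GRAPH <" + graph_uri + "> { ?s ?p ?o } }"


decoded = unquote(ENCODED)
query = build_query(decoded)

print(decoded)
print(query)
```

The decoded value closes the IRI and both braces early, appends a second statement, and uses the trailing `#` to comment out what remains of the template.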

Before the Fix: The payload successfully reached the database, crashing the Fuseki parser and returning an unhandled 500 error (or crashing the stream for pagination).
After the Fix: The API gateway intercepts the structural characters immediately, returning a safe, structured 400 Bad Request.

[Before]

[After]


kaiprodevops · 2026-03-29T05:21:14Z

Hi @wiresio ,
I’ve just submitted this PR to implement a strict "Shift-Left" upstream validation layer for the TDD API.

A major driver for this architectural update is laying the groundwork for Enhancing TD Directories with MCP-driven Capabilities. Since future LLM agents interacting via MCP can unpredictably hallucinate parameters or inject structural characters, it was critical to secure the API boundary. This upstream validation ensures that malformed tool calls are intercepted before reaching the SPARQL engine or Python generators, returning a structured 400 Bad Request to enable LLM self-correction rather than causing a 500 Internal Server Error crash.

Additionally, this PR introduces a workaround for the RDFLib Blank Node (BNode) pagination data loss and fixes the Windows UTF-8 subprocess encoding issue.

I would be incredibly grateful for your feedback whenever you have a chance to look at this. Please let me know if anything needs to be changed. Thanks again for maintaining such an amazing project!

wiresio · 2026-03-30T09:38:34Z

Thanks @kaiprodevops! Please allow me some time to have a close look.

wiresio · 2026-03-30T13:42:34Z

Hi @kaiprodevops, here is my Claude-powered feedback:

The changes are directionally correct and address real SPARQL injection risks. The approach (allowlist at the trust boundary before interpolation into SPARQL templates) is sound. However, there are several issues worth raising:

IMHO it is great to prepare the API before making use of it in an MCP server!

Issues found

No tests for the new validators (high priority)
validators.py is entirely untested. Given this is security-critical code, unit tests should cover:

Valid URIs passing through validate_uri

URIs containing <, >, {, }, spaces being rejected

validate_sort_order with "asc", "ASC", "Desc", empty string, and invalid values

validate_uris with mixed valid/invalid lists
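A test module covering the cases listed here could look roughly like the sketch below. The validator bodies are compact stand-ins so the example runs on its own; in the repository the tests would import the real functions from validators.py instead. Rejecting an empty sort order is an assumption about the intended behavior.

```python
import re
import unittest


class SecurityValidationError(ValueError):
    """Stand-in for the project's exception class."""


# Stand-in validators (assumed behavior, not the PR's implementation).
_SAFE_URI = re.compile(r'[A-Za-z][A-Za-z0-9+.\-]*:[^\s<>{}"\\|^`]*')


def validate_uri(uri):
    if not isinstance(uri, str) or not _SAFE_URI.fullmatch(uri):
        raise SecurityValidationError("Malformed or unsafe URI")
    return uri


def validate_uris(uris):
    return [validate_uri(uri) for uri in uris]


def validate_sort_order(value):
    normalized = str(value).strip().upper()
    if normalized not in ("ASC", "DESC"):
        raise SecurityValidationError("Invalid sort order")
    return normalized


class ValidatorTests(unittest.TestCase):
    def test_valid_uris_pass(self):
        for uri in ("urn:test", "https://example.com/things/1"):
            self.assertEqual(validate_uri(uri), uri)

    def test_structural_characters_rejected(self):
        for bad in ("urn:a<b", "urn:a>b", "urn:a{b", "urn:a}b", "urn:a b"):
            with self.assertRaises(SecurityValidationError):
                validate_uri(bad)

    def test_sort_order(self):
        self.assertEqual(validate_sort_order("asc"), "ASC")
        self.assertEqual(validate_sort_order("ASC"), "ASC")
        self.assertEqual(validate_sort_order("Desc"), "DESC")
        for bad in ("", "ASC; DROP GRAPH"):
            with self.assertRaises(SecurityValidationError):
                validate_sort_order(bad)

    def test_mixed_uri_list_rejected(self):
        with self.assertRaises(SecurityValidationError):
            validate_uris(["urn:ok", "urn:bad> }"])


# Run the suite explicitly so the example works outside a test runner.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(ValidatorTests)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```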

Malicious input echoed in error messages and logs
In validators.py:28:
logger.warning(f"SECURITY ALERT: Malformed or unsafe URI blocked: {uri}")
raise SecurityValidationError(f"Malformed or unsafe URI detected: {uri}")
Log injection: A crafted URI containing \n can corrupt log entries.

Information leakage: The raw (attacker-controlled) URI is included in the HTTP 400 response body, enabling attackers to probe the allowlist efficiently.
Recommendation: log a sanitized/truncated/repr of the URI, and return a generic error message without the input.
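One way to follow this recommendation is sketched below. The helper name `safe_for_log`, the logger name, and the 80-character limit are illustrative assumptions:

```python
import logging

logger = logging.getLogger("tdd.validators")  # logger name is an assumption
MAX_LOGGED_CHARS = 80  # arbitrary cap chosen for this sketch


def safe_for_log(value, limit=MAX_LOGGED_CHARS):
    # repr() escapes newlines and other control characters, so a crafted
    # value cannot start a fake log line of its own.
    text = repr(str(value))
    if len(text) > limit:
        text = text[:limit] + "...(truncated)"
    return text


def reject_uri(uri):
    # Log an escaped, truncated form; keep the client-facing message generic
    # so the response does not echo attacker-controlled input.
    logger.warning("Blocked malformed or unsafe URI: %s", safe_for_log(uri))
    raise ValueError("Malformed or unsafe URI")
```

Passing the value as a logging argument instead of an f-string also defers formatting until the record is actually emitted.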

Double validate_sort_order call
validate_sort_order is called in the route handler in tdd/__init__.py, and then again inside get_paginated_tds in td.py:355. The second call is redundant. Additionally, __init__.py calls .lower() on the already-uppercase result of validate_sort_order, which is inconsistent (though harmless functionally).

validate_uri applied to database-sourced URIs
In sparql.py:249 (delete_named_graph) and td.py:271 (delete_graphs), validate_uri is called on URIs that were retrieved from the SPARQL store — not from user input. If any legitimate stored graph URI contains a character outside the allowlist (e.g. a fragment # combined with percent-encoding anomalies, or a URN), these operations will break silently with a 400 rather than propagating a meaningful error. The validate_uri guard should be applied at the external trust boundary (request parameters, TD payload), not on round-tripped database values.

int() cast in get_paginated_tds is redundant
The casts at td.py:359-360 (safe_limit = int(limit) and safe_offset = int(offset)) repeat work already done in __init__.py before get_paginated_tds is called. The redundancy is harmless but adds noise. If the intent is to protect all callers, note that a ValueError raised here would surface as an unhandled 500 rather than a clean 400.
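If the casts are kept to protect other callers, one option is to convert the failure into the validation exception so it maps to a 400. This is a sketch only: `parse_non_negative_int` is a hypothetical helper, and the exception class here is a stand-in for the one in tdd/errors.py.

```python
class SecurityValidationError(Exception):
    """Stand-in for the project's exception class."""


def parse_non_negative_int(raw, name, default):
    # Missing or empty parameters fall back to the caller's default.
    if raw is None or raw == "":
        return default
    try:
        value = int(raw)
    except (TypeError, ValueError):
        # Re-raise as a validation error so the API's error handler can
        # answer with a 400 instead of an unhandled 500.
        raise SecurityValidationError(f"{name} must be an integer") from None
    if value < 0:
        raise SecurityValidationError(f"{name} must not be negative")
    return value
```

Calling it once in the route handler and passing plain integers downstream would also make the second cast unnecessary.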

Thread-safety of all_tds list (pre-existing, but not fixed)
In td.py:384, send_request appends to all_tds from multiple threads without a lock. The PR adds correct exception propagation via task.result() (a genuine fix), but the shared mutation is still present. In CPython this works due to the GIL, but it's not guaranteed behavior. Consider returning results from the futures directly instead of using a shared list.
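The suggestion to return results from the futures can be sketched as follows. `fetch_td` is a hypothetical placeholder for the per-TD request; the point is that worker threads only return values, and the list is assembled by the calling thread:

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def fetch_td(td_id):
    # Hypothetical placeholder for the real per-TD retrieval.
    return {"id": td_id}


def fetch_all(td_ids, max_workers=4):
    results = [None] * len(td_ids)
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        # Map each future to its position so order can be restored later.
        futures = {
            executor.submit(fetch_td, td_id): index
            for index, td_id in enumerate(td_ids)
        }
        for future in as_completed(futures):
            # result() re-raises any exception from the worker thread;
            # only the calling thread writes to the results list.
            results[futures[future]] = future.result()
    return results
```

Since no worker touches shared state, the code does not depend on the GIL making `list.append` safe.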

So, apart from the first issue (the missing tests), only minor changes are needed for this PR.

kaiprodevops · 2026-03-30T19:07:35Z

Hi @wiresio ,
Thank you so much for taking the time to provide such a detailed and insightful review.
I will implement these fixes and ping you once the updated commits are ready for another look. Thanks again for the excellent guidance!


kaiprodevops · 2026-03-31T04:34:41Z

Hi @wiresio,
quick update: all suggested fixes are in and the latest commits are pushed. Appreciate another review when you get a chance.

wiresio · 2026-03-31T08:29:47Z

Thanks @kaiprodevops, looks really good now!

kaiprodevops added 5 commits March 28, 2026 01:50

feat(security): implement strict upstream input validation to mitigate SPARQL injection & generator bypass

cab55dc

style: format code with black and resolve flake8 whitespace warnings

0795b62

fix: introduce i18n-compliant SecurityValidationError class

a0bcc64

style: add missing newline at end of errors.py

c2d2008

fix: resolve race condition and UTF-8 encoding issues in TD retrieval

f88286e

kaiprodevops added 4 commits March 30, 2026 23:38

fix: harden input validators and add security-focused tests

7b5a2a9

Signed-off-by: kaiprodev <warmtigerca@gmail.com>

Merge branch 'main' of https://github.com/eclipse-thingweb/domus-tdd-api into security/upstream-input-validation

16e6edd

style: fix flake8 linting errors and apply black formatting

2aa8743

Signed-off-by: kaiprodev <warmtigerca@gmail.com>

style: remove unused validator imports to fix flake8 F401

e2bf98e

Signed-off-by: kaiprodev <warmtigerca@gmail.com>

wiresio merged commit 623998e into eclipse-thingweb:main Mar 31, 2026
3 checks passed

wiresio merged 9 commits into eclipse-thingweb:main from kaiprodevops:security/upstream-input-validation
