Fix #215 & #217 #218

acidicoala · 2025-10-01T14:04:51Z

While I was on a roll, I decided to create a branch from my branch so that I could work on a fix for #217 and a few other QoL improvements. This branch includes everything from #216 plus additional commits. I did it this way, so that you could chose if how you wish to review those fixes - in isolation or together. If you chose to approve and merge this PR, I will close the #216. If you chose to approve and merge the #216, I will close this PR and create a new one with the fix for #217 specifically.

This PR implements a fix for #217 as it was described in the first post of the issue, by replacing the call rel32 instruction with push imm32.

With those 2 major issues fixed, there were still some minor issues preventing all tests from passing, so they were fixed in this branch as well. Specifically:

Added tests for issues Trampoline calls fail when prologue calls routine reading next instruction #215 and Detour fails when prologue contains Inline call to read SP idiom #217: We can't rely on Linux binaries to test those fixes since those binaries may or may not contain problematic instructions. Instead, inline assembly was used to construct appropriate test cases.
Added diagnostics in test mode: When testing for various edge cases, like fixes for Trampoline calls fail when prologue calls routine reading next instruction #215, Detour fails when prologue contains Inline call to read SP idiom #217, instruction translations, etc., we need to be able to confirm that said fixes actually took place. Until now this was only possible to do manually by inspecting the output logs. But now when building tests (POLYHOOK_BUILD_DLL OFF), a new PLH_DIAGNOSTICS definition will be compiled. When this definition is enabled, IHook class will contain a new uint32_t m_diagnostics member variable, which holds flags for various diagnostics. All flags are defined in the Diagnostic enum class in Enums.hpp (which itself is defined only if PLH_DIAGNOSTICS is defined). When a special fix gets applied, a corresponding diagnostic flag gets set, and then in tests we can verify that using the IHook::hasDiagnostic(Diagnostic d) helper. For instance, this has allowed me to uncover that no translation was actually taking place in the dlmopen instruction, so I disabled that test for now. For now these diagnostics are used in tests for Trampoline calls fail when prologue calls routine reading next instruction #215 and Detour fails when prologue contains Inline call to read SP idiom #217 where they verify that the expected fixes did in fact take place. I can see other flags being added in the future, making the tests more reliable. Of course in library (non-test) builds, none of the diagnostic functions, enums and variables will be available, so this has no performance impact outside of tests. This design was inspired by the nlohmann/json library, which features a JSON_DIAGNOSTICS flag that works in a similar manner.
Finally, the CI workflow was optimized to remove redundant & duplicate options, so keep the code lean and easily readable.

With all these changes, we should expect all tests to pass from this point forward.

gemini-code-assist · 2025-10-01T14:05:20Z

Summary of Changes

Hello @acidicoala, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request focuses on enhancing the robustness of the hooking mechanism by addressing two critical issues (#215 and #217) related to stack pointer manipulation during function calls. It introduces intelligent instruction translation for specific call patterns and establishes a new, compile-time-conditional diagnostic framework to improve the reliability and verifiability of tests. These changes ensure more accurate and stable detours, particularly in complex x86 scenarios, and are supported by new, targeted test cases and a cleaner CI setup.

Highlights

Fixes for Issues Trampoline calls fail when prologue calls routine reading next instruction #215 and Detour fails when prologue contains Inline call to read SP idiom #217: Implemented specific fixes for issues Trampoline calls fail when prologue calls routine reading next instruction #215 (call to routine returning ESP) and Detour fails when prologue contains Inline call to read SP idiom #217 (inline call to read ESP) by replacing problematic call rel32 instructions with mov reg, imm32 or push imm32 respectively, ensuring correct stack pointer handling during detours.
New Diagnostic System for Testing: Introduced a PLH_DIAGNOSTICS definition that, when enabled for test builds, adds a m_diagnostics member to the IHook class. This allows tracking which special fixes (like those for Trampoline calls fail when prologue calls routine reading next instruction #215 and Detour fails when prologue contains Inline call to read SP idiom #217) have been applied, making tests more reliable. This system is compiled out in non-test builds to avoid performance impact.
Dedicated Test Cases: Added new test cases using inline assembly to specifically verify the fixes for issues Trampoline calls fail when prologue calls routine reading next instruction #215 and Detour fails when prologue contains Inline call to read SP idiom #217. The dlmopen translation test was temporarily disabled due to its unreliability and the need for synthetic test cases.
CI Workflow Optimization: The Continuous Integration workflow has been optimized by removing redundant and duplicate options, aiming for a leaner and more readable configuration.

Ignored Files

Ignored by pattern: .github/workflows/** (1)
- .github/workflows/main.yml

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces fixes for issues #215 and #217 by patching specific instruction patterns, and adds a diagnostics framework to verify these fixes in tests. The changes are well-structured and the addition of synthetic tests for these specific edge cases is a great improvement. My main feedback focuses on improving the error handling in the new instruction patching logic to make it more robust by propagating failures, which will prevent potential crashes if the patching fails at runtime.

gemini-code-assist · 2025-10-01T14:08:07Z

polyhook2/Detour/x86Detour.hpp

+    void fixSpecialCases(insts_t& prologue);
+    void fixCallToRoutineReturningSP(Instruction& callInst, const insts_t& routine);
+    void fixInlineCallToReadSP(Instruction& callInst);


These functions perform instruction patching which can fail (e.g., if asmjit fails to generate code). Returning void hides these failures, which could lead to crashes or undefined behavior later in the hooking process if it continues with unpatched instructions. It would be more robust to change their return type to bool to propagate success or failure.

Suggested change

void fixSpecialCases(insts_t& prologue);

void fixCallToRoutineReturningSP(Instruction& callInst, const insts_t& routine);

void fixInlineCallToReadSP(Instruction& callInst);

bool fixSpecialCases(insts_t& prologue);

bool fixCallToRoutineReturningSP(Instruction& callInst, const insts_t& routine);

bool fixInlineCallToReadSP(Instruction& callInst);

Hmm, this suggestion is not without merit. I can properly implement it, if @stevemk14ebr is OK with it.

gemini-code-assist · 2025-10-01T14:08:07Z

sources/x86Detour.cpp

+void x86Detour::fixCallToRoutineReturningSP(Instruction& callInst, const insts_t& routine) {
+    Log::log(
+        "Fixing special case [call to routine returning ESP]:\n" + instsToStr(std::vector{callInst}),
+        ErrorLevel::INFO
+    );
+
+    const auto destReg = routine[0].getOperands().substr(0, 3);
+    const uint32_t originalAddress = callInst.getAddress();
+    const uint32_t originalNextAddress = originalAddress + callInst.size();
+
+    // AsmTK parses strings for AsmJit, which generates the binary code.
+    asmjit::CodeHolder code;
+    asmjit::JitRuntime asmjitRt;
+    code.init(asmjitRt.environment());
+
+    asmjit::x86::Assembler assembler(&code);
+    asmtk::AsmParser parser(&assembler);
+
+    // Parse the instructions via AsmTK
+    if (const auto error = parser.parse(std::format("mov {}, {:#x}", destReg, originalNextAddress).c_str())) {
+        Log::log(std::format("AsmTK error: {}", asmjit::DebugUtils::errorAsString(error)), ErrorLevel::SEV);
+        return;
+    }
+
+    // Generate the binary code via AsmJit
+    uint64_t movAddress = 0;
+    if (const auto error = asmjitRt.add(&movAddress, &code)) {
+        Log::log(std::format("AsmJIT error: {}", asmjit::DebugUtils::errorAsString(error)), ErrorLevel::SEV);
+        return;
+    }
+
+    // Replace `call rel32` instruction with `mov reg, imm32`. Both are 5 bytes long.
+
+    callInst = m_disasm.disassemble(movAddress, movAddress, movAddress + callInst.size(), *this)[0];
+    callInst.setAddress(originalAddress);
+}


Following the change in the header, this function should return bool to indicate whether the instruction patching was successful. On failure (e.g., from parser.parse or asmjitRt.add), it should return false. On success, it should return true at the end.

bool x86Detour::fixCallToRoutineReturningSP(Instruction& callInst, const insts_t& routine) { Log::log( "Fixing special case [call to routine returning ESP]:\n" + instsToStr(std::vector{callInst}), ErrorLevel::INFO ); const auto destReg = routine[0].getOperands().substr(0, 3); const uint32_t originalAddress = callInst.getAddress(); const uint32_t originalNextAddress = originalAddress + callInst.size(); // AsmTK parses strings for AsmJit, which generates the binary code. asmjit::CodeHolder code; asmjit::JitRuntime asmjitRt; code.init(asmjitRt.environment()); asmjit::x86::Assembler assembler(&code); asmtk::AsmParser parser(&assembler); // Parse the instructions via AsmTK if (const auto error = parser.parse(std::format("mov {}, {:#x}", destReg, originalNextAddress).c_str())) { Log::log(std::format("AsmTK error: {}", asmjit::DebugUtils::errorAsString(error)), ErrorLevel::SEV); return false; } // Generate the binary code via AsmJit uint64_t movAddress = 0; if (const auto error = asmjitRt.add(&movAddress, &code)) { Log::log(std::format("AsmJIT error: {}", asmjit::DebugUtils::errorAsString(error)), ErrorLevel::SEV); return false; } // Replace `call rel32` instruction with `mov reg, imm32`. Both are 5 bytes long. callInst = m_disasm.disassemble(movAddress, movAddress, movAddress + callInst.size(), *this)[0]; callInst.setAddress(originalAddress); return true; }

gemini-code-assist · 2025-10-01T14:08:08Z

sources/x86Detour.cpp

+void x86Detour::fixInlineCallToReadSP(Instruction& callInst) {
+    Log::log(
+        "Fixing special case [inline call to to read ESP]:\n" + instsToStr(std::vector{callInst}),
+        ErrorLevel::INFO
+    );
+
+    const uint32_t originalAddress = callInst.getAddress();
+    const uint32_t originalNextAddress = originalAddress + callInst.size();
+
+    // AsmTK parses strings for AsmJit, which generates the binary code.
+    asmjit::CodeHolder code;
+    asmjit::JitRuntime asmjitRt;
+    code.init(asmjitRt.environment());
+
+    asmjit::x86::Assembler assembler(&code);
+    asmtk::AsmParser parser(&assembler);
+
+    // Parse the instructions via AsmTK
+    if (const auto error = parser.parse(std::format("push {:#x}", originalNextAddress).c_str())) {
+        Log::log(std::format("AsmTK error: {}", asmjit::DebugUtils::errorAsString(error)), ErrorLevel::SEV);
+        return;
+    }
+
+    // Generate the binary code via AsmJit
+    uint64_t pushAddress = 0;
+    if (const auto error = asmjitRt.add(&pushAddress, &code)) {
+        Log::log(std::format("AsmJIT error: {}", asmjit::DebugUtils::errorAsString(error)), ErrorLevel::SEV);
+        return;
+    }
+
+    // Replace `call rel32` instruction with `push imm32`. Both are 5 bytes long.
+
+    callInst = m_disasm.disassemble(pushAddress, pushAddress, pushAddress + callInst.size(), *this)[0];
+    callInst.setAddress(originalAddress);
+}


This function should return bool to indicate success or failure, for the same reasons as fixCallToRoutineReturningSP. This ensures that any failures during instruction patching are properly handled.

bool x86Detour::fixInlineCallToReadSP(Instruction& callInst) { Log::log( "Fixing special case [inline call to to read ESP]:\n" + instsToStr(std::vector{callInst}), ErrorLevel::INFO ); const uint32_t originalAddress = callInst.getAddress(); const uint32_t originalNextAddress = originalAddress + callInst.size(); // AsmTK parses strings for AsmJit, which generates the binary code. asmjit::CodeHolder code; asmjit::JitRuntime asmjitRt; code.init(asmjitRt.environment()); asmjit::x86::Assembler assembler(&code); asmtk::AsmParser parser(&assembler); // Parse the instructions via AsmTK if (const auto error = parser.parse(std::format("push {:#x}", originalNextAddress).c_str())) { Log::log(std::format("AsmTK error: {}", asmjit::DebugUtils::errorAsString(error)), ErrorLevel::SEV); return false; } // Generate the binary code via AsmJit uint64_t pushAddress = 0; if (const auto error = asmjitRt.add(&pushAddress, &code)) { Log::log(std::format("AsmJIT error: {}", asmjit::DebugUtils::errorAsString(error)), ErrorLevel::SEV); return false; } // Replace `call rel32` instruction with `push imm32`. Both are 5 bytes long. callInst = m_disasm.disassemble(pushAddress, pushAddress, pushAddress + callInst.size(), *this)[0]; callInst.setAddress(originalAddress); return true; }

gemini-code-assist · 2025-10-01T14:08:08Z

sources/x86Detour.cpp

+void x86Detour::fixSpecialCases(insts_t& prologue) {
+    for (auto& instruction: prologue) {
+        if (const auto routine = getRoutineReturningSP(instruction)) {
+            // Fix for #215 https://github.com/stevemk14ebr/PolyHook_2_0/issues/215
+            fixCallToRoutineReturningSP(instruction, *routine);
+            PLH_SET_DIAGNOSTIC(Diagnostic::FixedCallToRoutineReadingSP);
+        } else if (isInlineCallToReadSP(instruction)) {
+            // Fix for #217 https://github.com/stevemk14ebr/PolyHook_2_0/issues/217
+            fixInlineCallToReadSP(instruction);
+            PLH_SET_DIAGNOSTIC(Diagnostic::FixedInlineCallToReadSP);
+        }
+    }
+}


This function should return bool and check the return values of the helper functions (fixCallToRoutineReturningSP and fixInlineCallToReadSP) to propagate any failures up the call stack.

Suggested change

void x86Detour::fixSpecialCases(insts_t& prologue) {

for (auto& instruction: prologue) {

if (const auto routine = getRoutineReturningSP(instruction)) {

// Fix for #215 https://github.com/stevemk14ebr/PolyHook_2_0/issues/215

fixCallToRoutineReturningSP(instruction, *routine);

PLH_SET_DIAGNOSTIC(Diagnostic::FixedCallToRoutineReadingSP);

} else if (isInlineCallToReadSP(instruction)) {

// Fix for #217 https://github.com/stevemk14ebr/PolyHook_2_0/issues/217

fixInlineCallToReadSP(instruction);

PLH_SET_DIAGNOSTIC(Diagnostic::FixedInlineCallToReadSP);

}

}

}

bool x86Detour::fixSpecialCases(insts_t& prologue) {

for (auto& instruction: prologue) {

if (const auto routine = getRoutineReturningSP(instruction)) {

// Fix for #215 https://github.com/stevemk14ebr/PolyHook_2_0/issues/215

if (!fixCallToRoutineReturningSP(instruction, *routine)) return false;

PLH_SET_DIAGNOSTIC(Diagnostic::FixedCallToRoutineReadingSP);

} else if (isInlineCallToReadSP(instruction)) {

// Fix for #217 https://github.com/stevemk14ebr/PolyHook_2_0/issues/217

if (!fixInlineCallToReadSP(instruction)) return false;

PLH_SET_DIAGNOSTIC(Diagnostic::FixedInlineCallToReadSP);

}

}

return true;

}

gemini-code-assist · 2025-10-01T14:08:08Z

sources/x86Detour.cpp

 bool x86Detour::makeTrampoline(insts_t& prologue, insts_t& trampolineOut) {
    assert(!prologue.empty());
+
+    fixSpecialCases(prologue);


The return value of fixSpecialCases should be checked here. If it fails, makeTrampoline should also fail and return false to prevent the hook from proceeding with potentially corrupted instructions.

Suggested change

fixSpecialCases(prologue);

if (!fixSpecialCases(prologue)) {

Log::log("Failed to fix special case instructions", ErrorLevel::SEV);

return false;

}

acidicoala · 2025-10-01T15:26:30Z

@stevemk14ebr, I've implemented the suggestion related to error checking in fixup functions. I've also took liberty to eliminate build warnings that were bugging me this whole time. This PR is now ready for your review.

stevemk14ebr · 2025-10-01T19:36:11Z

Great changes! Really appreciate the recent work for linux compat

acidicoala · 2025-10-01T19:37:52Z

Thanks a ton!

acidicoala added 14 commits September 30, 2025 17:55

Fix stevemk14ebr#215 & Disassembler test

cfdcdb8

Fix stevemk14ebr#215 for general case

bfc5540

Fix asmjit warning

414a413

Ignore routine reading SP in followJmp

b81d905

Remove duplicate size check

9c4b655

Refactored routine size check

7118280

Updated log message

593a349

Updated log message

dd312de

Fix stevemk14ebr#217 (inline call to read SP)

a0cff20

Added tests for stevemk14ebr#215 and stevemk14ebr#217

781f61a

Added diagnostics for better testing

5994803

Fixed non-test build

ac80749

Fix CI

899d7a4

Disabled unreliable translation test on linux

d3100bf

gemini-code-assist bot reviewed Oct 1, 2025

View reviewed changes

acidicoala added 3 commits October 1, 2025 20:11

Added boolean return to fixup function

b05d754

Eliminate warnings related to NULL comparison

8e0d360

Eliminated warnings related to calling conventions on Linux

1bd5faf

stevemk14ebr approved these changes Oct 1, 2025

View reviewed changes

stevemk14ebr merged commit 09a882e into stevemk14ebr:master Oct 1, 2025
12 checks passed

stevemk14ebr mentioned this pull request Oct 1, 2025

Fix/#215 #216

Closed

acidicoala deleted the fix/#217 branch October 1, 2025 19:41

-    fixSpecialCases(prologue);
+    if (!fixSpecialCases(prologue)) {
+        Log::log("Failed to fix special case instructions", ErrorLevel::SEV);
+        return false;
+    }

Uh oh!

Fix #215 & #217 #218

Fix #215 & #217 #218

Uh oh!

Conversation

acidicoala commented Oct 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot commented Oct 1, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

acidicoala Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

stevemk14ebr Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

acidicoala commented Oct 1, 2025

Uh oh!

Uh oh!

stevemk14ebr commented Oct 1, 2025

Uh oh!

acidicoala commented Oct 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

acidicoala commented Oct 1, 2025 •

edited

Loading