
Conversation


@Sjors Sjors commented Nov 21, 2025

Implements a way to track the memory footprint of all non-mempool transactions that are still being referenced by block templates, see discussion in #33899. It does not impose a limit.

IPC clients can query this footprint (total, across all clients) using the getMemoryLoad() IPC method. Its client-side usage is demonstrated here:

Additionally, the functional test in interface_ipc.py is expanded to demonstrate how template memory management works: templates are not released until the client disconnects or calls the destroy() method. The latter happens automatically for clients using libmultiprocess, as sv2-tp does. In the Python tests it also happens when references are destroyed or go out of scope.

The PR starts with preparation refactor commits:

  1. Tweaks interface_ipc.py so destroy() calls happen in an order that's useful to later demonstrate memory management
  2. Change std::unique_ptr<BlockTemplate> block_template from a static defined in rpc/mining.cpp to a NodeContext member. This prevents a crash when we later switch to a non-trivial destructor (which uses m_node).

Then the main commits:

  1. Add template_tx_refs to NodeContext to track how many templates contain any given transaction. This map is updated by the BlockTemplate constructor and destructor.
  2. Add GetTemplateMemoryUsage(), which loops over this map and sums the memory footprint of transactions that are no longer in the mempool (see the sketch below).
  3. Expose this information to IPC clients via getMemoryLoad() and add test coverage.
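
For illustration only, here is a minimal, self-contained sketch of the bookkeeping described above. The shape of TxTemplateMap, the helper names, and the mempool lookup are assumptions for the sketch, not the PR's actual code:

// Stand-ins for Bitcoin Core types, so the sketch compiles on its own.
#include <cstddef>
#include <cstdint>
#include <map>
#include <memory>
#include <string>
#include <utility>
#include <vector>

struct Tx { std::string wtxid; size_t memory_usage; };
using TxRef = std::shared_ptr<const Tx>;

// wtxid -> (transaction, number of live templates referencing it)
using TxTemplateMap = std::map<std::string, std::pair<TxRef, uint32_t>>;

// Template constructor side (sketch): bump the count for each transaction the
// template contains. The real code skips the dummy coinbase, since
// submitSolution() can modify it in place.
void AddTemplateRefs(TxTemplateMap& refs, const std::vector<TxRef>& txs)
{
    for (const TxRef& tx : txs) {
        auto& [stored, count] = refs[tx->wtxid];
        stored = tx;
        ++count;
    }
}

// Template destructor side (sketch): decrement and drop entries that reach zero.
void ReleaseTemplateRefs(TxTemplateMap& refs, const std::vector<TxRef>& txs)
{
    for (const TxRef& tx : txs) {
        auto it = refs.find(tx->wtxid);
        if (it != refs.end() && --it->second.second == 0) refs.erase(it);
    }
}

// GetTemplateMemoryUsage (sketch): sum the footprint of referenced transactions
// that are no longer in the mempool. `in_mempool` stands in for a mempool lookup;
// no long-lived mempool lock is needed because an approximate result is fine.
template <typename Lookup>
size_t GetTemplateMemoryUsage(const TxTemplateMap& refs, Lookup in_mempool)
{
    size_t total{0};
    for (const auto& [wtxid, entry] : refs) {
        if (!in_mempool(wtxid)) total += entry.first->memory_usage;
    }
    return total;
}

In the PR the lookup is against the real CTxMemPool and the footprint comes from Bitcoin Core's dynamic usage accounting; the sketch only mirrors the structure.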


DrahtBot commented Nov 21, 2025

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage & Benchmarks

For details see: https://corecheck.dev/bitcoin/bitcoin/pulls/33922.

Reviews

See the guideline for information on the review process.

Concept ACK: ismaelsadeeq, ryanofsky

If your review is incorrectly listed, please copy-paste <!--meta-tag:bot-skip--> into the comment that the bot should ignore.

Conflicts

Reviewers, this pull request conflicts with the following ones:

  • #34003 (test: interface_ipc.py minor fixes and cleanup by ryanofsky)
  • #33966 (refactor: disentangle miner startup defaults from runtime options by Sjors)
  • #33965 (mining: fix -blockreservedweight shadows IPC option by Sjors)
  • #33936 (mining: pass missing context to createNewBlock() and checkBlock() by Sjors)
  • #33819 (mining: getCoinbase() returns struct instead of raw tx by Sjors)
  • #33795 (test: Ignore error message give from python because of PYTHON_GIL by kevkevinpal)
  • #32420 (miner: drop dummy extraNonce in coinbase scriptSig for templates requested via IPC by Sjors)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.


Sjors commented Nov 21, 2025

I haven't benchmarked this yet on mainnet, so I'm not sure if checking every (unique) transaction for mempool presence is unacceptably expensive.

If people prefer, I could also add a way for the getblocktemplate RPC to opt out of the memory bookkeeping, since it holds on to at most one template, and for no longer than a minute.

@DrahtBot
Contributor

🚧 At least one of the CI tasks failed.
Task tidy: https://github.com/bitcoin/bitcoin/actions/runs/19575422916/job/56059300316
LLM reason (✨ experimental): clang-tidy flagged fatal errors (loop variable copied for range-based for causing a warnings-as-errors failure) in interfaces.cpp, breaking the CI run.
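
The flagged check is clang-tidy's performance-for-range-copy. As a hedged illustration (the actual loop in interfaces.cpp may look different), the pattern and its fix are:

#include <map>
#include <string>

void Example(const std::map<std::string, unsigned>& tx_refs)
{
    // Copies each element on every iteration; clang-tidy's
    // performance-for-range-copy check flags this, and with warnings treated
    // as errors the build fails.
    for (const auto entry : tx_refs) { (void)entry; }

    // Fix: bind a const reference instead of copying the element.
    for (const auto& entry : tx_refs) { (void)entry; }
}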

Hints

Try to run the tests locally, according to the documentation. However, a CI failure may still happen due to a number of reasons, for example:

  • Possibly due to a silent merge conflict (the changes in this pull request being incompatible with the current code in the target branch). If so, make sure to rebase on the latest commit of the target branch.

  • A sanitizer issue, which can only be found by compiling with the sanitizer and running the affected test.

  • An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.


TxTemplateMap& tx_refs{*Assert(m_tx_template_refs)};
// Don't track the dummy coinbase, because it can be modified in-place
// by submitSolution()
Member Author

b9306b7: in addition, we might be wiping the dummy coinbase from the template later: Sjors#106

@Sjors Sjors force-pushed the 2025/11/ipc-memusage branch from f22413f to 3b77529 on November 21, 2025 at 16:22

@ismaelsadeeq ismaelsadeeq left a comment


Concept ACK

I think it would be better if we had internal memory management for the mining interface IPC, since we hold on to the block templates.

I would suggest the following approach:

  • Add memory budget for the mining interface.
  • Introduce a tracking list of recently built block templates and total memory usage.
  • Add templates to the list and increment the memory usage after every createnewblock or waitnext return.
  • Whenever the memory budget is exhausted, we should release templates in FIFO order.

Since we only create a new template after a time interval elapses (even if fees increase), and that interval is usually enough for the client to receive and distribute the template to miners, this mechanism should be safe: by the time the budget is exhausted, miners will long since have switched to the most recent template returned by waitnext.
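
As a rough, self-contained illustration of this suggestion (not code from this PR: the TemplateBudget class, the entry type, and the accounting are assumptions for the sketch):

#include <cstddef>
#include <deque>
#include <memory>
#include <utility>

struct TemplateEntry {
    std::shared_ptr<void> block_template; // stand-in for the real template type
    size_t memory_usage{0};
};

class TemplateBudget {
public:
    explicit TemplateBudget(size_t budget) : m_budget{budget} {}

    // Record a newly returned template (e.g. after createNewBlock() or waitNext()
    // returns) and evict the oldest entries once the budget is exceeded.
    void Add(TemplateEntry entry)
    {
        m_total += entry.memory_usage;
        m_templates.push_back(std::move(entry));
        while (m_total > m_budget && !m_templates.empty()) {
            m_total -= m_templates.front().memory_usage;
            m_templates.pop_front(); // FIFO: oldest template released first
        }
    }

private:
    std::deque<TemplateEntry> m_templates;
    size_t m_budget;
    size_t m_total{0};
};

A real version would also need to decide what happens to templates still referenced by clients, which is the crux of the discussion below.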

Mining interface clients should also handle their own memory internally.

Currently, I don’t see much use for the exposed getMemoryLoad method. In my opinion, we should not rely on the IPC client to manage our memory.


Sjors commented Nov 21, 2025

In my opinion, we should not rely on the IPC client to manage our memory.

Whenever the memory budget is exhausted, we should release templates in FIFO order

It seems counterintuitive, but from a memory management perspective IPC clients are treated no differently than our own code. And if we started FIFO-deleting templates that are used by our own code, we'd crash.

So I think FIFO deletion should be a last resort (not implemented here).

There's another reason why we should give clients an opportunity to gracefully release templates in whatever order they prefer. Maybe there are 100 downstream ASICs, one of which is very slow at loading templates, so it's only given a new template when the tip changes, not when there's a fee change. In that scenario you have a specific template that the client wants to "defend" at all costs.

In practice I'm hoping none of this matters and we can pick and recommend defaults that make it unlikely to get close to a memory limit, other than during some weird token launch.


ismaelsadeeq commented Nov 21, 2025

It seems counterintuitive, but from a memory management perspective IPC clients are treated no differently than our own code. And if we started FIFO-deleting templates that are used by our own code, we'd crash.

IMHO we should separate those, and treat clients differently from our own code, because they are different codebases and separate applications with their own memory.

Maybe there are 100 downstream ASICs, one of which is very slow at loading templates, so it’s only given a new template when the tip changes, not when there’s a fee change. In that scenario you have a specific template that the client wants to “defend” at all costs.

I see your point but I don’t think that’s a realistic scenario, and I think we shouldn’t design software to be one-size-fits-all.
If you want to use only single block templates, then use createnewblock and create a new block template and mine that continuously until the chain tip changes or you mine a block.

waitNext returning indicates that we assume your miners are switching from the block they are currently mining to the new one they receive.
Depending on the budget (which I assume is large), many templates would need to be returned before we exhaust it.

Delegating template eviction responsibility to the client can put us in a situation where they handle it poorly and cause us to OOM (but I guess your argument is that we would rather take that chance than risk a situation where miners potentially lose out on rewards).
However, I think that with a clean separation of concerns between the Bitcoin Core node and its clients, and a clear interface definition and expectations, that should not happen; I believe the mining interface should not differ in that respect.
Otherwise, if we do want a one-size-fits-all solution capable of handling the scenario you described, we should rethink the design entirely and revert to an approach where we do not retain block templates.


Sjors commented Nov 24, 2025

Delegating template eviction responsibility to the client can put us in a situation where they handle it poorly and cause us to OOM

Note that it's already the client's responsibility; that's inherent to how multiprocess works.

In the scenario where they handle it poorly, we can use FIFO deletion. All getMemoryLoad() does is give clients an opportunity to handle it better. If they're fine with FIFO, then they never have to call this method.

treat clients differently from our own code

We currently don't track whether any given CBlockTemplate is owned by an IPC client or by our internal code. Once we introduce FIFO deletion, all call sites will have to check whether the template has since been deleted, or we need to exempt them from the memory accounting.

an approach where we do not retain block templates.

Afaik that means revalidating the block from scratch, removing one advantage the submitBlock() approach has over the submitblock RPC (I haven't benchmarked this though).


Sjors commented Nov 24, 2025

I tracked the non-mempool transaction memory footprint for half a day on mainnet, using fairly aggressive template update criteria (minimum fee delta 1 sat and no more than once per second). So far the footprint is minuscule, but of course this depends on the mempool weather:

[getmemoryload-scatter: scatter plot of the tracked non-mempool memory footprint over time]

The memory spike after each new block is because sv2-tp holds on to templates from previous blocks for 10 seconds. Those ~3 MB spikes may look impressive, but keep in mind that the default mempool is 300 MB.


Sjors commented Nov 25, 2025

I restructured the implementation and commits a bit.

The TxTemplateMap now lives on the NodeContext rather than MinerImpl (interface). This reflects the fact that we want to track the global memory footprint instead of per client. It's a lightweight member template_tx_refs which should be easy to fold into a block template manager later.

It's also less code churn because I don't have to touch the BlockTemplateImpl constructor.

It also made it easier to move GetTemplateMemoryUsage from interfaces.cpp to miner.cpp, where it's more reusable.

This in turn let me split out a separate commit that introduces the actual getMemoryLoad() interface method. So even if we decide against including that method, the rest of the PR should be useful. However, I do think it's worth keeping; it's already been a helpful debugging and monitoring tool.

I added some comments to point out that we don't hold a mempool.cs lock during the calculation because we don't need an accurate result (mempool drift) and we don't want to bog down transaction relay with a potentially long lock (1-3ms in my testing so far).

@Sjors Sjors force-pushed the 2025/11/ipc-memusage branch from 24592b7 to 03dcfae on November 25, 2025 at 17:21

Sjors commented Nov 25, 2025

mining_getblocktemplate_longpoll.py triggered a stack-use-after-return, due to block_template being static (to allow template reuse between RPC calls). I added a commit d752dcc to move this longpoll template to the node context. This seems more appropriate anyway since BlockTemplate has an m_node member, so it shouldn't be able to outlive the node.

One caveat is that gbt_template has to be cleared before template_tx_refs, so I swapped them and added a comment (cde248a -> 9c667c3).
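
For illustration of why the ordering matters, assuming the two members live directly on NodeContext and rely on reverse declaration-order destruction (the PR's actual layout may differ):

#include <map>
#include <memory>
#include <string>
#include <utility>

// Stand-ins for the real types, for illustration only.
using TxTemplateMap = std::map<std::string, std::pair<std::shared_ptr<const int>, unsigned>>;
struct BlockTemplate { ~BlockTemplate() { /* would decrement counts in template_tx_refs */ } };

struct NodeContextSketch {
    // Declared first, destroyed last: the reference-count map.
    TxTemplateMap template_tx_refs;
    // Declared last, destroyed first: the cached longpoll template, whose
    // destructor still needs template_tx_refs to be alive.
    std::unique_ptr<BlockTemplate> gbt_template;
};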


Expanded the PR description.


Sjors commented Dec 3, 2025

Here's a slightly more realistic plot from last night, on a well-connected node running on an Intel i5-8400:

[getmemoryload-scatter: scatter plot of the tracked non-mempool memory footprint over time]

It's connected to DMND pool, declaring custom templates and getting them approved, but not actually mining. Due to their rate limiting I set -sv2interval=20, so if fees go up, it waits at least 20 seconds before generating a new template. It does not wait when the tip changes.

The machine also runs a lightning node and BTCPay, so the moment a block comes in the system is quite busy.

Sjors added 5 commits December 3, 2025 10:32
Prepare template destruction handling for a later commit that checks
memory management:

- add destroy_template helper which awaits the result and avoids
  calling destroy() if we never received a template
- reverse order and prevent template override. This ensures template
  and template2 (which don't have transactions) are destroyed last.

Additionally, expand the test to demonstrate how setting feeThreshold
to MAX_MONEY ignores new mempool transactions. This extra transaction
is needed in a later commit (to add coverage for reference counting).
The getblocktemplate RPC uses a static BlockTemplate, which goes out
of scope only after the node has completed its shutdown sequence.

This becomes a problem when a later commit implements a destructor
that uses m_node.
IPC clients can hold on to block templates indefinitely, which has the
same impact as when the node holds a shared pointer to the
CBlockTemplate. Because each template in turn tracks CTransactionRefs,
transactions that are removed from the mempool will not have their
memory freed.

This commit adds bookkeeping to the block template constructor and
destructor that will let us track the resulting memory footprint.
Calculate the non-mempool memory footprint for template transaction
references.

Add bench logging to collect data on whether caching or simplified
heuristics are needed, such as not checking for mempool presence.
Allow IPC clients to inspect the amount of memory consumed by
non-mempool transactions in blocks.

Returns a MemoryLoad struct which can later be expanded to e.g.
include a limit.

Expand the interface_ipc.py test to demonstrate the behavior and
to illustrate how clients can call destroy() to reduce memory
pressure.
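
A rough sketch of the shape these last two commits describe; the struct field and the method signature are assumptions for illustration, and the PR's real Mining interface and IPC schema may differ:

#include <cstddef>

// Illustration only.
struct MemoryLoad {
    //! Memory used by transactions that are still referenced by held block
    //! templates but are no longer in the mempool.
    size_t template_tx_bytes{0};
    //! A limit field could be added later, per the commit message above.
};

class Mining // sketch of the interface surface, not the actual class
{
public:
    virtual ~Mining() = default;
    //! Report the total non-mempool template footprint across all clients.
    virtual MemoryLoad getMemoryLoad() = 0;
};
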
@Sjors Sjors force-pushed the 2025/11/ipc-memusage branch from ac1e97a to e8f8f7f on December 3, 2025 at 09:36
@ryanofsky
Contributor

Concept ACK e8f8f7f. All the changes here seem good and mostly straightforward. The getMemoryLoad() function seems useful by itself and the underlying tracking would seem to provide almost everything needed to limit memory used by block templates.

I am a little concerned about the idea of proactively deleting block templates in FIFO order on behalf of clients, since it seems like this could increase complexity server-side, and client-side if clients have to deal with templates disappearing without being notified. Just not returning new templates after a certain amount of memory has been used would seem like a simpler approach.

re: #33922 (comment)

Additionally, the functional test in interface_ipc.py is expanded to demonstrate how template memory management works: templates are not released until the client disconnects or calls the destroy() method.

Would be good if this said templates are also released if the python references are destroyed or go out of scope. (This stood out because I tested this yesterday in #33940 (comment).)


Sjors commented Dec 4, 2025

Just not returning new templates after a certain amount of memory has been used would seem like a simpler approach.

It is, but refusing to make new templates doesn't stop the footprint of existing templates from growing. The worst case extra memory footprint for existing templates is the full size of the mempool.

This is rather unlikely though: it would only happen if, between two blocks, the entire mempool was gradually RBF'd in such a way that each transaction was briefly at the top of the mempool and thus made it into a template.

Would be good if this said templates are also released

Added a sentence to the PR description.


@ryanofsky ryanofsky left a comment


Code review e8f8f7f. This looks good except for a thread safety issue I think you can address by adding a mutex.

re: #33922 (comment)

Would be good if this said templates are also released

Added a sentence to the PR description.

Sorry, I should have made a more specific suggestion. The problem is that this sentence is not accurate: "templates are not released until the client disconnects or calls the destroy() method." Templates will be released if the client drops references to them, even if it never disconnects or calls destroy. I would just change it to "templates are not released until the client drops references to them, or calls the template destroy method, or disconnects"

Comment on lines 859 to +861
static CBlockIndex* pindexPrev;
static int64_t time_start;
static std::unique_ptr<BlockTemplate> block_template;
std::unique_ptr<BlockTemplate>& block_template{node.gbt_template};
Contributor

In commit "rpc: move static block_template to node context" (a5eee29)

I think it would actually be nice to move all these static variables to a struct or class like @ismaelsadeeq's BlockTemplateCache from #33421. But this could be a followup, and doesn't need to complicate this PR.

//! Cache latest getblocktemplate result for BIP 22 long polling
//! Track how many templates (which we hold on to on behalf of connected IPC
//! clients) are referencing each transaction.
TxTemplateMap template_tx_refs;
Contributor

In commit "mining: track non-mempool memory usage" (7c4d03d)

This map can be updated from multiple threads, so it needs a mutex to be used safely. I think I'd suggest combining the template_tx_refs and gbt_template variables and a mutex into a single struct called something like BlockTemplateState and adding a unique_ptr to that struct as a member here. The struct could be replaced with a cache class in #33421.
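
Something along these lines, as a hedged sketch of the suggestion (member names and annotations are assumed, not taken from the PR; TxTemplateMap and BlockTemplate are the types introduced elsewhere in this PR, and Mutex/GUARDED_BY come from Bitcoin Core's sync.h):

#include <memory>
#include <sync.h>

// Sketch of the suggested struct; illustration only.
struct BlockTemplateState {
    Mutex mutex;
    //! Per-transaction reference counts for templates held on behalf of clients.
    TxTemplateMap template_tx_refs GUARDED_BY(mutex);
    //! Cached latest getblocktemplate result for BIP 22 long polling.
    std::unique_ptr<BlockTemplate> gbt_template GUARDED_BY(mutex);
};

// NodeContext could then hold: std::unique_ptr<BlockTemplateState> block_template_state;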

assert_equal(template7.to_dict(), {})

self.log.debug("Memory load should be zero because there was no mempool churn")
with self.nodes[0].assert_debug_log(["Calculate template transaction reference memory footprint"]):
Contributor

In commit "ipc: add getMemoryLoad()" (e8f8f7f)

Seems ok to assert this log message is logged, but I'm wondering if there was a particular reason for doing this. Was the idea to pair the LOG_TIME_MILLIS_WITH_CATEGORY and assert_debug_log calls together?
