functions to control nonce space and timeouts for all chip topologies by adammwest · Pull Request #420 · bitaxeorg/ESP-Miner

adammwest · 2024-10-20T16:32:24Z

What
relavent Issues/Prs

Goals

to set the nonce space to 100% for configurations that allow
support extended jobs for SV2

Search Space
there is the nonce space 32 bits
there is the version space 16 bits (BIP320)
there is ntime space ~12 bits (mpt, current time + 7200)
there is extranonce2 space ~64+bits

General mining info for ASICS
Typically ASICs will mine nonces first, but now they are so fast they have to mine more things,
in terms of cheapness ntime is good and versions are good due to ASICBOOST.

The general hierarchy is
hash boards -> chips -> cores

The older chips (Bitaxe Max), were supplied midstates
BM1397
the bitcoin header is to big to fit in 1 SHA compression so its split into 2

block0 block1
[midstate0][0,2,3,....]
[midstate1][0,2,3,....]
[midstate2][0,2,3,....]
[midstate3][0,2,3,....]
the purpose is share the message scheduler of SHA1 (first sha second block)
but this budens the controller to send work very fast to the chips

anyway there were 4 midstates and 672 small cores
so that means 168 cores on the chip each core does a independant search on the nonce range
core0 [0,2,3,....]
core1 [0,2,3,....] * offset1
...
core167 [0,2,3,....] * offset167

but how do we divide 168 cores into 2^32, we cant so this means there is a hole in the search.
168/256 = ~62.5% of the nonce space is covered due to the HW cores

in more recent BITMAIN chips
BM1370 we have version rolling

with version rolling
we manipulate the nversion field so we have more space to search, we have 2^16 available values
the BM1370 has 128 cores and 2040 small cores, there are 8 cores missing, 4 cores are missing on core 15 and 4 on core 127
these are the version generators

now our pipeline is more advanced
we generate 16 versions and supply then to each core
core 0 [0,1,2,3,....]
...
core 127 [0,1,2,3,....] * offset128
then we repeat, until 2^16 have run out or 4096 iterations.

Finally these ideas easily extend to multiple chips
when we have multiple chips we must also divide the nonce/version space per chip, then we can parellize.

CHIP 0
core n CHIP0 offset + [0,1,2,3,....] * n

CHIP 1
core n CHIP1 offset + [0,1,2,3,....] * n

In a simple system we have the following equation
time = size / freq
the PR tries to figure out the above timeout equation for different configurations

Finally, the nonce and version space is configurable as well.
currently in ESP-miner the nonce range is roughly 1/64

To control nonces we need hash counting number register
The hcn is a nonce limiter, and after it completes and restarts a new version is generated

figuring out the Maximum HCN is a solution to a dynamic equation, this enables setting the whole nonce space for different frequencies, chip length chains and core counts.

float hcn_space = (float)NONCE_SPACE / big_cores_up / asic_count_up;
double hcn_max = hcn_space * (double)FREQ_MULT / frequency * 0.5f;

Note: The size is dependent on frequency which is odd.

after calculating the HCN max which is the equivalent to nonce space 100%, the timeout is easy, you just need non parallel space of the chip and frequency

double fullspace_timeout_s = serial_versions * serial_nonces / ((double)frequency_mhz * 1000.0 * 1000.0);

Additional Note

The CNO allows 16 bit division in the BM1370 of a chain of chips

This should work for any versions/hcn value/frequency/chip count

Testing TODO
hex
gammaturbo
gamma
supra
ultra
max
naja

mutatrum · 2024-11-21T10:35:17Z

There is an off-by-one error somewhere:

I (17578) ASIC_task: ASIC Job Interval: 1812.53 ms
[...]
I (18228) create_jobs_task: Set chip version rolls 65535
I (18228) stratum_task: rx: {"params":[10000],"id":null,"method":"mining.set_difficulty"}
I (18238) create_jobs_task: Set chip fullscan 1812.498908

coming from this code in create_jobs_task.c:

            //calulate update to fullscan_ms as new version rolling
            double new_version_percent = (double)version_rolls / (double)65536.0;
            double prcnt_change = new_version_percent/GLOBAL_STATE->version_space_percent;
            GLOBAL_STATE->asic_job_frequency_ms *= prcnt_change;
            GLOBAL_STATE->version_space_percent = new_version_percent;
            ESP_LOGI(TAG, "Set chip fullscan %f", GLOBAL_STATE->asic_job_frequency_ms);

I set the BM1368_FULLSCAN_PERCENT to 1.0. The initial asic job interval is set 1/65536 shorter. Don't think that's supposed to happen?

Should it be:

            double new_version_percent = (double)(version_rolls + 1) / (double)65536.0;

Not sure where the 65536.0 is coming from? Is that the maximum it can be?

Other than that, it seems to be hashing fine with these settings on my Supra.

mutatrum · 2025-02-10T08:55:23Z

Is the BM1397 still unknown, as it doesn't set register 0x10 at all?

mapio · 2025-03-12T09:43:21Z

double new_version_percent = (double)version_rolls / (double)65536.0;

By the way, literal 65536.0 is already a double in specified in Section 6.4.4.2 ("Floating constants") of "C standard (ISO/IEC 9899:2018)", more precisely 6.4.4.2p4 states: "A floating constant has type double unless explicitly specified by a suffix.".

This implies that — see Section 6.3.1.8 ("Usual arithmetic conversions") — the other term of the division will be automatically converted to double.

So the idiomatic way of writing such expression should be double new_version_percent = version_rolls / 65536.0;.

Useless cast can confuse expert C programmers because they seem to imply that the standard can't be expected to hold (for some unspecified reason).

mutatrum · 2025-06-02T07:11:10Z

Ok, story time. I've ported this to dev-latest, and it's running on my Gamma with default frequency/voltage. I took out all the percent configurations, so it'll scan the whole nonce range over the full version_mask, with maximum asic job interval. I got this:

I (14210) ASIC_task: ASIC Job Interval: 261213.00 ms

It then starts hashing, first job:

I (17370) bm1370Module: Job ID: 18, Core: 110/4, Ver: 004A8000

Until it reaches the end:

I (273530) bm1370Module: Job ID: 18, Core: 42/7, Ver: 1FA8E000

Indeed, 261 seconds later it was done. It starts with ver 0x00000000 and ends at 0x1FFFFFFF, which is the version_mask for this pool (ckpool). So, as long as the pool doesn't flush the jobs, it only switches to a new job once every 4 minutes 21 seconds. This means it ignores almost every mining.notify, and on average only switches ~4 jobs per block. No duplicate shares reported.

Interesting thing is, the pool doesn't like this. After 100 seconds of working on the same job, you get Invalid JobID errors on submits, and after enough of those, you get a client.reconnect. I reduced the ASIC job interval to 60 seconds as to not piss of the pool and the hashrate is normal, no reconnects, and after 9 hours:

Another observation: the pool wanted to switch from the starting difficulty of 10000 to a lower difficulty after a few minutes, but when running with the 261 seconds ASIC job interval it took 27 minutes before that came into effect, after a mining.notify with a job flush. So a really long job interval points to something that's not correct yet.

If this can be used on all chips, it would be possible to eliminate the jobs queue completely. Just let it run, and on each notify, flush or not, just create one new job, send it to the ASIC and it just works, with a lot more simplified code. If you get a new mining.notify every 5 seconds, one needs to have 50Th/s on a single device to even start rolling extranonce_2 for a new job.

Related issues are #824 and #939.

skot · 2025-06-02T08:19:57Z

That's pretty exciting! I assume this won't work on the BM1397 (bitaxeMax) and prolly not on the upcoming BZM2 as they don't roll version.

Nonetheless I think this is worth implementing.

Removed nonce_percentage and timeout_percentage to simplify the code

mutatrum · 2025-06-02T08:22:40Z

Code to test: https://github.com/mutatrum/ESP-Miner/tree/fullscan-fix-revisited

mutatrum · 2025-06-02T08:24:11Z

That's pretty exciting! I assume this won't work on the BM1397 (bitaxeMax) and prolly not on the upcoming BZM2 as they don't roll version.

Nonetheless I think this is worth implementing.

It's super interesting, there is code for the BM1397, not sure how it'll handle though, I only have a Supra and Gamma.

And how does it need to be controlled? Having the full scan range and the maximum ASIC job timeout clearly causes issues, both with the pool as well as with ESP-Miner itself, as jobs are active way too long.

mutatrum · 2025-08-18T12:13:28Z

It seems like we could switch to new work ASAP no matter how clean_jobs is set?

yes we can. i recommend to merge this!^^

Only if we have rolling nonce for BM1397.

github-actions · 2026-02-21T17:40:18Z

Test Results

47 tests +5 47 ✅ +5 0s ⏱️ ±0s
1 suites ±0 0 💤 ±0
1 files ±0 0 ❌ ±0

Results for commit f7af8ca. ± Comparison against base commit 5156662.

♻️ This comment has been updated with latest results.

…mon)

adammwest · 2026-02-23T12:49:26Z

components/asic/asic.c

+            // no version-rolling so same Nonce Space is splitted between Big Cores
+            return calculate_bm_timeout_ms(freq, asic_count, small_cores, cores, 4.0, ASIC_SET_TIMEOUT_PERCENT, 20);
        case BM1366:
+            // ASIC_calculate_bm_timeout_ms(GLOBAL_STATE, GLOBAL_STATE->version_mask >> 13, 1.0);


change comment ASIC_calculate_bm_timeout_ms to calculate_bm_timeout_ms,
consider adding new default config default_timeout to device_config.h

maybe move // ASIC_calculate_bm_timeout_ms(GLOBAL_STATE, GLOBAL_STATE->version_mask >> 13, 1.0);
to the PR description rather than have a comment for future use

adammwest changed the title ~~WIP: Fullscans~~ WIP: Calulating fullscan_ms and space computed by the chip Oct 20, 2024

adammwest changed the title ~~WIP: Calulating fullscan_ms and space computed by the chip~~ WIP: Calculating fullscan_ms and space computed by the chip Oct 20, 2024

adammwest changed the title ~~WIP: Calculating fullscan_ms and space computed by the chip~~ Calculating fullscan_ms and space computed by the chip Oct 25, 2024

adammwest changed the title ~~Calculating fullscan_ms and space computed by the chip~~ Calculating scan time and space computed by the chip Oct 25, 2024

adammwest changed the title ~~Calculating scan time and space computed by the chip~~ Calculating scan time and space computed by BM chips Oct 25, 2024

adammwest mentioned this pull request Oct 25, 2024

Consolidation of serial comms issues #350

Open

adammwest changed the title ~~Calculating scan time and space computed by BM chips~~ fix: Calculating scan time and space computed by BM chips Nov 7, 2024

mutatrum mentioned this pull request Nov 24, 2024

Feature: don't calculate asic_job_frequency but measure it. #514

Open

WantClue requested a review from Georges760 December 2, 2024 22:46

WantClue added the help wanted Extra attention is needed label Dec 2, 2024

Georges760 self-assigned this Dec 2, 2024

adammwest changed the title ~~fix: Calculating scan time and space computed by BM chips~~ Knowlage of registers (hcn 10 and cno C0) Feb 9, 2025

adammwest changed the title ~~Knowlage of registers (hcn 10 and cno C0)~~ functions to control nonce space timeouts using HCN, CNO (hcn 10 and cno C0) for all chips Feb 9, 2025

adammwest changed the title ~~functions to control nonce space timeouts using HCN, CNO (hcn 10 and cno C0) for all chips~~ functions to control nonce space timeouts using registers HCN, CNO for all chips Feb 9, 2025

adammwest changed the title ~~functions to control nonce space timeouts using registers HCN, CNO for all chips~~ functions to control nonce space and timeouts using registers HCN, CNO for all chips Feb 9, 2025

adammwest changed the title ~~functions to control nonce space and timeouts using registers HCN, CNO for all chips~~ functions to control nonce space and timeouts for all chip topologies Feb 9, 2025

adammwest force-pushed the fullscan_fix branch from 7ea45fe to 1138572 Compare February 9, 2025 21:47

skot force-pushed the master branch from dd4ecbc to 2f38e20 Compare March 15, 2025 20:16

adammwest force-pushed the fullscan_fix branch from 4937bbc to 7ce9298 Compare March 25, 2025 18:33

mutatrum mentioned this pull request Apr 6, 2025

Refining the algorithm for extranonce2. #824

Open

mutatrum mentioned this pull request May 26, 2025

Custom nonce and extra nonce range #939

Open

WantClue force-pushed the master branch from 35ad8b2 to f6c9276 Compare May 30, 2025 20:52

mutatrum added a commit to mutatrum/ESP-Miner that referenced this pull request Jun 2, 2025

Initial port of fullscan-fix (bitaxeorg#420)

d105d30

Removed nonce_percentage and timeout_percentage to simplify the code

mutatrum mentioned this pull request Nov 5, 2025

Optimize construct_bm_job #1321

Merged

mutatrum mentioned this pull request Feb 17, 2026

Add Stratum V2 (SV2) protocol support #1553

Draft

7 tasks

adammwest force-pushed the fullscan_fix branch from 2cf61c7 to 5156662 Compare February 21, 2026 11:15

adammwest added 2 commits February 21, 2026 13:02

add outline for bm1366

8d0e6de

remake fullscan code

1e8b97b

adammwest changed the base branch from v2.8.x to master February 21, 2026 15:56

remove type cast

0e9d472

adammwest added 19 commits February 21, 2026 19:01

formatting changes

f97788b

add unity test

b18a95e

update params for the tests

69a0f69

add semicolons for tests

b73403f

change common.c/h to asic_common (so test build correctly imports com…

f94fd49

…mon)

update guard

84baa18

rectify units

c87ca1c

fix test calls

2253d8f

add next power of 2 function

cda3a74

better names, fix types

8d3ee3a

remake timeout fn

782ad83

add a fall back test

801c1fa

fix unit tests

4f0f841

fix cocmpile time error (var name)

0b6095c

update comments

3a535b1

change largest power of to to next power of 2

e8a5f8f

add some more test cases

5af3b2d

change types int ->uint32_t

3b3eb4a

update tests

c7f8a9c

adammwest commented Feb 23, 2026

View reviewed changes

add default_asic_timeout to config

f7af8ca

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

functions to control nonce space and timeouts for all chip topologies#420

functions to control nonce space and timeouts for all chip topologies#420
adammwest wants to merge 23 commits intobitaxeorg:masterfrom
adammwest:fullscan_fix

adammwest commented Oct 20, 2024 •

edited

Loading

Uh oh!

mutatrum commented Nov 21, 2024 •

edited

Loading

Uh oh!

mutatrum commented Feb 10, 2025

Uh oh!

mapio commented Mar 12, 2025

Uh oh!

mutatrum commented Jun 2, 2025 •

edited

Loading

Uh oh!

skot commented Jun 2, 2025

Uh oh!

mutatrum commented Jun 2, 2025

Uh oh!

mutatrum commented Jun 2, 2025

Uh oh!

mutatrum commented Aug 18, 2025

Uh oh!

github-actions bot commented Feb 21, 2026 •

edited

Loading

Uh oh!

adammwest Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

Comments

Conversation

adammwest commented Oct 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mutatrum commented Nov 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mutatrum commented Feb 10, 2025

Uh oh!

mapio commented Mar 12, 2025

Uh oh!

mutatrum commented Jun 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

skot commented Jun 2, 2025

Uh oh!

mutatrum commented Jun 2, 2025

Uh oh!

mutatrum commented Jun 2, 2025

Uh oh!

mutatrum commented Aug 18, 2025

Uh oh!

github-actions bot commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Results

Uh oh!

adammwest Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

adammwest commented Oct 20, 2024 •

edited

Loading

mutatrum commented Nov 21, 2024 •

edited

Loading

mutatrum commented Jun 2, 2025 •

edited

Loading

github-actions bot commented Feb 21, 2026 •

edited

Loading