One-pass `alter` #548

sjakobi · 2025-11-12T12:56:38Z

This is a continuation of @oberblastmeister's work done in #471.

Resolves #392.

TODO:

Use a worker/wrapper (WW) transformation to avoid allocating Maybes
Move collision-handling code out of line
Undo 2e7b1c4 to reduce Core size

Co-authored-by: Simon Jakobi <simon.jakobi@gmail.com>

sjakobi · 2025-11-12T13:12:28Z

I've made alter and update inline aggressively, and at this stage alter is faster than the version on master at all sizes:

master:

$ cabal run fine-grained -- -p alter --stdev 1 -p Int
All
  HashMap.Strict
    alter (1000x)
      presentKey
        Int
          1:      OK
            20.4 μs ± 407 ns
          10:     OK
            38.4 μs ± 449 ns
          100:    OK
            52.0 μs ± 979 ns
          1000:   OK
            70.7 μs ± 1.3 μs
          10000:  OK
            81.4 μs ± 1.5 μs
          100000: OK
            98.0 μs ± 1.5 μs
      absentKey
        Int
          0:      OK
            19.9 μs ± 329 ns
          1:      OK
            24.6 μs ± 442 ns
          10:     OK
            35.1 μs ± 481 ns
          100:    OK
            44.9 μs ± 807 ns
          1000:   OK
            55.5 μs ± 897 ns
          10000:  OK
            63.2 μs ± 1.1 μs
          100000: OK
            85.8 μs ± 457 ns

65af25c:

≻ cabal run fine-grained -- -p alter --stdev 1 -p Int
All
  HashMap.Strict
    alter (1000x)
      presentKey
        Int
          1:      OK
            11.7 μs ±  43 ns
          10:     OK
            29.3 μs ± 200 ns
          100:    OK
            38.8 μs ± 440 ns
          1000:   OK
            60.4 μs ± 678 ns
          10000:  OK
            71.8 μs ± 254 ns
          100000: OK
            91.1 μs ± 1.3 μs
      absentKey
        Int
          0:      OK
            10.3 μs ±  51 ns
          1:      OK
            14.6 μs ± 210 ns
          10:     OK
            24.3 μs ± 323 ns
          100:    OK
            33.7 μs ± 407 ns
          1000:   OK
            44.5 μs ± 750 ns
          10000:  OK
            55.1 μs ± 674 ns
          100000: OK
            81.7 μs ± 766 ns

The Core size for these functions is pretty huge, though: Strict.alter now has 720 terms and Strict.update has 536.

There are still a few things to improve.

This reverts commit 2e7b1c4.

sjakobi · 2025-11-12T23:32:59Z

Core sizes at 50e2490:

Strict.alter: 576 terms
Strict.update: 434 terms
$walterCollision: 168 terms
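
The thread does not say how these term counts were obtained. One common way to get comparable numbers, offered here as an assumption rather than the author's method, is GHC's simplifier dump: with `-ddump-simpl`, each top-level binding is preceded by an `-- RHS size: {terms: …, types: …, coercions: …, joins: …}` comment. The module below is a hypothetical stand-in; `bump` is not part of unordered-containers.

```haskell
{-# OPTIONS_GHC -ddump-simpl -dsuppress-all -dsuppress-uniques -ddump-to-file #-}

-- Hypothetical stand-in for a function whose Core we want to inspect.
-- Compiling this file writes a .dump-simpl file; every binding in it is
-- annotated with its size in terms, types and coercions.
-- The reported sizes depend on the optimization level used for the build.
bump :: Maybe Int -> Maybe Int
bump = fmap (+ 1)
```

Worker bindings that GHC generates itself show up in the same dump under a `$w` prefix, which is presumably where a name like `$walterCollision` comes from.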

treeowl · 2025-11-13T00:59:27Z

Large. Probably not something we'd want to INLINE. One trick around that is to use manual worker-wrapper to try to "unbox" the passed function. Roughly speaking,

newtype Maybe# a = Maybe# (# (##) | a #)
pattern Just# :: a -> Maybe# a
pattern Just# a = Maybe# (# | a #)
pattern Nothing# :: Maybe# a
pattern Nothing# = Maybe# (# (##) | #)
{-# COMPLETE Nothing#, Just# #-}
toMaybe :: Maybe# a -> Maybe a
fromMaybe :: Maybe a -> Maybe# a

alter f = alter# $ \m# -> fromMaybe (f (toMaybe m#))

alter# :: (Hashable k, Eq k) => (Maybe# a -> Maybe# a) -> k -> HashMap k a -> HashMap k a

In the (I believe typical) case that the passed function is known, small, and non-recursive, GHC will inline it into the function passed to alter#, getting rid of the maybes.
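
To make the scheme above concrete, here is a minimal, self-contained sketch that fills in `toMaybe` and `fromMaybe` and applies the wrapper to a toy worker. It is not the code from this PR: `alterList#` and `alterList` are hypothetical names, and an association list stands in for `HashMap` so that the example needs only `base`. It assumes a GHC that supports `UnliftedNewtypes` (8.10 or later).

```haskell
{-# LANGUAGE MagicHash #-}
{-# LANGUAGE PatternSynonyms #-}
{-# LANGUAGE UnboxedSums #-}
{-# LANGUAGE UnboxedTuples #-}
{-# LANGUAGE UnliftedNewtypes #-}

-- An unboxed Maybe: a newtype over an unboxed sum, so values of this type
-- are not heap-allocated Just/Nothing cells.
newtype Maybe# a = Maybe# (# (# #) | a #)

pattern Just# :: a -> Maybe# a
pattern Just# a = Maybe# (# | a #)

pattern Nothing# :: Maybe# a
pattern Nothing# = Maybe# (# (# #) | #)

{-# COMPLETE Nothing#, Just# #-}

toMaybe :: Maybe# a -> Maybe a
toMaybe (Just# a) = Just a
toMaybe Nothing# = Nothing
{-# INLINE toMaybe #-}

fromMaybe :: Maybe a -> Maybe# a
fromMaybe (Just a) = Just# a
fromMaybe Nothing = Nothing#
{-# INLINE fromMaybe #-}

-- Hypothetical worker: the large, non-inlined part. It only ever sees the
-- unboxed Maybe#. An association list stands in for HashMap here.
alterList# :: Eq k => (Maybe# a -> Maybe# a) -> k -> [(k, a)] -> [(k, a)]
alterList# f k = go
  where
    -- Key absent: ask f whether to insert a value.
    go [] = case f Nothing# of
      Nothing# -> []
      Just# v -> [(k, v)]
    go ((k', v) : rest)
      -- Key present: f decides between update and delete.
      | k == k' = case f (Just# v) of
          Nothing# -> rest
          Just# v' -> (k, v') : rest
      | otherwise = (k', v) : go rest
{-# NOINLINE alterList# #-}

-- Hypothetical wrapper: small enough to inline at every call site, where a
-- known f can be fused with toMaybe/fromMaybe.
alterList :: Eq k => (Maybe a -> Maybe a) -> k -> [(k, a)] -> [(k, a)]
alterList f = alterList# (\m -> fromMaybe (f (toMaybe m)))
{-# INLINE alterList #-}
```

In GHCi, `alterList (fmap (+ 1)) "a" [("a", 1)]` evaluates to `[("a",2)]` and `alterList (const Nothing) "a" [("a", 1)]` to `[]`. Whether the intermediate `Maybe` values really disappear depends on GHC inlining the wrapper and the passed function at the call site, so the optimized Core is worth checking before relying on it.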

sjakobi · 2025-11-13T01:46:43Z

At the current state of this branch, the Maybes from the application of the function argument are eliminated by inlining the function into alter, but the resulting Core size is, of course, quite enormous.

I guess this Maybe#-scheme could possibly help recover some of the performance lost by not inlining alter.

treeowl · 2025-11-13T01:58:48Z

> I guess this Maybe#-scheme could possibly help recover some of the performance lost by not inlining alter.

Exactly.

oberblastmeister and others added 13 commits November 12, 2025 12:04

alter now runs in one pass

9b06248

remove redundant constraints

8df6d0e

Update Data/HashMap/Internal.hs

8e02578

Co-authored-by: Simon Jakobi <simon.jakobi@gmail.com>

add to strict HashMap

61eb586

bang pattern

429fb1d

remove use of two for now for insert'

5779cc8

Use A.index# instead of the removed A.index function

9a3553d

Change position of pointer-equality check

2e7b1c4

Rename inner go function

915cc3d

Bangs and INLINE

0c2ffcf

Update documentation for two

b334d6a

Clean up u-c.cabal

1a643e6

Bring back docs on alterF

65af25c

sjakobi mentioned this pull request Nov 12, 2025

Run alter in one pass #471

Closed

sjakobi added 4 commits November 13, 2025 00:03

Extract alterCollision

64212ab

Revert "Change position of pointer-equality check"

303da19

This reverts commit 2e7b1c4.

alterCollision: Remove unnecessary comment

5fe4337

alterCollision: Add pointer equality check

50e2490
