8357554: Enable vectorization of Bool -> CMove with different type size (on riscv) #28231

Hamlin-Li · 2025-11-11T11:24:12Z

Hi,

Can you help to review this patch?

This patch enables vectorization of statements like `op_1 bop op_2 ? res_f_d_1 : res_f_d_2` in a loop, where the size of op_x differs from the size of res_f_d_x.
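For illustration, a loop of the shape described above might look like the following minimal sketch. The class and method names are hypothetical and not taken from the patch or its tests; the operand types follow the "compare two floats, set a double result" case mentioned later in the review. Whether C2 actually emits a CMoveD for this ternary depends on its heuristics and flags, so treat this only as an example of the source pattern.

```java
import java.util.Arrays;

public class CMoveMixedSize {
    // Hypothetical kernel: a 32-bit float comparison selects between
    // 64-bit double values, so the comparison operands and the result
    // have different element sizes.
    static void select(float[] a, float[] b, double[] x, double[] y, double[] r) {
        for (int i = 0; i < r.length; i++) {
            r[i] = (a[i] < b[i]) ? x[i] : y[i];
        }
    }

    public static void main(String[] args) {
        float[] a = {1f, 5f, 3f};
        float[] b = {2f, 4f, 3f};
        double[] x = {10, 20, 30};
        double[] y = {-1, -2, -3};
        double[] r = new double[3];
        select(a, b, x, y, r);
        System.out.println(Arrays.toString(r));
    }
}
```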

To assist with code review, this PR contains only the shared code change. It is split from #28230, which enables and implements the riscv part. A similar optimization could be extended to other platforms.

Some background

This work was previously #25336, which was blocked by an unsigned comparison issue. That issue was recently resolved by #27942, so I'm restarting work on this optimization.

This PR relaxes only one of the constraints in #25336, i.e. it transforms CMoveF/D to vector operations regardless of the size of the comparison's operands, but it removes the transformation of CMoveI/L to vector operations, which I think needs more investigation.

Test

Jtreg

in progress...

Performance

See the performance data for riscv in #25341.

Thanks

Progress

Change must be properly reviewed (1 review required, with at least 1 Reviewer)
Change must not contain extraneous whitespace
Commit message must refer to an issue

Issue

JDK-8357554: Enable vectorization of Bool -> CMove with different type size (on riscv) (Enhancement - P4)

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/28231/head:pull/28231
$ git checkout pull/28231

Update a local copy of the PR:
$ git checkout pull/28231
$ git pull https://git.openjdk.org/jdk.git pull/28231/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 28231

View PR using the GUI difftool:
$ git pr show -t 28231

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/28231.diff

Using Webrev

Link to Webrev Comment

bridgekeeper · 2025-11-11T11:26:22Z

👋 Welcome back mli! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

openjdk · 2025-11-11T11:27:46Z

❗ This change is not yet ready to be integrated.
See the Progress checklist in the description for automated requirements.

openjdk · 2025-11-11T11:28:31Z

⚠️ @Hamlin-Li This pull request contains merges that bring in commits not present in the target repository. Since this is not a "merge style" pull request, these changes will be squashed when this pull request is integrated. If this is your intention, then please ignore this message. If you want to preserve the commit structure, you must change the title of this pull request to Merge <project>:<branch> where <project> is the name of another project in the OpenJDK organization (for example Merge jdk:master).

openjdk · 2025-11-11T11:28:52Z

@Hamlin-Li The following label will be automatically applied to this pull request:

hotspot-compiler

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command.

Hamlin-Li · 2025-11-11T11:29:42Z

@eme64 Could you have a look? :)

I'm not sure who else can help review it, so feel free to have a look if you're available. :)

Thanks!

mlbridge · 2025-11-11T11:32:41Z

Webrevs

galderz · 2025-11-11T13:45:44Z

Sounds like this PR should include some IR tests?

eme64

@Hamlin-Li Thanks for your continued effort on CMove!

Just a first initial comment. And yes, you'll need some IR tests. Would also be nice if we could get some aarch64 or x64 implementation, so we can test it. Maybe we can collaborate on this PR to make it work together :)

src/hotspot/share/opto/superword.cpp

Hamlin-Li · 2025-11-11T13:59:16Z

@galderz @eme64

This PR does not change any behaviour (it was split from #28230, as suggested in a previous review; please check #25341 (comment)), so the tests (jtreg & jmh) are in #28230.

Or should I just close this one and use #28230 instead?

Hamlin-Li · 2025-11-11T14:02:16Z

> @Hamlin-Li Thanks for your continued effort on CMove!
>
> Just a first initial comment. And yes, you'll need some IR tests. Would also be nice if we could get some aarch64 or x64 implementation, so we can test it. Maybe we can collaborate on this PR to make it work together :)

Sure, I'm happy to have you involved in this one. Are you interested in enabling and implementing the aarch64 or x64 part? Should I close this one and use #28230 instead? Please let me know! :)

Hamlin-Li · 2025-11-11T14:04:26Z

@galderz @eme64 BTW, there is an assert fix in this pr, which is also in another specific pr: #28141. Please let me know if I should do it in this pr or not. Thanks!

eme64 · 2025-11-11T14:20:11Z

@Hamlin-Li We can also just go with a riscv impl for now, and then we can do x64 and aarch64 separately.

Does this patch not affect the IR rules of the tests we have already in the code base? With an improvement, there is usually a chance to add IR rules to existing tests, or add new tests with new IR rules.

eme64 · 2025-11-11T14:52:31Z

src/hotspot/cpu/aarch64/matcher_aarch64.hpp

@@ -204,4 +204,9 @@
  static bool is_feat_fp16_supported() {
    return (VM_Version::supports_fphp() && VM_Version::supports_asimdhp());
  }
+
+  static bool supports_vector_different_use_def_size() {


This sounds extremely vague. Is this supposed to only be about CMove? Because we already have all sorts of instructions that allow different use and def types, such as conversion vectors. Those are already in use on aarch64 and x64.

eme64 · 2025-11-11T14:53:15Z

src/hotspot/share/opto/vectornode.cpp

+bool VectorNode::is_different_use_def_size_supported() {
+  return Matcher::supports_vector_different_use_def_size();
+}


Is this only a forwarding? What's the point of this?

eme64 · 2025-11-11T14:56:22Z

src/hotspot/share/opto/vectornode.hpp

+  // Return true if every bit in this vector is 1, e.g. based on the comparison
+  // result of 2 floats, set a double result.
+  static bool is_different_use_def_size_supported();


I'm a bit confused about your description here. It sounds like this method is looking at a specific vector, and returns results based on that. But that's not what's happening here, is it?

eme64

This seems like an empty refactor, and it's not clear what it solves. It also does not seem riscv specific. It would probably be better if you actually did this together with the patch that actually ensures vectorization for riscv, including IR tests and all. That's probably what you plan to do with #28230, right?

It is difficult to review the code here, without seeing how it all goes together.

eme64 · 2025-11-11T15:00:17Z

@Hamlin-Li Can you describe your general approach with #28230? How exactly will you deal with the type size change? Will you have a conversion of the mask, between the comparison and the blend? It may be good if you describe it in a bit of detail, so that we can allow aarch64 and x64 specialists to look at it, and see if the basic design is platform independent enough ;)
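To make the question about a mask conversion between the comparison and the blend concrete, here is a scalar emulation of one possible scheme: compare 32-bit lanes, widen each lane mask to 64 bits, then blend the double lanes bitwise. This is only an illustrative model. The class and method names are hypothetical, and nothing here is taken from #28230, whose actual design is not described in this thread.

```java
import java.util.Arrays;

public class MaskWidenBlend {
    // Lane-wise float compare: all-ones 32-bit mask where a[i] < b[i], else zero.
    static int[] compareLt(float[] a, float[] b) {
        int[] m = new int[a.length];
        for (int i = 0; i < a.length; i++) {
            m[i] = (a[i] < b[i]) ? -1 : 0;
        }
        return m;
    }

    // Widen each 32-bit lane mask to 64 bits. Sign extension keeps
    // all-ones as all-ones and zero as zero.
    static long[] widenMask(int[] m32) {
        long[] m64 = new long[m32.length];
        for (int i = 0; i < m32.length; i++) {
            m64[i] = (long) m32[i];
        }
        return m64;
    }

    // Bitwise blend on the raw bits of the doubles: take x where the
    // mask is all-ones, y where it is zero.
    static double[] blend(long[] m, double[] x, double[] y) {
        double[] r = new double[x.length];
        for (int i = 0; i < x.length; i++) {
            long xb = Double.doubleToRawLongBits(x[i]);
            long yb = Double.doubleToRawLongBits(y[i]);
            r[i] = Double.longBitsToDouble((xb & m[i]) | (yb & ~m[i]));
        }
        return r;
    }

    public static void main(String[] args) {
        int[] m32 = compareLt(new float[]{1f, 5f, 3f}, new float[]{2f, 4f, 3f});
        double[] r = blend(widenMask(m32),
                           new double[]{10, 20, 30}, new double[]{-1, -2, -3});
        System.out.println(Arrays.toString(r));
    }
}
```

On hardware where a mask holds one bit per lane rather than a full lane-width pattern, the explicit widening step would not be needed in this form, which is one reason the design may differ between platforms.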

eme64 · 2025-11-11T15:05:08Z

@Hamlin-Li At a quick glance, #28230 also has some scalar backend implementations of CMove. I think you could just integrate those separately first, and only then do the vectorization.

Additionally: it may be easier to first ensure that the Vector API tests work for riscv backend vector instructions. And then we can work on Auto Vectorization once all the backend instructions are already in place and tested via the Vector API.

That would be a way I usually see aarch64 and x64 engineers split up the work. Also makes it easier to get specialists for the area to review the code.

What do you think?

Hamlin-Li · 2025-11-11T15:10:41Z

> @Hamlin-Li At a quick glance, #28230 also has some scalar backend implementations of CMove. I think you could just integrate those separately first, and only then do the vectorization.
>
> Additionally: it may be easier to first ensure that the Vector API tests work for riscv backend vector instructions. And then we can work on Auto Vectorization once all the backend instructions are already in place and tested via the Vector API.
>
> That would be a way I usually see aarch64 and x64 engineers split up the work. Also makes it easier to get specialists for the area to review the code.
>
> What do you think?

@eme64 Thank you for the suggestion. I'll do some investigation on vector API related things, and get back to this one later.

hamlin and others added 28 commits May 20, 2025 19:36

initial commit

86350bb

disable cmovei/l => vectorblend

58a7f7a

split from pr 25341

e967fea

initial commit

e27247b

Merge branch 'openjdk:master' into master

4a75e87

Merge branch 'openjdk:master' into master

4ee1df1

Merge branch 'openjdk:master' into master

0ff5e42

Merge branch 'openjdk:master' into master

924c4c9

Merge branch 'openjdk:master' into master

75dee02

Merge branch 'openjdk:master' into master

57973f4

Merge branch 'openjdk:master' into master

4b058ce

Merge branch 'openjdk:master' into master

b73a502

Merge branch 'openjdk:master' into master

8eba0c0

Merge branch 'openjdk:master' into master

7f36f23

Merge branch 'openjdk:master' into master

3089ec9

Merge branch 'openjdk:master' into master

2238d76

Merge branch 'openjdk:master' into master

c0358cf

Merge branch 'openjdk:master' into master

f54562f

Merge branch 'openjdk:master' into master

6635678

Merge branch 'master' into vectorize-CMove-Bool

bd5599b

disable riscv

2ba466b

disable Op_CMoveI/Op_CMoveL in VectorNode::opcode

2a0e1ad

revert supports_transform_cmove_to_vectorblend for all cpus

9e5f137

Merge branch 'openjdk:master' into master

736425c

fix JDK-8371297: assert in BoolTest

bc0c9b3

fix code path change in VectorNode::implemented

5b85c74

simplify

81996cf

comments

56b6e02

Hamlin-Li mentioned this pull request Nov 11, 2025

8357551: RISC-V: support CMoveF/D vectorization #28230

Draft

3 tasks

openjdk bot added the hotspot-compiler hotspot-compiler-dev@openjdk.org label Nov 11, 2025

openjdk bot added the rfr Pull request is ready for review label Nov 11, 2025

eme64 reviewed Nov 11, 2025

src/hotspot/share/opto/superword.cpp

Update src/hotspot/share/opto/superword.cpp

cfbe0a6

refactor `is_velt_basic_type_compatible_use_def` Co-authored-by: Emanuel Peter <emanuel.peter@oracle.com>

fix typo

a89d26c

eme64 reviewed Nov 11, 2025

Merge branch 'openjdk:master' into master

a336b52

eme64 reviewed Nov 11, 2025

Merge branch 'master' into vectorize-CMove-Bool

8e84017
