ARM: MultiHeadAttention fp16s/a bf16s #4139

Closed
EdVince wants to merge 4 commits into Tencent:master from EdVince:add-arm-multiheadattention-fp16-bf16

Conversation

EdVince (Contributor) commented Aug 12, 2022

The current arm multiheadattention only has the neon fp32 pack4 implementation from my PR a few months ago. This PR fills in the rest:

  1. fp32 pack1
  2. fp16s pack1/4/8
  3. fp16sa pack1/4/8
  4. bf16s pack1/4 & naive (bf16 conversion sketched below)
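
For context, bf16 storage boils down to keeping only the upper 16 bits of the fp32 word (sign, exponent, and the top 7 mantissa bits), widening back to fp32 for the actual arithmetic. A minimal standalone sketch of that round trip (illustrative only, not ncnn's own conversion helpers; truncating conversion shown, real conversions may round):

```cpp
#include <cstdint>
#include <cstring>

// bf16 keeps the upper 16 bits of the fp32 word: sign, 8-bit exponent,
// and the top 7 mantissa bits. Truncating conversion shown here.
static inline uint16_t float32_to_bfloat16_sketch(float v)
{
    uint32_t u;
    std::memcpy(&u, &v, sizeof(u));
    return (uint16_t)(u >> 16);
}

// Widening back to fp32 just restores the dropped low mantissa bits as zero.
static inline float bfloat16_to_float32_sketch(uint16_t v)
{
    uint32_t u = (uint32_t)v << 16;
    float f;
    std::memcpy(&f, &u, sizeof(f));
    return f;
}
```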

codecov-commenter commented Aug 12, 2022

Codecov Report

Merging #4139 (92d6fc5) into master (acbaaa6) will decrease coverage by 0.03%.
The diff coverage is 90.59%.

❗ Current head 92d6fc5 differs from pull request most recent head 5bbf518. Consider uploading reports for the commit 5bbf518 to get more accurate results

@@            Coverage Diff             @@
##           master    #4139      +/-   ##
==========================================
- Coverage   94.43%   94.40%   -0.04%     
==========================================
  Files         748      749       +1     
  Lines      179004   180668    +1664     
==========================================
+ Hits       169046   170551    +1505     
- Misses       9958    10117     +159     
Impacted Files                                     Coverage          Δ
src/layer/arm/multiheadattention_arm_asimdhp.cpp   84.54% <84.54%>   (ø)
src/layer/arm/multiheadattention_arm.cpp           98.66% <98.62%>   (-0.54%) ⬇️


EdVince force-pushed the add-arm-multiheadattention-fp16-bf16 branch from 1cfd819 to 127f414 on August 12, 2022 10:33
nihui closed this Aug 12, 2022
nihui reopened this Aug 12, 2022
EdVince (Contributor, Author) commented Aug 13, 2022

The failing checks are all precision errors reported during testing.
Under fp16sa there are a few issues:

  1. When the test input blob gets too large, the fp16sa error can exceed "1"; for example, enlarging the input blob from (32,128) to (64,256) may already trigger failures.
  2. Because mha contains a softmax activation, the result data swings over a wide range. Most data points meet the precision requirement (their values are typically in the double digits), but some of the smaller activation values exceed the tolerance (these are the failing cases, where the values are only a few tenths).
  3. At the failing points, the expected value and the computed value tend to straddle zero: one is a few tenths positive and the other a few tenths negative.
  4. Even when the check fails, the computed data is still roughly in line with the expected data.

What do you think? I'm not sure whether my implementation is wrong, or whether the mha compute chain is simply too long for full fp16sa precision to hold up, so accuracy collapses around zero.
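
Here is a minimal sketch of the effect, simulating roughly fp16-level precision by masking the low mantissa bits of a float accumulator (an illustration of error accumulation over a long reduction only, not the actual fp16sa NEON path):

```cpp
#include <cstdint>
#include <cstdio>
#include <cstdlib>
#include <cstring>

// Crude fp16 simulation: keep only the top 10 mantissa bits of a float,
// roughly mimicking the precision (not the range or rounding mode) of fp16.
static float truncate_to_fp16_precision(float v)
{
    uint32_t u;
    std::memcpy(&u, &v, sizeof(u));
    u &= ~((1u << 13) - 1); // fp32 has 23 mantissa bits, fp16 has 10
    std::memcpy(&v, &u, sizeof(v));
    return v;
}

int main()
{
    const int n = 4096; // a long reduction, like the mha matmul chain
    float ref = 0.f;    // full fp32 accumulator
    float low = 0.f;    // accumulator truncated to ~fp16 precision each step
    srand(0);
    for (int i = 0; i < n; i++)
    {
        float x = (float)rand() / RAND_MAX * 2.f - 1.f; // values in [-1, 1]
        ref += x;
        low = truncate_to_fp16_precision(low + x);
    }
    // The absolute error is of roughly the same magnitude no matter how small
    // the true result is, so an expected value of a few tenths can end up with
    // the opposite sign, while a value in the double digits still looks fine.
    printf("fp32 sum %f, reduced-precision sum %f, diff %f\n", ref, low, ref - low);
    return 0;
}
```

The point of the sketch is that the absolute error of the low-precision accumulator depends on the length of the reduction and the magnitude of the running sum, not on the size of the final result, which is why only the near-zero outputs fall outside the tolerance.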

EdVince force-pushed the add-arm-multiheadattention-fp16-bf16 branch from 8a8ec47 to 92d6fc5 on August 14, 2022 12:22
tpoisonooo (Contributor) commented Aug 15, 2022

How about writing an int8 version too... Doing it this way conflicts with PR 3940...

EdVince (Contributor, Author) commented Aug 15, 2022

> How about writing an int8 version too... Doing it this way conflicts with PR 3940...

It looks like you have already finished 3940, and your int8 there is a naive implementation, so there shouldn't be a conflict, right?

tpoisonooo (Contributor) commented:

I haven't written the arm int8 version.

EdVince (Contributor, Author) commented Aug 15, 2022

> I haven't written the arm int8 version.

Sure, but I'll have to go learn how int8 works first.

tpoisonooo (Contributor) commented:

It's quite simple: just use int8 for the weight/input. Quantizing the softmax part loses accuracy; I've tried an int4 softmax.
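
A minimal sketch of that idea, symmetric per-tensor int8 quantization of a weight/input buffer with the scale taken from the absolute maximum (the names here are illustrative, not ncnn's actual int8 API):

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Symmetric per-tensor int8 quantization: pick a scale from the absolute
// maximum, map floats to [-127, 127], and keep the scale around so results
// of the int8 matmul can be dequantized back to float afterwards.
static float quantize_int8(const std::vector<float>& w, std::vector<int8_t>& q)
{
    float absmax = 0.f;
    for (float v : w)
        absmax = std::max(absmax, std::fabs(v));

    const float scale = absmax > 0.f ? 127.f / absmax : 1.f;

    q.resize(w.size());
    for (size_t i = 0; i < w.size(); i++)
    {
        int v = (int)std::lround(w[i] * scale);
        q[i] = (int8_t)std::min(127, std::max(-127, v));
    }
    return scale; // dequantize with: float ~ (float)q[i] / scale
}
```

The softmax output is the awkward part: its values all lie in [0, 1] and most of them are tiny, so an 8-bit (let alone 4-bit) grid over that range throws away a lot of resolution, which is consistent with the accuracy drop mentioned above.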

tpoisonooo (Contributor) commented:

One of these days when I have time I'll have to try int4 softmax again. I haven't given up on it.

nihui (Member) commented Feb 23, 2023

move to #4463

nihui closed this Feb 23, 2023