Arbitrary Matrix Shapes #9

marcelkant · 2024-11-12T15:04:01Z

This PR introduces initial support for handling arbitrary matrix shapes by adjusting both the Python model and the Softmax hardware module.

Changes

Python Model:

Modified the golden model for arbitrary shapes by padding the inputs with zeros as needed.
Updated to selectively ignore padded input values during the softmax operation

Current Limitations

The HWPE version does not yet work correctly. There is most likely a bug

ToDo

Fix HWPE tests with padded values

Important Changes: - Change scaling of Softmax from 2**7-1 to 2**8-1 Current Limitations: - Only works without biases - Only works with ReLU activation - FeedForward and MatMul do only work with one Tile

…63_E127_P64_F64_H1_B1

gamzeisl

Overall, looks good! Minor changes are needed though.

Fix CI.
Fix HWPE and extend CI tests.

gamzeisl · 2025-02-01T09:47:56Z

.gitlab-ci.yml

-    - python testGenerator.py -H 1 -S 64 -E 64 -P 64 -F 64 --activation gelu
-    - python testGenerator.py -H 1 -S 128 -E 192 -P 256 -F 256 --activation gelu
-    - python testGenerator.py -H 1 -S 192 -E 256 -P 128 -F 128 --activation relu
+    - python testGenerator.py -H 1 -S 64 -E 64 -P 64 -F 64 --activation gelu --skip-vector-validation


Is disabling validation (--skip-vector-validation) a good idea for all? I would say to enable it for a subset of tests

gamzeisl · 2025-02-01T09:48:30Z

.vscode/launch.json

                "-S${input:seq_len}",
                "-E${input:emb_len}",
                "-P${input:prj_len}",
+                "--no-bias"


This can be left out

Makefile

gamzeisl · 2025-02-01T09:52:36Z

PyITA/ITA.py

+        # assert (self.S % self.ITA_M == 0), "Sequence length must be divisible by ITA_M"
+        # assert (self.P % self.ITA_M == 0), "Projection space must be divisible by ITA_M"
+        # assert (self.E % self.ITA_M == 0), "Embedding size must be divisible by ITA_M"
+        # assert (self.F % self.ITA_M == 0), "Feedforward size must be divisible by ITA_M"


gamzeisl · 2025-02-01T09:54:02Z

PyITA/ITA.py

+        # print(f"qk: {qk.shape}")
+        # print(f"qk: {weight.shape}")
+


Why these are commented?

gamzeisl · 2025-02-01T09:54:29Z

PyITA/ITA.py

+        # fig, ax = plt.subplots(1, 2)  # Create a figure with two subplots
+        # im0 = ax[0].imshow(Input, cmap='viridis')
+        # im1 = ax[1].imshow(np.squeeze(weight, axis=0))
+
+        # # Add colorbars for each image if needed
+        # fig.colorbar(im0, ax=ax[0])
+        # fig.colorbar(im1, ax=ax[1])
+
+        # # Set titles for each subplot
+        # ax[0].set_title("Inputs")
+        # ax[1].set_title("Weights")
+
+        plt.show()


Why commented?

gamzeisl · 2025-02-01T09:55:04Z

PyITA/ITA.py

                                      self.requant_add_ffn[0])
        self.FFp_requant = self.apply_activation(self.FFp_requant, self.activation)
-
+    


can be omitted

PyITA/softmax.py

gamzeisl · 2025-02-01T10:00:30Z

src/ita_package.sv

+  typedef logic [WO-WI*2-2:0] seq_length_t;
+  typedef logic [WO-WI*2-2:0] proj_space_t;
+  typedef logic [WO-WI*2-2:0] embed_size_t;
+  typedef logic [WO-WI*2-2:0] ff_size_t;


WO-WI*2-2 is unclear, why the bit width is set to this?

I also do not know why this bit width was defined this way. It was already used for the S, E, and P dimensions, so I used it for the F dimension as well. Should I change it to a constant?

gamzeisl · 2025-02-01T10:01:01Z

src/ita_requantization_controller.sv

+  // always_comb begin
+  //   requant_mult  = ctrl_i.eps_mult[step_q4];
+  //   requant_shift = ctrl_i.right_shift[step_q4];
+  //   requant_add   = ctrl_i.add[step];
+  // end
+


Typo fix Co-authored-by: Gamze İslamoğlu <54476562+gamzeisl@users.noreply.github.com>

Co-authored-by: Gamze İslamoğlu <54476562+gamzeisl@users.noreply.github.com>

Xeratec and others added 24 commits November 4, 2024 16:59

[feature] WIP Support arbitrary matrix shapes

82fd85c

Important Changes: - Change scaling of Softmax from 2**7-1 to 2**8-1 Current Limitations: - Only works without biases - Only works with ReLU activation - FeedForward and MatMul do only work with one Tile

[change] Speedup CI by removing Python Dependencies

7d07f84

Added debug.py to print matrices

2401587

Small changes in the debug.py file

f6d1100

Small changes in ita_controller.sv

c80f81f

Started with the bias padding not finished yet

d993c04

Bias padding solution with exactly 8 errors for each phase

267850e

Added additional buffer for bias values

8534850

No buffering in the controller

08fa963

Changed count_q foto (count_q-1) for the bias padding

abf4d18

Added waves

fd7ae83

Added ctrl.ff_size for feedforward layer

22e835c

Bias padding works now but with quick fix

64732e4

count_q - 1 solution works for one tile

8f0d19a

This version works for data_S127_E50_P64_F64_H1_B1 but not for data_S…

3c6040c

…63_E127_P64_F64_H1_B1

No ebugs for bias padding detected one bug without bias in phase 5

0094c11

Just errors in phase 5 and 6

2a04d89

No bugs in all phases

aab0df5

Bias padding for all phases without bugs

64978c1

Added test vectors in the gitlab-ci

d433625

Fixes in gitlab-ci

8d4de04

Added license on top sim_ita_tb_wave_important.tcl

55ac726

Pipelining test

bf27488

Changed bias for test vectors in gitlab-ci

83290ae

Xeratec assigned marcelkant Nov 13, 2024

Marcel Kant added 2 commits November 18, 2024 16:51

Fixed synthesize errors

3db2ff7

Fixed synthesize error

0ec7089

Xeratec requested review from Xeratec and gamzeisl January 24, 2025 18:54

Xeratec added the enhancement New feature or request label Jan 24, 2025

gamzeisl requested changes Feb 1, 2025

View reviewed changes

marcelkant and others added 2 commits February 8, 2025 15:06

Update PyITA/softmax.py

7d42ab7

Typo fix Co-authored-by: Gamze İslamoğlu <54476562+gamzeisl@users.noreply.github.com>

Update Makefile

5e08872

Co-authored-by: Gamze İslamoğlu <54476562+gamzeisl@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Arbitrary Matrix Shapes #9

Arbitrary Matrix Shapes #9

Uh oh!

marcelkant commented Nov 12, 2024

Uh oh!

gamzeisl left a comment

Uh oh!

gamzeisl Feb 1, 2025

Uh oh!

gamzeisl Feb 1, 2025

Uh oh!

Uh oh!

gamzeisl Feb 1, 2025

Uh oh!

gamzeisl Feb 1, 2025

Uh oh!

gamzeisl Feb 1, 2025

Uh oh!

gamzeisl Feb 1, 2025

Uh oh!

Uh oh!

gamzeisl Feb 1, 2025

Uh oh!

marcelkant Feb 8, 2025

Uh oh!

gamzeisl Feb 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		self.requant_add_ffn[0])
		self.FFp_requant = self.apply_activation(self.FFp_requant, self.activation)

Arbitrary Matrix Shapes #9

Are you sure you want to change the base?

Arbitrary Matrix Shapes #9

Uh oh!

Conversation

marcelkant commented Nov 12, 2024

Changes

Current Limitations

ToDo

Uh oh!

gamzeisl left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants