The issue of "required_accum_size" #26

First of all, thank you for providing such a helpful resource.

While simulating Llama3-8b, I noticed a potential issue with the `required_accum_size` formula in `Mapping.cc`.
The current code is
`(inner_I * inner_J) * _config.precision`
However, the result of a 16-bit x 16-bit matrix multiplication should be stored as 32-bit values,
so I believe the correct formula should be
`(inner_I * inner_J) * _config.precision * 2`
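
To make the difference concrete, here is a minimal sketch comparing the two formulas. The helper names and the 128 x 128 tile are hypothetical, not taken from `Mapping.cc`, and the sketch assumes `_config.precision` is the input element size in bytes (2 for 16-bit inputs).

```cpp
#include <cstdint>

// Hypothetical helpers for illustration only; not the simulator's actual code.
// `precision_bytes` stands in for _config.precision, assumed to be in bytes.

// Formula as currently written: accumulator sized at the input precision.
constexpr std::uint64_t accum_size_current(std::uint64_t inner_I,
                                           std::uint64_t inner_J,
                                           std::uint64_t precision_bytes) {
    return (inner_I * inner_J) * precision_bytes;
}

// Proposed formula: accumulator entries are twice as wide as the inputs.
constexpr std::uint64_t accum_size_proposed(std::uint64_t inner_I,
                                            std::uint64_t inner_J,
                                            std::uint64_t precision_bytes) {
    return (inner_I * inner_J) * precision_bytes * 2;
}

// Assumed example: 128 x 128 output tile with 16-bit (2-byte) inputs.
static_assert(accum_size_current(128, 128, 2) == 32768,
              "current formula reserves 32 KiB");
static_assert(accum_size_proposed(128, 128, 2) == 65536,
              "32-bit accumulation needs 64 KiB");
```

Under these assumptions, the current formula reserves only half of the space that 32-bit accumulation would need for the same tile.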

Additionally, `max_acc_rows` is defined as
`(_config.core_config[key.target_core].accum_spad_size KB) / (dim * 4 * 2)`
The denominator accounts for the core height (`dim`), the precision (4 bytes), and double buffering (2), which indicates that
the stored data is 32-bit.
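
The same reading of the denominator can be checked with a small sketch. The function name, the 256 KB scratchpad size, and `dim = 128` are hypothetical values chosen for illustration, and `KB` is assumed to mean a multiplication by 1024.

```cpp
#include <cstdint>

// Hypothetical stand-in for the max_acc_rows expression quoted above.
// Assumes `KB` multiplies by 1024; 4 = bytes per 32-bit entry, 2 = double buffering.
constexpr std::uint64_t max_acc_rows(std::uint64_t accum_spad_size_kb,
                                     std::uint64_t dim) {
    return (accum_spad_size_kb * 1024) / (dim * 4 * 2);
}

// Assumed example: 256 KB accumulator scratchpad, dim = 128.
static_assert(max_acc_rows(256, 128) == 256, "256 rows per buffer");

// Each buffer then holds rows * dim entries of 4 bytes each,
// which is exactly half of the scratchpad (the other half is the second buffer).
static_assert(max_acc_rows(256, 128) * 128 * 4 == 128 * 1024,
              "one buffer fills half of the scratchpad with 4-byte entries");
```

With these assumed numbers, the row budget only adds up if each accumulator entry occupies 4 bytes, which is consistent with 32-bit storage.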

