refactor(ops): Split matmul.py into modular files #139

@m96-chan

Problem

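src/pygpukit/ops/matmul.py is 1981 lines long and mixes unrelated concerns in a single file: generic matmul, FP8 and NVF4 GEMM, GEMV, grouped GEMM, quantization helpers, and availability checks.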

Current State

src/pygpukit/ops/
└── matmul.py  (1981 lines - everything)

Proposed Structure

src/pygpukit/ops/matmul/
├── __init__.py          (exports)
├── generic.py           (matmul, batched_matmul, transpose)
├── fp8.py               (FP8 GEMM variants)
├── nvf4.py              (NVF4 GEMM)
├── gemv.py              (GEMV operations)
├── grouped.py           (grouped GEMM for MoE)
├── quantize.py          (quantization helpers)
└── availability.py      (all *_available functions)
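
One way to keep existing imports working after the split is to have `__init__.py` re-export the public names from the new submodules. The script below is a minimal sketch of that idea, not the actual pygpukit code: it builds a throwaway package named `demo_ops` (a hypothetical stand-in for `pygpukit.ops`) in a temp directory, with placeholder function bodies and an assumed `fp8_available` name, and checks that the old-style import path still resolves.

```python
import importlib
import sys
import tempfile
import textwrap
from pathlib import Path

# Hypothetical stand-ins for the real modules; all bodies are placeholders.
FILES = {
    "demo_ops/__init__.py": "",
    "demo_ops/matmul/__init__.py": """
        # Re-export public names so the old import path keeps resolving.
        from .generic import matmul, transpose
        from .availability import fp8_available

        __all__ = ["matmul", "transpose", "fp8_available"]
    """,
    "demo_ops/matmul/generic.py": """
        def matmul(a, b):
            # Placeholder: pure-Python matmul on nested lists.
            cols = list(zip(*b))
            return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

        def transpose(a):
            return [list(col) for col in zip(*a)]
    """,
    "demo_ops/matmul/availability.py": """
        def fp8_available():
            # Placeholder: a real check would query the GPU backend.
            return False
    """,
}


def build_package(root: Path) -> None:
    """Write the demo package layout under `root`."""
    for rel_path, source in FILES.items():
        target = root / rel_path
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(textwrap.dedent(source))


def load_demo():
    """Create the package in a temp dir and import `demo_ops.matmul`."""
    root = Path(tempfile.mkdtemp())
    build_package(root)
    sys.path.insert(0, str(root))
    importlib.invalidate_caches()
    return importlib.import_module("demo_ops.matmul")


pkg = load_demo()
# Callers still write `from demo_ops.matmul import matmul`, as before the split.
print(pkg.matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```

Because `matmul/` replaces `matmul.py` at the same import path, callers that import from the package need no changes as long as every public name is re-exported; code that reaches into private helpers would still have to be updated.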

Benefits

Related
