Reduce repeat_interleave calls in apply_F_to_columns by sanjanag · Pull Request #69 · linkedin/DuaLip

sanjanag · 2026-04-27T20:31:40Z

Summary

apply_F_to_columns previously made three repeat_interleave calls and one
torch.cat per bucket to build the flat index arrays. Computing cols_rep
once and indexing prefix[cols_rep] / starts[cols_rep] produces the same
arrays with one repeat_interleave and no torch.cat. The outputs are
identical and the overhead is lower, most visibly on GPU, where each call
incurs kernel-launch latency.
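
A minimal sketch of the indexing change described above, assuming a CSC-style layout in which each column in a bucket has a start offset and a length. The function names, the `starts` / `lens` arguments, and the placement of `torch.cat` in the baseline are illustrative assumptions, not the code from this PR; only `cols_rep`, `prefix`, `idx_in_col`, and `flat_indices` are names taken from the description.

```python
import torch


def flat_indices_three_calls(starts, lens):
    """Hypothetical baseline: three repeat_interleave calls plus a torch.cat."""
    k = lens.numel()
    zero = torch.zeros(1, dtype=lens.dtype, device=lens.device)
    # Exclusive prefix sum of column lengths, built with torch.cat.
    prefix = torch.cat([zero, torch.cumsum(lens, dim=0)[:-1]])
    cols_rep = torch.repeat_interleave(torch.arange(k, device=lens.device), lens)
    prefix_rep = torch.repeat_interleave(prefix, lens)
    starts_rep = torch.repeat_interleave(starts, lens)
    idx_in_col = torch.arange(cols_rep.numel(), device=lens.device) - prefix_rep
    flat_indices = starts_rep + idx_in_col
    return cols_rep, idx_in_col, flat_indices


def flat_indices_one_call(starts, lens):
    """Sketch of the reduced version: one repeat_interleave, no torch.cat."""
    k = lens.numel()
    # Same exclusive prefix sum, computed without concatenation.
    prefix = torch.cumsum(lens, dim=0) - lens
    cols_rep = torch.repeat_interleave(torch.arange(k, device=lens.device), lens)
    # Gather per-column values through cols_rep instead of repeating them.
    idx_in_col = torch.arange(cols_rep.numel(), device=lens.device) - prefix[cols_rep]
    flat_indices = starts[cols_rep] + idx_in_col
    return cols_rep, idx_in_col, flat_indices


# Three columns with lengths 2, 0, 3 (the middle column is empty).
starts = torch.tensor([0, 5, 9])
lens = torch.tensor([2, 0, 3])
old = flat_indices_three_calls(starts, lens)
new = flat_indices_one_call(starts, lens)
```

Both functions return the same three tensors for this input; the second replaces two of the repeats with gathers through `cols_rep`, which is the substitution the summary describes.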

This also adds direct unit-test coverage for apply_F_to_columns (identity,
scaling, multi-bucket equivalence, varying column lengths, output-tensor
mode, empty buckets, ReLU-style clamping).

Test plan

New TestApplyFToColumns suite passes (7 cases)
Existing tests/test_sparse_utils.py cases unchanged and passing

Compute cols_rep once and derive idx_in_col / flat_indices via prefix[cols_rep] and starts[cols_rep] indexing. Drops two repeat_interleave calls and a torch.cat per bucket. Add direct unit-test coverage for apply_F_to_columns.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

sanjanag and others added 2 commits April 27, 2026 13:30

Replace assigned lambdas with defs to satisfy flake8

3395962

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>


sanjanag wants to merge 2 commits into linkedin:master from sanjanag:apply-f-to-columns-perf
