[ENH] Add Samformer model for PTF v2 from DSIPTS #1952
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #1952 +/- ##
=======================================
Coverage ? 87.19%
=======================================
Files ? 148
Lines ? 9065
Branches ? 0
=======================================
Hits ? 7904
Misses ? 1161
Partials ? 0
FYI @PranavBhatP, working on a
It seems like this PR is also not up to date with
I will be pushing changes in a while.
@fkiraly could this be assigned to me? I don't think it will be visible on the kanban board...
In the original implementation there is this switch:

    if hasattr(nn.functional, 'scaled_dot_product_attention'):
        att_score = nn.functional.scaled_dot_product_attention(queries, keys, values)  # (n, D, L)
    else:
        att_score = scaled_dot_product_attention(queries, keys, values)  # (n, D, L)

In your implementation you are leveraging on
If the error persists, we can have a look together!
…ss to fix test failures
Works now! I am using the custom implementation.
The test for the quantile loss case is not included, since we do not have an nn metric for quantile predictions.
What is this switch for, @agobbifbk? Is this to handle different versions of
Apologies, it was assigned to the wrong Pranav. GitHub autocompletes when you start typing @, so if you have very similar GH names it is easy to select the wrong one unintentionally.
The function was introduced recently, and it is probably buggy in some versions (or on some platforms). The authors added the switch to make sure the code works across different versions of PyTorch.
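For readers following the thread, here is a minimal sketch of the kind of version switch being discussed. It is not the code from this PR or from the original implementation: the helper name `manual_sdpa`, the wrapper `attention`, and the tensor shapes are assumptions made for illustration, and the fallback covers only the plain case (no mask, no dropout).

```python
import math

import torch
import torch.nn.functional as F


def manual_sdpa(queries, keys, values):
    """Hypothetical fallback: softmax(Q K^T / sqrt(d)) V, no mask, no dropout."""
    scale = 1.0 / math.sqrt(queries.size(-1))
    weights = torch.softmax(queries @ keys.transpose(-2, -1) * scale, dim=-1)
    return weights @ values


def attention(queries, keys, values):
    # Feature detection: use the built-in function only if this PyTorch build has it.
    if hasattr(F, "scaled_dot_product_attention"):
        return F.scaled_dot_product_attention(queries, keys, values)
    return manual_sdpa(queries, keys, values)


torch.manual_seed(0)
# Assumed shapes for illustration only: (n, D, L) = (2, 7, 16).
q, k, v = (torch.randn(2, 7, 16) for _ in range(3))
out = attention(q, k, v)
```

With default arguments both branches should produce the same values up to floating-point tolerance, which is why always using the custom implementation, as done in this PR, is a reasonable way to sidestep version-specific behaviour.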
lgtm
Reference Issues/PRs
Closes #1940