Thank you for your implementation. I encountered difficulties while attempting to apply PBRF to the final layer (linear) when fine-tuning a BERT model for text classification. Would it be possible for you to share the code for PBRF in this specific context? @nrimsky
Thank you again!