Hello! I found your paper very inspiring, but there is one point I don't fully understand and I hope you can clarify it. Regarding the method described in Section 3.4, "Gradient descent on the dilation rates": I examined your code and found that the dilation rates are hardcoded. Based on the description in the paper, is my understanding correct that you first train the model with deformable convolutions to find the optimal offsets, and then use those values as fixed dilation rates to retrain the network?
