-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
enhancementNew feature or requestNew feature or request
Description
"Classical initialization techniques do not guarantee that the weights of the target network are
initialized within the same range. However, adaptive optimizers, such as Adam [30], can mitigate this issue to some
extent. " (https://arxiv.org/pdf/2306.06955)
Should come up with a method so that early training weights match common weight initialization ranges.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request