Hi @wanji ,
could you please clarify which loss BatchTripletLossLayer minimizes and point to a paper that explains it?
In particular, what is the mu parameter in the following layer definition?
layer {
  name: "loss"
  type: "BatchTripletLoss"
  bottom: "ip2norm"
  bottom: "label"
  top: "loss"
  top: "accuracy"
  top: "debug"
  triplet_loss_param {
    margin: 0.5
    mu: 0.5
  }
}
From the code, it looks like the layer implements Scheme II from the paper you mentioned.
It seems that you re-implemented that scheme, but I am not sure, because I cannot find your parameters in that paper.
Thank you.