I was analyzing the the BetaVAE class implementation
and I saw that in every train step there is actually total_loss = reconstruction_loss + kl_loss, while it should be total_loss = reconstruction_loss + self.beta * kl_loss as correctly implemented in cBetaVAE.