<img width="749" alt="image" src="https://user-images.githubusercontent.com/10068278/226122221-7304d92e-363d-4036-b945-23cf1e190e72.png"> It seems that the code can converge, the loss function is always about 133.13.
It seems that the code can converge, the loss function is always about 133.13.