In the current implementation of train, if the algorithm converges below the threshold, the function returns immediately without saving a model. It would be much more convenient to save the model anyway, or to provide a wrapper that saves two layers (.vec and 2.vec).
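
To illustrate the first option, here is a minimal sketch of a training loop that saves on every exit path. It is not the project's actual code: the function signature, the toy update rule, the JSON output format, and the names threshold, max_iters and out_path are all assumptions made for the example. The only point is that the convergence check breaks out of the loop instead of returning, so the save step always runs.

```python
import json


def train(data, threshold=1e-3, max_iters=100, out_path="model.json"):
    """Hypothetical training loop: fits one weight to the mean of `data`."""
    weight = 0.0
    converged = False
    iterations = 0
    for _ in range(max_iters):
        iterations += 1
        # Gradient of the mean squared error 0.5 * (weight - x)^2 over the data.
        grad = sum(weight - x for x in data) / len(data)
        weight -= 0.5 * grad
        if abs(grad) < threshold:
            converged = True
            break  # leave the loop, but do not return before saving

    # Saving happens whether or not the loop converged early.
    with open(out_path, "w") as f:
        json.dump(
            {"weight": weight, "converged": converged, "iterations": iterations},
            f,
        )
    return converged
```

With this structure the caller always finds a model file at out_path, and the returned flag (also stored in the file) tells it whether training stopped because of the threshold or because it ran out of iterations.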