As the title said, I would like to know the way to use pretrained model to generate caption for a specify image.