Hi Authors, recently I am doing a comparison between HIVE and InstructPix2Pix. To my surprise HIVE is not working that good as compared to InstructPix2Pix. It is even quite off compared to it. I am really wondering if this is really the case or am I missing something. I have tested both condition and weighted based models with SD 2.1 as backbone. Both are not good and weighted based is quite even worse than condition based. And, in the paper figure 20 it shows that they are not supposed to be too different. I am testing with simple instructions even. Can you please help me here what is going on? Thanks a lot!
Hi Authors, recently I am doing a comparison between HIVE and InstructPix2Pix. To my surprise HIVE is not working that good as compared to InstructPix2Pix. It is even quite off compared to it. I am really wondering if this is really the case or am I missing something. I have tested both condition and weighted based models with SD 2.1 as backbone. Both are not good and weighted based is quite even worse than condition based. And, in the paper figure 20 it shows that they are not supposed to be too different. I am testing with simple instructions even. Can you please help me here what is going on? Thanks a lot!