Midjourney is a very powerful

Connect Asia Data learn, and optimize business database management.
Post Reply
rochona
Posts: 743
Joined: Thu May 22, 2025 5:25 am

Midjourney is a very powerful

Post by rochona »

popular closed-source text-to-image generation system known for the high quality of its generations. We evaluate how SDXL and SDXL-DPO compare with Midjourney in terms of user preferences. We compare these models to Midjourney 5.1 (the latest model available at the time of our experiments) using a collection of 346 Midjourney-generated images hosted on Kaggle. We generate images using SDXL and SDXL-DPO using the same prompts and ask crowdworkers to choose an image between Midjourney and SDXL in pairwise preference, by asking the following question: “Which image do you prefer?”. We collect 5 responses for each comparison and choose the majority vote as the collective decision.

Users prefer Midjourney 5.1 to SDXL by a substantial margin (58% to 42%), but after tuning SDXL with DPO, user preference for SDXL improves significantly — SDXL-DPO is selected over Midjourney 51% of the time. These results indicate that DPO-tuning enables an open-source model to match the performance of the powerful closed-source Midjourney models.


Next, we compare SDXL and SDXL-DPO to Emu, a recent model america phone number list developed and hosted by Meta. We compare the models using a 200 caption randomly selected subset of Partiprompts (browser interaction is a slow way of collecting data) by employing the same crowdsourcing protocol used for Midjourney. Emu is preferred over vanilla SDXL by a significant margin (61% to 39%), mirroring the results reported in the Emu paper. In contrast, SDXL-DPO is able to close down the gap to Emu. Emu is preferred by a much narrower margin to SDXL-DPO (54% to 46%) and in fact, the breakeven point falls within the 1-standard-deviation error bar.
Post Reply