Creating original images is among the top uses for generative AI. Popular tools include Adobe Firefly, Microsoft Designer, Canva, and Midjourney. Users of X (formerly Twitter) can now take advantage of Grok.
All of these tools can create semi-realistic images, but each has its drawbacks. Grok in particular gets some details humorously wrong, such as a cigarette sticking out of Mickey Mouse’s face; a door handle and door hinge both on the same edge of a door; lights sticking out of people’s heads; and Homer Simpson with two eyes but three pupils.
Grok also struggles to follow instructions, as users have noted. I discovered that first-hand when trying to create a simple image for a blog post. The output is impressive (and very fast), but as the following example shows, Grok does not always do what it is asked.