If AI image generators are so smart, why do they struggle to write and count?

There’s a lot of buzz about generative AI tools – especially ones that can quickly create jaw-dropping images from a text prompt. But there are limitations and flaws, Professor Seyedali Mirjalili from Torrens University notes… especially when hands are involved.

Generative AI tools such as Midjourney, Stable Diffusion and DALL-E 2 have astounded us with their ability to produce remarkable images in a matter of seconds.

Despite their achievements, however, there remains a puzzling disparity between what AI image generators can produce and what we can. For instance, these tools often won’t deliver satisfactory results for seemingly simple tasks such as counting objects and producing accurate text.

AI-generated image produced in response to the prompt ‘KFC logo’. Imagine AI

Be a member to keep reading

Join Mumbrella Pro to access the Mumbrella archive and read our premium analysis of everything under the media and marketing umbrella.

Become a member

Get the latest media and marketing industry news (and views) direct to your inbox.

Sign up to the free Mumbrella newsletter now.

"*" indicates required fields

 

SUBSCRIBE

Sign up to our free daily update to get the latest in media and marketing.