</figure></iframe></div></div></figure><p id="2514">Services like Midjourney, Steady Diffusion, and DALL-E 2 have seen a huge increase in popularity recently. These apps, which allow anyone to make almost any type of image they desire using simple text instructions and a brand-new form of artificial intelligence known as generative AI, have both generated enthusiasm and criticism.</p><p id="4196">These programs function because they have been “trained” to identify connections between the countless images that have been downloaded from the internet and the text descriptions that go along with them. Eventually, the program “understands” that, for example, the word “dog” relates to the picture of a canine.</p><h1 id="bba2">But Why Does AI Struggle?</h1><p id="c7f2">According to a Stable AI spokesman, one of the reasons AI picture generators struggle with hands is that human extremities are less visible than their faces in the datasets required to train the image synthesizers. Because they are hardly ever seen in huge scale, hands are likewise frequently significantly smaller in the source photographs.</p><p id="ea41">Professor Peter Bentley, a computer scientist and author based in London, claims that the 2D picture producers also find it difficult to understand the 3D geometry of a hand. “<i>Kids understand the fundamental concept of a hand. None of these models fully comprehend what the complete thing is</i>”, he tells the BBC.</p><p id="fb48">While Stable Diffusion released better but still subpar photos, DALL-E produced hilariously poor images. These appeared more plausible while remaining completely incorrect.</p><figure id="7302"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*3L_DsyI18NeqGWeT.jpg"><figcaption>Image Generated by Stable Fusion</figcaption></figure><figure id="dccb"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*W0qUBbIbtRJQPWlr.jpg"><figcaption>Image Generated by Stable Fusion</figcaption></figure><p id="4ec6">Here are some results generated by DALL-E</p><figure id="1b92"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*v6bthAo7uA9h9yoU.jpg"><figcaption>Generated by DALL-E</figcaption></figure><figure id="e72c"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*JbHqcSnY7Z8fRtn8.jpg"><figcaption>Generated by DALL-E</figcaption></figure><p id="0df1">Amelia Winger-Bearskin, professor of AI and the arts at the University of Florida, explains that generative AI just doesn’t comprehend what a hand is and what its purpose is. She told BuzzFeed, “<i>It merely looks at how hands are depicted in the photographs that it has been trained on. Hands are quite nuanced in photographs. They typically have something in their hands. Maybe perhaps, they’re clinging to someone else</i>”.</p><h1 id="7ac5">Is It Just the A
Options
I?</h1><p id="aed9">Not only AI, that found it difficult to draw hands; throughout history, artists have also tried to avoid drawing hands due to their complexity. Artists like Leonardo da Vinci didn’t begin observing and drawing hands until the Renaissance.</p><p id="b726">Winger-Bearskin continues, “<i>Da Vinci was actually pretty concerned with hands and did many, many studies of hands</i>”.</p><p id="393f">Meanwhile, AI simply observes an image and says, “<i>Well, in this case, there’s only half of a thumb, because the rest of it is concealed behind fabric or clutching onto something, and as a result, when it reproduces it, it’s somewhat deformed.</i>”</p><figure id="95fe"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*AkkuGShyiYd7gVmh.gif"><figcaption></figcaption></figure><p id="d05b">My other writings —</p><div id="3b5f" class="link-block">
<a href="https://medium.com/@smraiyyan/list/6b5f19244ab3">
<div>
<div>
<h2>Awesome chatGPT prompts</h2>
<div><h3>All parts</h3></div>
<div><p>medium.com</p></div>
</div>
<div>
<div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*8ad8a80e0ad2481b33dc1412cbb578eb32abbdac.jpeg)"></div>
</div>
</div>
</a>
</div><div id="d880" class="link-block">
<a href="https://readmedium.com/these-5-videos-changed-my-career-as-a-data-scientist-96aae295b1f">
<div>
<div>
<h2>These 5 Videos Changed My Career As a Data Scientist</h2>
<div><h3>Are you ready to delve into the world of tech and AI? Let me introduce you to 5 videos that have had a profound impact…</h3></div>
<div><p>medium.com</p></div>
</div>
<div>
<div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*k4JWtB7-Gf8WCXeg)"></div>
</div>
</div>
</a>
</div><div id="3cd8" class="link-block">
<a href="https://ai.plainenglish.io/jailbreaking-chatgpt-the-dangerous-game-of-linguistic-manipulation-6158bf02e2d4">
<div>
<div>
<h2>Jailbreaking ChatGPT: The Dangerous Game of Linguistic Manipulation</h2>
<div><h3>Using ChatGPT requires challenging assumptions. It means considering how something else might interpret language…</h3></div>
<div><p>ai.plainenglish.io</p></div>
</div>
<div>
<div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*_Fth0fG_ov0_BwUgaAYJAA.gif)"></div>
</div>
</div>
</a>
</div></article></body>
The Handicap of AI: Understanding Why Hands Remain a Challenge for Image Generation
The quality of AI images has astounded the photographic community with its hyper-realism. But it seems that there is one obstacle they keep running into: hands.
Image Generated by Stable Fusion
Miles Zimmerman, a 31-year-old programmer from San Francisco, was playing around with Midjourney earlier this month and having his mind blown by the AI-powered application that creates visuals from a brief text input.He used ChatGPT to help him make some of his prompts, and one of them was very specific: “A candid photo of some happy 20-something year-olds in 2018 dressed up for a night out, enjoying themselves mid-dance at a house party in some apartment in the city, photographed by Nan Goldin, taken with a Fujifilm Instax Mini 9, flash, candid, natural, spontaneous, youthful, lively, carefree, — ar 3:2.”
Join the Medium Membership Program for only 5$ to continue learning without limits. I’ll receive a small portion of your membership fee if you use the following link, at no extra cost to you.
In a matter of seconds, Midjourney spit out made-up images after made-up images of appealing young people having a good time at a party. He shared the images on Twitter, where they soon went viral.
Services like Midjourney, Steady Diffusion, and DALL-E 2 have seen a huge increase in popularity recently. These apps, which allow anyone to make almost any type of image they desire using simple text instructions and a brand-new form of artificial intelligence known as generative AI, have both generated enthusiasm and criticism.
These programs function because they have been “trained” to identify connections between the countless images that have been downloaded from the internet and the text descriptions that go along with them. Eventually, the program “understands” that, for example, the word “dog” relates to the picture of a canine.
But Why Does AI Struggle?
According to a Stable AI spokesman, one of the reasons AI picture generators struggle with hands is that human extremities are less visible than their faces in the datasets required to train the image synthesizers. Because they are hardly ever seen in huge scale, hands are likewise frequently significantly smaller in the source photographs.
Professor Peter Bentley, a computer scientist and author based in London, claims that the 2D picture producers also find it difficult to understand the 3D geometry of a hand. “Kids understand the fundamental concept of a hand. None of these models fully comprehend what the complete thing is”, he tells the BBC.
While Stable Diffusion released better but still subpar photos, DALL-E produced hilariously poor images. These appeared more plausible while remaining completely incorrect.
Image Generated by Stable FusionImage Generated by Stable Fusion
Here are some results generated by DALL-E
Generated by DALL-EGenerated by DALL-E
Amelia Winger-Bearskin, professor of AI and the arts at the University of Florida, explains that generative AI just doesn’t comprehend what a hand is and what its purpose is. She told BuzzFeed, “It merely looks at how hands are depicted in the photographs that it has been trained on. Hands are quite nuanced in photographs. They typically have something in their hands. Maybe perhaps, they’re clinging to someone else”.
Is It Just the AI?
Not only AI, that found it difficult to draw hands; throughout history, artists have also tried to avoid drawing hands due to their complexity. Artists like Leonardo da Vinci didn’t begin observing and drawing hands until the Renaissance.
Winger-Bearskin continues, “Da Vinci was actually pretty concerned with hands and did many, many studies of hands”.
Meanwhile, AI simply observes an image and says, “Well, in this case, there’s only half of a thumb, because the rest of it is concealed behind fabric or clutching onto something, and as a result, when it reproduces it, it’s somewhat deformed.”