avatarJames Horton, Ph.D

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

4570

Abstract

result I have seen Midjourney change blurs that look like they should be one thing (like a person) into another entirely (like a rock), in order to better fulfill its prompt.</p><figure id="fb7f"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*YY3oIjbXh8VV0F5j6_h20g.png"><figcaption>Image by author (via Midjourney)</figcaption></figure><h1 id="013a">Composition (about 62% complete)</h1><p id="6245">At this point in the process Midjourney has produced clearly identifiable pictures. The art could be considered complete at this point, at least in the sense that the basic composition is there, but Midjourney is only two thirds of the way complete. What’s happening?</p><p id="8e0c">Well, it turns out that the last third is very important. This is the point where MidJourney’s work starts to alter the details of the picture more than its form or composition.</p><figure id="406f"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*JqXmw4qkivBJWGs6w2HRQw.png"><figcaption>Image by Author (via Midjourney)</figcaption></figure><h1 id="d7be">Details (100% complete)</h1><p id="03a4">I am repeatedly surprised at how much an image transforms in the last third of its journey to completion. The details make the difference between a painting of a starry sky and Van Gogh’s <i>Starry Night</i>. An artist’s style dwells in the details, and the details also appear (in AI art) to be the place where the most errors occur. You can see the results of the final touches in the picture below.</p><figure id="bb7e"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*Z8Y3JSjIADj0TCLdLTHnSw.png"><figcaption>Image by Author (via Midjourney)</figcaption></figure><h1 id="543f">Midjourney’s Lessons for Writers</h1><figure id="358a"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*IRqpf7VZbaWhoEOi"><figcaption>“A beautiful woman holding a rose” — Image by Author (via Midjourney)</figcaption></figure><p id="32ff">The above portrait is the best of the bunch. It’s wonderful, of course, but can we really learn something from it? Your creative process is likely different from Midjourney’s, after all.</p><p id="4988">The brilliance of AI art, however, isn’t that it gives you a perfect process to replace your own. Rather, it offers you a new way of thinking about your work that can be flexibly adapted to your own writing style. You can use its principles to enhance your writing, diagnose problems, and break through mental blocks. Here are a few principles for you to consider.</p><h2 id="b9b2">Principle one: Iterate, iterate, iterate</h2><p id="d5b0">The one principle I hope you take from this article is that iteration is powerful. In fact, many common pieces of advice about writing are actually suggestions to iterate.</p><p id="2135">For example, here’s an old debate; should you outline, or should you just free-write a shitty first draft and then polish it to create a better one? These seem like opposites, but both are actually just forms of iteration. Pick the one that makes you feel confident, and go.</p><p id="39f7">The principle behind iteration is that <i>multiple passes reduce the complexity of creation by spreading it across episodes</i>. Producing great work in one pass requires intense mental effort — and this is true for both you and a computer. Producing great work over multiple passes reduces the “computation” load drastically so that no single iteration is too complex for your hardware to accomplish.</p><p id="a443">You can apply this immediately to your workflow if you procrastinate. Procrastination is a way of avoiding the pain of a high computational burden until you have a burst of last-minute adrenaline to help you handle it. People describe this as a feeling that a mental “switch” has been flipped.</p><p id="77f9">Want to work without waiting for the switch to flip? Iterate. Reduce the burden so that you don’t need adrenaline. Iteration is the flowing river that carves away the mountainous impossible. Start gently, take several passes to clarify your ideas, and several more to write your draft. Each pass becomes the raw material that feeds into the next round of iteration.</p><h2 id="6e81">Principle two: Seed and prompt</h2><p id="8f09">In the world of AI-generated art, creation results from the collision of a <i>seed</i> and a <i>prompt</i>.</p><p id="9096">The <i>prompt</i> is a user’s instructions to the AI. But writers have their own prompts — the high-level goals they set for a creative project (e.g. <i>write an essay on surveillance capitalism</

Options

i>).</p><p id="52da">The <i>seed</i> is a panel of randomly generated static. Life doesn’t offer writers many panels of actual random static, but we do still have our own form of randomness to use as seed material; the quirky details of our own lived experience.</p><p id="77c9">You can combine a prompt (your big goal) with any number of seeds, each resulting in a unique creation. Your essay on surveillance capitalism will look radically different if you start by considering your iPad rather than the light bulb in your room. They may have the same core intent, but the seed(s) you choose will alter how it is expressed.</p><p id="6f47">Midjourney starts with four seeds. Why? Well, variety allows the user to examine the result of the same prompt across multiple seeds, to pick what works best. It’s a great trick, and you can do it, too.</p><p id="72c5">Changing seeds is powerful because it breaks an artist’s fixation on a single way of achieving their goals. Are you struggling with a project? Try changing seeds; keep your big vision, but express it through a different set of random material drawn from your life. This creative recycling is a great way to break free of a mental rut.</p><h2 id="44f4">Principle three: The final third happens after composition is done</h2><p id="a570">When I was younger I stocked shelves at a major retail store. Each night our final task was to “face” the aisles, which meant arranging every item in a perfect row, each sticking off of the shelf by a quarter inch.</p><p id="8a4f">That must sound ridiculous. But having done it, I can share an insight; there is something psychologically <i>different</i> about an aisle where everything is perfectly aligned. It is the visual equivalent of a firecracker. And it hinges on getting the details just right.</p><p id="d22f">During AI art generation, when you request that a prompt be executed in the style of a specific artist (for example, the inimitable Van Gogh), some of the artist’s preferences regarding form and color appear during the first two thirds of the rendering, but it is only after the 60% mark that the subtleties that identify an artist’s style will appear.</p><p id="76bc">In fact, for many artists who use more conventional form and color, the signature characteristics that mimic their style don’t appear at all until the final third. The details make almost all of the difference.</p><figure id="c267"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*Lsk73gVVwxaVD3EX691i2g.png"><figcaption>“Rainbow galaxies above a dark town at night, by Van Gogh” [Author’s; via MidJourney]</figcaption></figure><p id="c996">Our mind has emotional mechanisms triggered by details so subtle that we overlook them until they line up together in a way that leaves us stupefied. Their effects are profound; they are the difference between a face that you hardly notice and one that leaves you breathless. And they all happen below that quarter-inch level. That is where quality dwells — and also the markers of an artist’s unique style.</p><p id="5643">In Midjourney those details aren’t even touched until the last third of the composition process — after the composition already looks complete. Why? Well, in AI, it’s because the main picture needs to be resolved before the detail work can commence.</p><p id="0e4b">But this principle applies to your prose as well. Are you the kind of writer who completes the final third? That’s the work you do after your piece already looks complete to you. That’s your chance to polish the details, and shine.</p><figure id="51b4"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*Pq517YE6XSIlSa-ZLMXaWw.png"><figcaption>A rose, painted in watercolor (generated via Stable Diffusion)</figcaption></figure><p id="6ba4"><i>Thanks for reading! If you enjoyed this piece, you may like the following one, too:</i></p><div id="a620" class="link-block"> <a href="https://baos.pub/four-iconic-book-lines-illustrated-by-artificial-intelligence-23640b0c209c"> <div> <div> <h2>Four Iconic Book Lines, Illustrated by Artificial Intelligence</h2> <div><h3>What happens when you ask an AI to illustrate passages from famous books?</h3></div> <div><p>baos.pub</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*_ILJch9Mo5Jh3Bp2DzqKew.png)"></div> </div> </div> </a> </div></article></body>

The New AI Art Generators Can Teach You a Thing or Two About Writing

The powerful method that allows soulless machines to create art

“A beautiful woman holding a rose” (by author, via Stable Diffusion)

A beautiful woman holding a rose

The above image was generated from the phrase “a beautiful woman holding a rose.” I started with the Midjourney AI text-to-art bot. For the final pass, I upgraded Midjourney’s image using Stable Diffusion. The result is far beyond anything I can do myself.

But as I am discovering, that limitation no longer applies.

To be honest I am not sure what to say. It is hard for me to believe that we live in a world where this is now possible.

The thing that finally convinced me was repetition. Produce one image like the above and you can dismiss it. Fifty, though? Eight Bierstadt paintings of Half-Dome at Yosemite, shrouded in morning mist? A dozen sunsets in the vivid style of Leonid Afremov? Daguerreotype portraits of grizzly bears wearing sombreros?

At some point, I became a believer. Computers can now create art.

How the hell did this happen?

As it turns out, the question “how the hell did this happen?” is a useful one. Smart people have been asking a related question (how can we make this happen?) for decades.

Now that they have their answer, asking it again for ourselves can teach us something. I am especially partial to what it can teach writers; there are lessons there for word-jockeys who want to better understand their craft.

Before the code came together, after all, someone had to reverse-engineer art itself, to answer a megaton question: how can we design a creative process so powerful that a non-sentient machine can use it to bootstrap its way to being an artist?

Call me crazy, but I think we writers can learn something from the answer to that question. Happily, the process is on clear display if you use Midjourney. I want to walk you through the creation of the image from my initial prompt (a beautiful woman holding a rose). Here it is, roughly divided into steps.

Static (0 % complete)

Midjourney starts with a panel of randomly generated static called a seed. Actually it starts with four of them; prompts are processed four times simultaneously so that the user can choose the result they like best. Below, you can see four such seeds; they are too blurry to make sense of right now. This is the raw material used to generate an image.

Image by Author (via Midjourney)

Basic Form (about 10% complete)

Now that you’ve seen the static, we can discuss the basic process that an AI like Midjourney uses to move from a seed to a completed work.

To grossly oversimplify, Midjourney starts by treating the random static as the blurred result of a “true” picture hidden underneath the noise of the random static. It then calculates how to de-noise the image.

The problem is that de-noising everything at once is way too difficult. So Midjourney splits the process into hundreds of gentle iterations. With each new iteration, the image is “nudged” to be more in line with the prompt. In the image below, you can compare the panels with the original static and see that the vague blurs and shapes have been subtly altered to become identifiable as a woman and a rose.

Image by author (via Midjourney)

Structure (about 25% complete)

At some point, the image coalesces and Midjourney starts to identify the placement of smaller features within the original, broader forms.

At this point the process seems to be a negotiation; not all of the (still blurry) basic shapes lend themselves correctly to fulfilling the prompt. As a result I have seen Midjourney change blurs that look like they should be one thing (like a person) into another entirely (like a rock), in order to better fulfill its prompt.

Image by author (via Midjourney)

Composition (about 62% complete)

At this point in the process Midjourney has produced clearly identifiable pictures. The art could be considered complete at this point, at least in the sense that the basic composition is there, but Midjourney is only two thirds of the way complete. What’s happening?

Well, it turns out that the last third is very important. This is the point where MidJourney’s work starts to alter the details of the picture more than its form or composition.

Image by Author (via Midjourney)

Details (100% complete)

I am repeatedly surprised at how much an image transforms in the last third of its journey to completion. The details make the difference between a painting of a starry sky and Van Gogh’s Starry Night. An artist’s style dwells in the details, and the details also appear (in AI art) to be the place where the most errors occur. You can see the results of the final touches in the picture below.

Image by Author (via Midjourney)

Midjourney’s Lessons for Writers

“A beautiful woman holding a rose” — Image by Author (via Midjourney)

The above portrait is the best of the bunch. It’s wonderful, of course, but can we really learn something from it? Your creative process is likely different from Midjourney’s, after all.

The brilliance of AI art, however, isn’t that it gives you a perfect process to replace your own. Rather, it offers you a new way of thinking about your work that can be flexibly adapted to your own writing style. You can use its principles to enhance your writing, diagnose problems, and break through mental blocks. Here are a few principles for you to consider.

Principle one: Iterate, iterate, iterate

The one principle I hope you take from this article is that iteration is powerful. In fact, many common pieces of advice about writing are actually suggestions to iterate.

For example, here’s an old debate; should you outline, or should you just free-write a shitty first draft and then polish it to create a better one? These seem like opposites, but both are actually just forms of iteration. Pick the one that makes you feel confident, and go.

The principle behind iteration is that multiple passes reduce the complexity of creation by spreading it across episodes. Producing great work in one pass requires intense mental effort — and this is true for both you and a computer. Producing great work over multiple passes reduces the “computation” load drastically so that no single iteration is too complex for your hardware to accomplish.

You can apply this immediately to your workflow if you procrastinate. Procrastination is a way of avoiding the pain of a high computational burden until you have a burst of last-minute adrenaline to help you handle it. People describe this as a feeling that a mental “switch” has been flipped.

Want to work without waiting for the switch to flip? Iterate. Reduce the burden so that you don’t need adrenaline. Iteration is the flowing river that carves away the mountainous impossible. Start gently, take several passes to clarify your ideas, and several more to write your draft. Each pass becomes the raw material that feeds into the next round of iteration.

Principle two: Seed and prompt

In the world of AI-generated art, creation results from the collision of a seed and a prompt.

The prompt is a user’s instructions to the AI. But writers have their own prompts — the high-level goals they set for a creative project (e.g. write an essay on surveillance capitalism).

The seed is a panel of randomly generated static. Life doesn’t offer writers many panels of actual random static, but we do still have our own form of randomness to use as seed material; the quirky details of our own lived experience.

You can combine a prompt (your big goal) with any number of seeds, each resulting in a unique creation. Your essay on surveillance capitalism will look radically different if you start by considering your iPad rather than the light bulb in your room. They may have the same core intent, but the seed(s) you choose will alter how it is expressed.

Midjourney starts with four seeds. Why? Well, variety allows the user to examine the result of the same prompt across multiple seeds, to pick what works best. It’s a great trick, and you can do it, too.

Changing seeds is powerful because it breaks an artist’s fixation on a single way of achieving their goals. Are you struggling with a project? Try changing seeds; keep your big vision, but express it through a different set of random material drawn from your life. This creative recycling is a great way to break free of a mental rut.

Principle three: The final third happens after composition is done

When I was younger I stocked shelves at a major retail store. Each night our final task was to “face” the aisles, which meant arranging every item in a perfect row, each sticking off of the shelf by a quarter inch.

That must sound ridiculous. But having done it, I can share an insight; there is something psychologically different about an aisle where everything is perfectly aligned. It is the visual equivalent of a firecracker. And it hinges on getting the details just right.

During AI art generation, when you request that a prompt be executed in the style of a specific artist (for example, the inimitable Van Gogh), some of the artist’s preferences regarding form and color appear during the first two thirds of the rendering, but it is only after the 60% mark that the subtleties that identify an artist’s style will appear.

In fact, for many artists who use more conventional form and color, the signature characteristics that mimic their style don’t appear at all until the final third. The details make almost all of the difference.

“Rainbow galaxies above a dark town at night, by Van Gogh” [Author’s; via MidJourney]

Our mind has emotional mechanisms triggered by details so subtle that we overlook them until they line up together in a way that leaves us stupefied. Their effects are profound; they are the difference between a face that you hardly notice and one that leaves you breathless. And they all happen below that quarter-inch level. That is where quality dwells — and also the markers of an artist’s unique style.

In Midjourney those details aren’t even touched until the last third of the composition process — after the composition already looks complete. Why? Well, in AI, it’s because the main picture needs to be resolved before the detail work can commence.

But this principle applies to your prose as well. Are you the kind of writer who completes the final third? That’s the work you do after your piece already looks complete to you. That’s your chance to polish the details, and shine.

A rose, painted in watercolor (generated via Stable Diffusion)

Thanks for reading! If you enjoyed this piece, you may like the following one, too:

Ai Art
Midjourney
Artificial Intelligence
Writing
Technology
Recommended from ReadMedium