avatarPaul DelSignore

Summary

Midjourney has introduced a new 'describe' feature that transforms images into text descriptions, enhancing accessibility, searchability, and creative prompt generation.

Abstract

Midjourney's innovative 'describe' feature revolutionizes the way users interact with images by converting them into detailed text descriptions. This tool not only aids those with visual impairments by providing ALT text for web displays but also improves search engine indexing and enriches image captions. The feature generates four distinct descriptions for a single image, which can be used to craft more nuanced prompts for further image variations. Users can upload an image, receive descriptions, and then use these to remix the image with modified prompts, offering new creative possibilities. The article demonstrates the feature's capabilities by comparing original and AI-generated prompts, showcasing the potential for diverse artistic interpretations.

Opinions

  • The author expresses surprise at the diversity of descriptions generated from a single image, noting their usefulness in prompt engineering.
  • The author endorses the new feature, stating a preference for a remixed image over the original, indicating the feature's potential to enhance creative outcomes.
  • The author enjoys the process, exemplified by the exclamation "I like it better!" when comparing a remixed image to the original.
  • The author highlights the feature's ability to generate descriptions that are both different and similar to the original prompt, suggesting a nuanced understanding of the image's content.
  • By using a NASA astronaut photo as an example, the author demonstrates the feature's versatility and its application to real-world images, not just artistic creations.

Midjourney’s Crazy New Describe Feature

Helping You Write Better Prompts

made by author in Midjourney

Midjourney released a new ‘describe’ feature that lets you transform images-into-words.

“We think this tool will transform your liguistic-visual process both in terms of creative power and discovery.” - Midjourney team

The Importance Of Image-To-Text Descriptions

Image descriptions have important broader implications that are worth mentioning:

  1. Improved accessibility: Image descriptions make digital content more accessible for people with visual impairments or reading difficulties. This is done via the ALT text element for web displays.
  2. Enhanced searchability: Descriptions can enable better search functionality and indexing via search engines.
  3. Use for captions: Captions can incorporate descriptions to provide additional clarity to images.
  4. Detailed Prompts: Descriptions can be used to create more detailed prompts for crafting new variations. They can provide inspiration for prompt engineering.

Midjourney will generate four different descriptions based on an image you upload and makes it easy to generate new variations.

How It Works

The way it works is you simply start by writing /describe and Midjourney provides a way to upload an image.

screen cap by author

After you upload the image, you click enter

screen cap by author

Midjourney then returns four descriptions based on the image

screen cap by author

The four numbers on the bottom are active remix buttons — each number matching the corresponding description. Clicking on the number will remix the image based on the new description.

You can also modify the prompt via the remix:

screen cap by author

This is actually a cool remix version: I like it better!

made by author in Midjourney

This was the original prompt I used to create this sample image:

an illustration of a brain with tree roots, psychedelic art, vibrant, by Alex Grey, by Amanda Sage, by Robert Venosa, neon colors

And this is one of the prompts that Midjourney described, that I used for the remix:

An image of an abstract brain tree with roots, in the style of mark henson, luminous colors, dark symbolism, detailed anatomy, bold lines, vibrant color, psychological phenomena illustrations, chiaroscuro woodcuts

I am surprised to see how different the prompts are in comparison but are somewhat similar.

Just for fun, I uploaded a photo of the NASA astronauts via the new moon mission — and had Midjourney describe and generate a new version of AI astronauts.

1st image credit: Josh Valcarcel/NASA / 2nd image: made in midjourney

NASA astronauts group pose for a photo, in the style of photorealistic portraits, dark cyan and orange, uniformly staged images, romantic depictions of historical events, celebrity portraits, hasselblad h6d-400c, non-representational — ar 117:77 — v 5

If you liked this article, throw out some Medium love… claps, comment, and be sure to follow.

You can also support my work on Medium by becoming a member using this referral link.

More From The Generator

Midjourney
Technology
Artificial Intelligence
AI
Generative Ai
Recommended from ReadMedium