What Is Guidance Scale In AI Art Generation

Samuele
4 min readNov 3, 2022

--

This story was written with the assistance of an AI writing program.

In the last few days I have become obsessed with the art generated through Artificial Intelligence. I don’t have the knowledge to really understand how these algorithms work, but I really enjoy experimenting with the various options. One of the weirdest concepts, for me, is guidance scale. In this post I will try to understand what it means.

So, as far as I can understand, it is a number that indicates how important the prompt is (i.e. the textual description of the image as a general). The lower the number, the more “creative” the image will be; the higher the number, the more relevant the image will be to the prompt.

Better to give an example. I use this text as a prompt:

ultra-cute Moebius king, a starry background, high quality, 3d render 4k, soft shadow, soft light

I start with Guidance Scale = 1. The result is this:

Image by Samuele

I then switch to Guidance Scale = 5

Image by Samuele

I continue with Guidance Scale = 10

Image by Samuele

Now uidance Scale = 15

Image by Samuele

I end up with Guidance Scale = 20

Image by Samuele

As you can see, the best results lie somewhere between 5 and 10. Of course, it also depends on the prompt. Sometimes the best result is the result of a low value, other times a high one.

Image by Samuele

Other examples

I keep trying. I intend to use some suggestions from Jim Clyde Monge as a prompt. By the way, I recommend following this author, he has very interesting articles, and even if I’m not an AI expert, I really enjoy reading them. For me he is a source of inspiration, and I think I will write a few more posts starting from his suggestions.

That said, let’s do some testing with the prompt:

a misty valley with exposed fossils, extremely detailed oil painting, unreal 5 render, rhads, sargent and leyendecker, savrasov levitan polenov, bruce pennington, studio ghibli, tim hildebrandt, digital art, landscape painting, octane render, beautiful composition, trending on artstation, award winning photograph, masterpiece
Image by Samuele

Now I try this prompt:

cut sticker, kawaii, blue puppy
Image by Samuele

Finally, a prompt taken from an image, the pillars of creation:

hubble space telescope pillars of creation

The result I want to get is something like this:

This instead is what Artificial Intelligence creates:

Image by Samuele

As a last test, I try with a portrait (the prompt is taken from this images)

character portrait of young woman as a heroic retrofuturistic punk, pixie cut with shaved side hair, bad attitude, dystopian cyberpunk steampunk soviet mood, intricate, wild, highly detailed, digital painting, artstation, upper body, concept art, smooth, sharp focus, illustration, art by artgerm and greg rutkowski and alphonse mucha, vibrant deep colors
Image by Samuele
Image by Samuele

Thanks for reading! Stay tuned for more.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

Samuele
Samuele

Written by Samuele

I'm a hobby programmer, experimenting with Svelte, Javascript, Construct 3 and magic tools

Responses (1)

Write a response