Generative AI Art Generation: Mastering all the AI Tools - Midjourney, BW, DALL-E, SD, Runway, etc.

Henry Learning, Instructor | AI Entrepreneur

Get unlimited access to every class

Taught by industry leaders & working professionals

Topics include illustration, design, photography, and more

Get unlimited access to every class

Taught by industry leaders & working professionals

Topics include illustration, design, photography, and more

Lessons in This Class

- 1.
  
  Welcome to AI Art Generation
  
  3:05
- 2.
  
  AI Image Generator Apps Introduction
  
  12:04
- 3.
  
  AI Image Editing Apps Introduction
  
  7:49
- 4.
  
  DALL-E Introduction
  
  7:24
- 5.
  
  DALL-E Image Generation
  
  10:22
- 6.
  
  DALL-E Image Editing
  
  9:47
- 7.
  
  DALL-E Outpainting
  
  8:44
- 8.
  
  Prompt Examples
  
  13:11
- 9.
  
  New Update: Comparison Between DALL-E 2 and DALL-E 3
  
  14:00
- 10.
  
  New Update: DALL-E 3 with Bing vs DALL-E 3 with ChatGPT
  
  6:47
- 11.
  
  New Update: DALL-E 3 with ChatGPT
  
  17:35
- 12.
  
  New Update: Examples and Use Cases of DALL-E 3
  
  17:55
- 13.
  
  New Update: DALL-E 3 New Parameter Gen_ID
  
  11:07
- 14.
  
  Prompt Writing - Subject and Medium
  
  10:17
- 15.
  
  Prompt Writing - Composition, Action and Details
  
  10:59
- 16.
  
  Prompt Writing - Negative Prompt, Stylizers & Modifiers
  
  11:30
- 17.
  
  Prompt Writing - Artists
  
  8:12
- 18.
  
  Prompt Sample - Portrait
  
  14:52
- 19.
  
  Prompt Sample - Landscape
  
  10:02
- 20.
  
  Prompt Writing Resources
  
  9:02
- 21.
  
  Lexica Introduction
  
  8:11
- 22.
  
  Lexica Features
  
  9:39
- 23.
  
  Lexica Image Generation
  
  7:15
- 24.
  
  Prompt Guidance Parameter
  
  10:28
- 25.
  
  Lexica Image to Image Generation
  
  9:22
- 26.
  
  DreamStudio.ai Introduction
  
  8:43
- 27.
  
  DreamStudio Features and Models
  
  12:27
- 28.
  
  DreamStudio Image Generation & Seed Parameter
  
  14:25
- 29.
  
  BlueWillow Introduction
  
  4:29
- 30.
  
  BlueWillow Overview and Discord Setup
  
  10:32
- 31.
  
  BlueWillow Image Generation Part 1
  
  10:10
- 32.
  
  BlueWillow Image Generation Part 2
  
  14:32
- 33.
  
  BlueWillow Image to Image Generation
  
  11:15
- 34.
  
  Midjourney Introduction
  
  9:39
- 35.
  
  Midjourney Overview, Setup, and Basic Commands
  
  18:04
- 36.
  
  Midjourney Text to Image Generation
  
  14:17
- 37.
  
  Midjourney Image to Image Generation
  
  13:06
- 38.
  
  Midjourney Basic Commands - Blend
  
  12:24
- 39.
  
  Midjourney Basic Commands - Describe
  
  8:50
- 40.
  
  Midjourney Prompt Writing - Keyword
  
  13:07
- 41.
  
  Midjourney Prompt Writing - Option Set
  
  9:41
- 42.
  
  Midjourney Prompt Writing Resources
  
  5:00
- 43.
  
  Mijourney Parameters - Image Weight, Quality and Stop
  
  8:28
- 44.
  
  Midjourney Models
  
  7:36
- 45.
  
  Midjourney Parameters - Stylize
  
  7:53
- 46.
  
  Midjourney Parameters - Chaos, Tile, Seed and Remix
  
  12:51
- 47.
  
  Midjourney Emojis
  
  2:22
- 48.
  
  Midjourney Image Generation Example- Portrait
  
  9:51
- 49.
  
  Midjourney Image Generation Example - Logo
  
  7:35
- 50.
  
  Midjourney Image Generation Example - 3D Render, Anime, Characters, Landscape and Concept Art
  
  11:57
- 51.
  
  Midjourney Bonus Video - Faceswap with InisghtFace
  
  9:25
- 52.
  
  New Update: Midjourney New Features
  
  15:18
- 53.
  
  Introduction to Basic AI Photo Editing Tools - bigjpg.com and vectorizer.ai
  
  8:56
- 54.
  
  Basic Photo Editing Tools - Segment-anything.com
  
  7:54
- 55.
  
  Basic Photo Editing Tools - Creatorkit.com
  
  6:51
- 56.
  
  ClipDrop Introduction
  
  2:38
- 57.
  
  ClipDrop Tools Overview - Stable Diffusion, Uncrop and Reimagine XL
  
  8:19
- 58.
  
  ClipDrop Tools Overview - Cleanup & Remove Background
  
  11:46
- 59.
  
  ClipDrop Tools Overview - Relight
  
  7:39
- 60.
  
  ClipDrop Tools Overview - - Image Upscaler
  
  9:13
- 61.
  
  ClipDrop Tools Overview - Replace Background
  
  10:14
- 62.
  
  ClipDrop Tools Overview - Text Remover
  
  9:36
- 63.
  
  Adobe Firefly Introduction
  
  9:21
- 64.
  
  Adobe Firefly Text to Image - Portrait & Logo
  
  15:04
- 65.
  
  Adobe Firefly Text to Image - Illustration, Anime, Landscape and Concept Art
  
  11:26
- 66.
  
  Generative Fill - Logo Editing
  
  16:47
- 67.
  
  Generative Fill - Portrait & Product Photo
  
  6:31
- 68.
  
  Text Effects
  
  5:57
- 69.
  
  Generative Recolor
  
  8:44
- 70.
  
  RunwayML Introduction
  
  5:15
- 71.
  
  Text to Image Generator
  
  8:01
- 72.
  
  Train your Own Generator
  
  9:29
- 73.
  
  Image to Image and Infinite Image
  
  11:11
- 74.
  
  Image Expansion
  
  9:18
- 75.
  
  Frame Interpolation, Erase and Replace
  
  10:26
- 76.
  
  Backdrop Remix, Image Variation and Add
  
  10:15
- 77.
  
  Upscale Image and 3D Texture
  
  6:49
- 78.
  
  Leonardo.ai Introduction
  
  5:42
- 79.
  
  Leonardo.ai Overview
  
  6:35
- 80.
  
  Image Generation - Text to Image
  
  13:49
- 81.
  
  Image Generation - Leonardo Parameters
  
  10:00
- 82.
  
  Image Generation - SD Parameters, Schedule & Sampler
  
  11:00
- 83.
  
  Image to Image Generation & ControlNet
  
  15:57
- 84.
  
  AI Canvas - Outpainting
  
  10:44
- 85.
  
  AI Canvas - Inpainting
  
  13:39
- 86.
  
  Leonardo - Training a Model
  
  15:56
- 87.
  
  Astria.ai Introduction
  
  7:45
- 88.
  
  Astria - Training a Model
  
  14:00
- 89.
  
  AUTOMATIC1111 Introduction
  
  6:15
- 90.
  
  AUTOMATIC1111 Google Collaboration Setup
  
  7:47
- 91.
  
  AUTOMATIC1111 RunDiffusion Setup
  
  17:08
- 92.
  
  AUTOMATIC1111 Basic Parameters
  
  10:10
- 93.
  
  AUTOMATIC1111 Parameters - Sampling Steps, Sampler, Seed & CFG Scale
  
  11:09
- 94.
  
  AUTOMATIC1111 ControlNet
  
  7:40
- 95.
  
  AUTOMATIC1111 Hires Fix and Image to Image
  
  7:29
- 96.
  
  AUTOMATIC1111 Inpainting Extras
  
  15:09
- 97.
  
  Create a Comic Book Version of Yourself in Astria.ai & AUTOMATIC1111
  
  14:04
- 98.
  
  Create a Book Cover in Midjourney & AUTOMATIC1111
  
  18:47
- 99.
  
  Create a Logo in Leonardo, Midjourney, ClipDrop, Vectorizer and Firefly
  
  20:07

Beginner level

Intermediate level

Advanced level

All levels

797

Students

Projects

About This Class

CLASS UPDATE!!!

Exciting news! We've made big updates to our DALL-E and Midjourney sections!

Discover the latest about DALL-E 3 with new videos showcasing its unique features, cool applications, and how it integrates with ChatGPT-4. Get a sneak peek into the DALL-E 3 with Bing vs. the DALL-E 3 with ChatGPT and explore the Gen_ID parameter.

Explore the upgraded features of Midjourney, including the groundbreaking "/tune command." Learn how to upscale your images, polish your editing techniques, and explore the fun possibilities of the --weird parameter.

Class Overview:

In this class, we teach you all the AI art generation tools you need to know to become a well rounded AI artist.

This includes Midjourney, DALL-E, Leonardo, Stable Diffusion, Automatic1111, RunwayML, Adobe Firefly, BlueWillow, and more.

But we don't just stop there... In this class, we'll also introduce AI photo editing tools like Bigjpg, Vectorizer AI, ClipDrop, and CreatorKit. You'll even get to work on exciting projects using the skills you've acquired.

What You Will Learn:

Become a well-rounded AI artist
Master over 10 AI Image-generating tools
Learn more than 4 AI photo editing tools
Explore various AI art generation techniques
Perfect your prompt writing with different parameters
100+ downloadable prompts

Why You Should Take This Class:

Taking this class is a remarkable opportunity for anyone seeking to dive into the realm of AI-driven industry. Most other lessons focus on just one tool, whereas we teach a diverse array of over 10 cutting-edge AI image generating tools, allowing you to master the latest and most powerful technology in the field. With various AI editing tools, you'll be equipped to refine and enhance your creative projects to professional levels.

Who This Class is For:

This class is for beginner to advanced users interested in not just mastering one tool, but all AI art generation tools to become a well-rounded AI artist. Whether you're just starting or looking to elevate your skills, this course offers valuable insights and hands-on experience to enhance your creative abilities.

Materials/Resources:

To engage in this course, you'll need a computer with internet access. Additionally, a Discord account, which will be essential for tools like Midjourney. You'll also need accounts for the various AI art generating tools included in this course as these will enable you to fully utilize their capabilities.

For prompts used in the course, see downloadable resources.

Meet Your Teacher

Henry Learning

Instructor | AI Entrepreneur

Teacher

Hi there! I’m Henry, a data consultant to Fortune 500 companies, no-code specialist, and AI enthusiast based in Canada. I run Henry Learning.

We are a passionate group of advocates of no-code application development in business, as they are much easier to understand and deploy quickly. We also believe that AI and automation can make everyone's life easier, and am on a quest to teach these skills to as many people as we can. We firmly believe that no-code and AI solutions are the future of work, and that they will be integral in our every day lives.

We also love teaching and mentoring students on a variety of topics including AI, no-code, automation, data analytics, and visualization, and more. We are committed to creating enga... See full profile

Related Skills

Midjourney AI & Innovation AI Fundamentals Generative Art

Level: All Levels

Hands-on Class Project

For the class project, you have 3 options:

Channel your inner superhero as you transform yourself into a comic book character with the magic of Astria AI and A1111.
You can unleash your artistic flair to design an eye-catching book cover with the dynamic duo of Midjourney and A1111.
Craft a unique logo that speaks volumes with the power of Midjourney, Leonardo, ClipDrop, and so on.

(See class videos 94 to 96).

To share your work, post screenshots of your generated images.

Manfred Prantner 2 likes

Class Ratings

Why Join Skillshare?

Take award-winning Skillshare Original Classes

Each class has short lessons, hands-on projects

Your membership supports Skillshare teachers

Learn From Anywhere

Take classes on the go with the Skillshare app. Stream or download to watch on the plane, the subway, or wherever you learn best.

Transcripts

1. Welcome to AI Art Generation: The medium for artists have always changed over history, from cave paintings to sculptures, to architecture, to canvases, to digital art, to virtual reality. And now we are in the realm of A I. There are now dozens of AIR generation tools out there. And as an AI artist, you need to be familiar with all of the important ones. And that's what this course is all about. Welcome to AIR Generation, Mastering all the AIR Tools. We created the scores because we saw teachers making A generation horses, but only talk about Meat Journey, or only Dali, or only Runway. That's not the right way to think about it. Imagine you're learning to be a carpenter. You don't just learn how to use a hammer. Instead you learn how to use all the tools like a screwdriver, arrange, and so on. Different tools to different things. For example, Meat Journey is great for the realism but it's not a tool for, in painting. For that you can use Leonardo if you're doing image customization, stable diffusion using automatic 11. 11 is your tool. How about if you need to upscale, relight, or edit, then clip drop. Adobe Firefly is the way to go get my drift. This class is created for someone that wants to be a well rounded artist, that's who we are. I'm Katrina and I'm an artist and videographer that creates a still video content for tech firms using AI technologies. And I'm Henry. I teach AI technologies to people to help enhance their productivity and create automations. In this course, we deep dive into the 12 most used AIR generation tools. We create photorealistic images with mid journey. We do image to image generation with blue willow. We relight and reshape with clip drop. We do generative fills with Adobe firefly. We do out painting with Leonardo. We even apply a custom model using stable diffusion, automatic 11, 11 and much, much more. Now this is an 11 hour course, but trust me, by the end of this course, you will be a well rounded AI artist, will be competent in all of the popular AI art generation type tools. The class project is by far the most exciting thing that we've done. In fact, you will have three options. You can either make a logo for business, you can make a computerized version of yourself, or you can make a book cover. So what are you waiting for? Start the journey to become an artist Today, it's as easy as clicking this button. We'll see you inside. 2. AI Image Generator Apps Introduction: In this course, we will be covering a lot of AA tools and apps. It's easy to get confused, that's why I created this chitchat that will help you navigate through how this course here I've listed the major image generator platforms that will cover. Here we have Ali images, do Lexcams Studio, blue willow, mid journey, leonato, and Automatic 11, 11. Let me now tell you a little bit more about each platform for the model. Dali uses a proprietary model. Dali images is based on stable diffusion. Lexica Art is also stable diffusion based. They have their own proprietary model. Dream Studio is using stable diffusion. Blue Willow is stable diffusion based. They use their own algorithms to choose a specific stable diffusion model based on your prop. Then we have Mid Journey. They have their own model. Leonardo is also stable diffusion based. So they have a lot of stable diffusion models as well as a proprietary model. And in Automatic 11, 11 is where you can use any open source staple diffusion model. How easy it is to use each platform. Delhi Images I Lexica. It's pretty easy to use for Dream Studio, I would say. Not really because they have a pretty difficult user interface for blue willow and in journey it's pretty easy as long as you have Discord account. If you don't have that, you'll have to sign up for Leonardo is also somewhat easy. The thing with Lado, they have lots of settings and parameters which is great for more advanced users. And automatic 11, 11 is not beginner friendly, it's for advanced users. Then some platforms would be on their website, some will be on Discord like Blue Willow and Mid Journey. And for Automatic 11, 11, it's a web interface of stable diffusion which you can install locally on your computer or use a cloud server. Next, for some platforms, you would need to write a longer prompt to generate better images. However, for other platforms, that's not the case, and basic prompts work as well as longer prompt. Those platforms are Lexica and Mid Journey. If you write some basic prompts with these platforms, you will generate great images. Then I've listed some features that you can find on each platform. The first one is Image to Image Generation. This is where you can upload your image and generate based on this image. For Dali. You can do that for Lexica.org You can do Dream Studio? Yes. Blue willow, lado and Automatic 11, 11. Everything except images. Then we have in painting allows you to edit a specific part of your image basically by erasing and writing a new prop. Some of the platforms that have this feature are Dali, Leonardo, and Automatic 11 11 for out painting. Out painting is basically extending your image. The platforms that have it are Al Lexico art, blue willow, mid journey Leonardo and Automatic 11 11 for image quality. This is my personal evaluation. I found that with Lexico Art, you get high quality images mid journey, great images, Leonardo and Automatic 11, 11. You can also achieve really high quality images, but with Leonardo and Automatic 11, 11, they require some experience to get high quality images. So you need to know the settings that you work with. Then for the level I assigned. Also, based on my evaluation of the platform, I think Del images and lexico dot art. Are more beginner friendly because they have simple interface and just easy to use. Then for dream studio blue willow and mid journey, it's more intermediate, especially for blue willow and mid journey, you need to create a discord account, Leonardo AI. I would put it somewhere between intermediate and advanced because I think it has too many features for a beginner user. For Automatic 11, 11, that's definitely for advanced users. So let's check out some pros and cons for each platform. Okay, here are the pros and here are the codes for Ali. It has simple interface, it's easy to use, but the images are pretty low quality and the plan is quite expensive. Images right now it's free to use. It also has simple interface. Some limitations here is that there are no profile with history of generations. Also, it's sensitive to some basic words, some things you won't be able to generate with images. Lexica, it has simple interface, it generates high quality images. It is also an image search engine where you can look for image and prompt inspirations. It's great with short prompt. It allows to generate images in a private mode with a paid plan. The basic plan is pretty cheap. The limitation here is that it only uses its Lexica models, which give a specific aesthetic to the images, which may not be great for all the art. The next platform is Dream Studio. It was created by a company behind Table Diffusion. Here you get access to newest table diffusion models. It also has some advanced settings for S. It has difficult UI and also limited models then Blue Willow. It's beginner friendly and it produces good quality images. But some limitations here is that it's on discord, so you need to have a discord account, and it also has limited settings. There is little control over image generation for mid journey. Mid journey creates really high quality, realistic images. It's great with short prompts. You can generate images in a private mode and the basic plan is also pretty cheap. Some limitations here is that it's also on discord. You need to have Discord account. Right now, they don't have image editing yet except for Out painting. If you need to change small things on the images that you generate with mid journey, then you would have to use a different platform. Our next platform is Astra. It has simple interface and it's best for high quality custom model training. It's also based on stable diffusion, it's also pretty cheap for the cons here is that the settings it has may be quite limited. For a more advanced user. Leonardo I has many settings. It has a lot of stable diffusion models. It allows you to generate images in a private mode, and the basic plan is also very cheap for the cons Ganado is still in the wait list mode. If you want to use it, then you should register in advance. Also, it has quite advanced. A user interface has a lot of features and settings, so it can be quite overwhelming for a beginner. Then our last platform here is Automatic 11 11. It has high image customization and control, lots of settings, features extensions. A big benefit is also that you can use your custom models for the coins. Automatic 11 11 is not beginner friendly, it's definitely for more advanced users. In the next few slides, I will show you what images I was able to generate with most of the platforms here. Okay, here. The first one is L here. As you can see, I've tried to use different art types to showcase the platform. Okay, let's move on to the next one, Lexica. Especially with the portrait, it has this really nice aesthetic Blow willow, here are some images. Then mid journey here we really get creative with some of the artwork. Adobe Firefly. I found that it's really good for logos and illustrations, but right now it's not for commercial use yet. Then there is Leonardo. The images that I generated here use the Leonardo diffusion with prompt magic. Then here we have automatic 11, 11. Every image I've created with different model. Here are the settings that I've used and the models. Now I want to show you some image to image generation with different platforms. Here is the original image, Here is the result with different platforms. This is Al here. It wasn't graded with proportions at all, The body figure is all distorted. But Lexica and mid journey were really good with more simple image here. Let's see, the results here is with day to lexica and my prompt was iridescent wolf magical colors. This is the result with Lexica mid journey and Leonardo with Daly. You cannot actually add your prompt here. It just creates variation of the image. 3. AI Image Editing Apps Introduction: Now let's move on to photo editing platforms. Before we've discussed the AI generating platforms, here are photo editing. Actually all of the photo editing platforms here also have image generation. As you can see, text to image model, that's the image generation. The clip drop uses stable diffusion until the Firefly has its own proprietary model. Runway is stable diffusion based how easy it is to use. They are all pretty simple to use. Now let's take a look. What features do the use platforms have for image to image? Here we have only runway for image variations, we have clip drop here in the brackets is the name of the tool here. Reimagine Excel allows you to do image variations in clip drop in runway. It's image variation for out painting. Again, we have clip drop with the tool called crop in runway. There are two tools that allow you to do out painting, extend image and infinite image. For in painting, we have adobe firefly and runway. Here with firefly it's called generative fill in runway it's called rays and replace. Then we have clean up. The difference between clean up and in painting. In painting allows you to replace the area that you don't like with something else. Whereas clean up will just erase the thing that you don't want to see on the image. With a clip drop, it's called clean up with Adobe Firefly. You can do that in generative fill. Replace background, you can do it in clip drop. The tool is called Replace Background in Adobe Firefly, it's called Generator Fill in Runway, it's called Backdrop Premix. Or you can also use a different tool called Creator.com to remove background in your image, you can use Clip Drop. And the tool is called Just Remove Background, or you can use the website called Segment, Anything.com by meta. The next feature is Up scalar. For Clip drop, it's called Image Scalar Runway, Upscale image, or you can also use a different website called piggpgt com. Here are some other tools that each platform has here. For clip drop, we have relight and text remover. For Adobe Firefight, we can generate cool text effects here. They also have generative recolor that allows you to change the colors of a vector image. Then for runway, we also have a color frame interpolation, three D texture, and model training. Another website that I also want to talk about is the vectorized I, that allows you to create a vector image from GPG PNG image. In the next few slides, I will show you how some of these features work. Let's take a look. Photo scalar, Here is my original image with a low resolution. I wanted to upscale this photo. Here are the different platforms that allowed me to upscale the photo. Here are the different results. My favorite one was the clip drop for X, smooth for artwork up scalar, this is the image and here again, different platforms, this time for artwork upscaling. I like the big GPG way more than the other ones. For background replacement, here is my original image from mid journey. Here are some images that I could get with different platforms. Here I use the Create It. It produced nice results, but in lower solution. Here is the clip drop. Here is Adobe Firefly and Runway. As you can see, Runway tried to add the platform here to my product. Next, let's move to Image variations. Here is my original image. I generated it with my journey. So let's take a look at what results did we get from different platforms. Okay, here we have clip drop with the re, imagined Excel tool and Runway. Here we've got in very similar style but now the furniture is arranged in different manner. We're asked for runaway. We've got the same composition but just slight modification in color and style of the furniture for out painting. Here is the original image, you are probably familiar with this meme. Let's see how different platforms extended the image. Here we have clip drop with the tool called and crop and runway with the tool called Extend Image. Here are different results. From my experience with clip drop, you would get a bit better out painted images than with runaway with a fewer artifacts. But still is not great especially keep. You see the hands and the legs. Next is remove background. Here is the original image here. I chose the image with the hair because it's the most difficult thing to remove. Background with hair here, let's see, here is the result with segment, Anything.com and here with Clip drop. Remove background here, it's apparent that Clip drop did a way better job. Now let's remove background from a more simpler image. This is the original image. Here is segment anything.com and clip Drop. As you can see, the clip drop doesn't have these pixilated areas at the bottom here. Again, Clip performed better here. Here I've compiled a list of all the features that you will learn in the scores so that you know what platform you can use for specific feature such as remove text, You can use drop here, let's see. For example, colorized black and white images. Then you can use Runway Had Color. For some, there will be multiple of platforms that have that tool. For example, train your model. You can use Runway, Leonardo or Austria on that node. See you in the first module. 4. DALL-E Introduction: Hi everyone. Today I want to start off by introducing you to Dali, which is one of the simplest AI image generation platforms. Dali stands for the noising order encoder for learned language embeddings. It was developed by Open AI. It's the same company behind Chip. It was introduced in January 2021. It uses an algorithm similar to stable diffusion because Dali relies on a process called diffusion. The image generation starts with a random set of noise, which is basically a random arrangement of pixel values. Then this gradually is modified in a series of steps to make it match a given prompt. By starting with a different set of random noise. Each time different images, different results can be created from the same prompt. The process is called diffusion because it involves spreading out changes across the image. Each step of the diffusion process makes small adjustments to the pattern, making it more and more like the desired image. You can think of it like looking up in a cloud sky and finding a clown that resembles some object or an animal. You can make it more clear and defined in your imagination. That's basically the diffusion process. More advanced model became available in April 2022. That's what is being used right now, it's Ali version two. Now I would like to outline some advantages and disadvantages of using the platform to start off. One of the advantages is that it's very easy to use. If we go to Ali, you just need to type your prompt here and click generally. There's no extra features. Very simple to use for a beginner that it will generate an image according to your prompt. Another good thing is that it gives some free credits every month. You can explore the platform for free. Also, it can generate images in different styles. If we go to the gallery, you can see images in different styles. I would say realistic styles like this one paintings realistic impressionism, cartoons as of styles, which is great. Another great thing about Delhi is that I can upload and edit images. If I go to the website, there is a button to upload an image. I can upload an image here, I can make variations of my image or edit the image here, there are two ways I can modify my image. First is in painting, basically I can replace certain part of my image with something else. For that, I just need to erase the part of the image that I don't like. Write a prompt, it will generate a different image with this part. Another feature is that I can make an out painting, basically extend the boundaries of my image. Again, I can write a prompt and it will extend my image according to the prompt that I write. Explore in the later videos. In terms of limitations day produces quite simplistic images compared to other platforms. It doesn't have as much details, it's not as creative. Another thing, it only produces square images if we go to the gallery. As you can see, all the images are squares, for example, if you want to create a landscape dimension or a certain aspect ratio. Unfortunately, here you cannot do that. It's only going to be a square format. Another problem with Dell is that it cannot quite produce potter realistic images, because when it renders a phase or hands, it makes mistakes. Let me show you what I mean. This is a problem I tried and this is the image I got. If we zoom in, you can see some artifacts. The teeth are not friended correctly. The eyes I'm not sure if it's drawing, so there's big problems with her facial features. Yeah, I wouldn't recommend using deal for high quality, for realistic images. Another thing with Ali is that similar to stable diffusion, it requires detailed prompt. If you want to create an image with more details or you are looking for specific image, then you need to write a very clear detailed prompt. Otherwise, you would get very ambiguous results. I want to show you these four images were all created with the same prompt, a girl in a scarf, and as you can see, the results are all over the place. We have some realistic images, we have cartoon. Make sure you add as much details as possible and open actually recommends to have more detailed prompt. Even if you write a very detailed prompt, the application sometimes may struggle to produce the desired output. Just because it may interpret certain things differently. And you may need to iterate a few times and generate more images to get what you want. This is very similar to other image generating models. In the next video, we'll start exploring Ali and I will show all the features in detail. See you then. 5. DALL-E Image Generation: In this video, let's dive in into Dali and let's start exploring it. The first thing you'll need to do is to go to the website and this is what you'll see. You can read more information about Dali. Here's an introductory video and some articles about Ali and latest updates. It also outlined some features such as out painting in painting. This is what I covered in the first video, variations of course, image generation. What I want to say is that when Daly first came out, it really amazed people. Because the way it combined different objects, concept attributes and styles in a surreal image, and yet making it auto realistic, that was only possible in our own imagination, in our head. And here we go, we have the stool that can take our imagination and put it in a canvas. So what Daly is best for is making this crazy, unreal, surreal images. You can put lots of concepts into it and make your vision come true without further ado. Let's go ahead and login or sign up for Ali. If you already have an open AA account, for example, if you use GPT, you can log in with the same account, otherwise you'll need to sign up. But it's very easy, All you need to do is give your e mail address and they will also ask for your phone number. I will log in after you log in or sign up. This is the page you'll see. Let me just talk a little bit about how it works here. This is where you will write your prompt. There is also a feature to surprise me that will just give you a random prompt. If you need some inspiration, you can click on it and it will give different prompts every time. For example, a submarine, a bowl of soup, that's also a portal to another dimension, digital art, for example. This sounds really interesting to me. You can go ahead and click Generate. After loading, it's going to give me a few images. Okay, here we go. We guide our Bowl of soup. That's a portal into another dimension. Let's say if you like a certain image, I like this one for example. I can click and I have an option to download it. Or we can make variations by clicking. Variations is going to make a few more similar images to the one I chose. Okay. As you can see, the style is very similar to this image that I chose. It's all black background. The ball first. A little bit. Yeah. As you can see, it's not as detailed as it could be. Okay, now if we go back, I want to talk a little bit more about what styles we can create with Ally. For this, it's best to use the gallery as inspiration because it has really good examples and actually really good examples. Because I would say 60, 70% you get really strange outcomes. For example, this one is a three D render. This one is an expressive oil painting. This one is a photo, it doesn't say the style. This one is style paper wave. This one bang style. As you can see, every image is produced in different style. The style is not specified. Sometimes it's the artist who is specified. Johannes Mu is the artist of a girl with a pearl. As you can see, the style is very similar, but now it's an animal. Again, here's a hand drawn sketch. This one is a photo cyberpunk. This one is an oil painting. An oil pastel. This one is a cartoon. This one is a pencil. And what a color drawing again, This one is a three D render. This one is a comic book style, let's say a certain style of the image, for example, the style. You want to generate images with the same prompt, then you can go ahead and click Try this example. This will images with the same prompt. If you want to modify the prompt a little bit, you will need to copy it. Let's copy it and base it here. Let's say you want a cat with a pearl earring and click Generate. This is what we've got. These images are AI's interpretation of a cat with a pearl earring by Johannes Mir. As you can see, all these images are quite similar and yet different. On every image we see a cat wearing some blue scarf and pearls. Well, this particular image does have pearls, but overall the style is very similar to Johannes Mir. This is what the Dali algorithm does. It takes the prompt, interprets it, and then gives off different images. Now we can try some more examples and try to challenge AI to see what prompts it will give. Let's try a giant bearing, riding a bicycle and a cartoon style generate. Okay, great, I actually like the first image and the last, although it still needs some editing because you can see the bicycle handles are not rendered well, and the E as well, as well as the last image, eyes are not rendered correctly. It needs some editing. The others, I would say, are worse just because they like the details. As you can see, again, the cartoon styles are all different here. If you want a specific style, then you need to specify it in the prompt here. Let's try some more examples. Let's do, for example, animals Atlantis lost city of Atlantis. Let's make it a digital art. Okay, this is what we've got. On the first image, I actually see whales here. This is probably some architecture. This is all underwater. I got the concept of City of Atlantis correctly. However, all the other images, the second, third, and fourth, they are pretty bad if none of those looks like animals. N here actually this is something that you'll experience when you use AI generating platforms. Because some images will reflect your concept or your prompt really well. However, you will also get images that are really wrong. This is just a heads up, it's going to happen, you'll just have to generate a few more images, because the more images you generate, the higher the likelihood that one of these images will capture your concept correctly. Actually, in Ali, if you don't like any of the images, you can go ahead and click on it. And there is a flag button. When you click on the flag button, there's two options. You can flag the image if it doesn't match your text description, which it doesn't. So I can go ahead and click it and hopefully open AI. Team will go and review the images that helps the team to build a better product and improve their algorithm. That's it for this video. In the next video, I'll go more in depth into how to make editing with Ali. 6. DALL-E Image Editing: In the last video, we tried a few products with Dali. And this is where we left off in this video. I want to show you more of Dali features. Let's go ahead and try all the features that Ali can offer to us. I want to make some edit on the image that I like. All you need is to click on it. It's going to expand it. Here's a button called Edit. We can click on it. As you can see here, the interface is very simple. There are only five buttons. And in order to edit the image, we need to select an eraser. So make sure eraser is selected. And when you move to the image, you'll see a wide circle and you can start erasing things that you want to replace in this image. For example, let's say if you made a mistake, you can always go back by pressing control on Windows or command on Mac on the right hand panel. We can also change the size of the eraser. For example, I want to make it a little bit smaller to make raising more precise. What I want to remove from this image are those squiggly lines. I'm not sure what they are supposed to be, but I don't like them in this image. I will remove them, and when you remove something, prompt will pop up and this is where you will write a prompt to fill those ******. I'll put underwater fish for example, and click Generate. After rendering is complete, you will see four different images that fill out those ****** in a different way. If we zoom in, these are the ****** I didn't like as you can see here. I fix those ****** and added a fish, I believe here. Yeah, it added a few fishes on the other image here. Now we can go and select the best one. I like this one the most. I'll go ahead and edit this one again. Now overall I like with the design of this image, what I would like to do is to expand the image to show maybe some more information. To do that there tool to make out painting and that's called a Generation Frame. Click on this button, you will see a square like this. You can drag it anywhere you want and place it anywhere you like. You can also zoom out the canvas a little bit. You have more space to create. When you zoom out, there is also this button pen. This allows you to move the canvas anywhere you want. Let's move it here. Now let's click on Select. This will allow us to select the frame again and place it anyway we want. Let's start with the frame here. I want to expand my image to the right. And let's zoom a little bit, maybe not that much. Okay, now let's write a prompt for this new generation frame. I want some more of this image. I will put Underwater City. Let's see what it's going to generate. Okay, interesting ideas here. As you can see the extension. This image is in the same style as the original image. Now again, it has four variations and we can see which one we like the most, like this natural curve here. After you look through all of them, choose the one that you think suits your image the most. And then click. Except if none of the images look good. Then cancel and try again. I'll click now. Let's do the same to the left side, move the canvas alto bit. Let's select the generation frame and move it to the left. Another thing what I want to say is it's important to capture a little bit of your original image. Because that's how AI knows in what style to create this new image. If you do like this, it will have little information I would always recommend to do at a third, to capture at least the third in your new generation frame. For example, let's do this, let's capture, let's put under water, lost city of Lantus. See what it's going to generate. Actually, this is better than what I expected. It still captured the style because we had a little bit, lots of cool ideas here. I like all of them, but I would leave this style very interesting idea here. I'll accept it. We're very likely that this is what it generated because it took very little information about our original image and created actually very similar style. All the details actually look very similar to what we have here. However, if the generation frame doesn't touch your actual image, then it will generate things in a completely different style. Let me show you. Let's zoom out. Let's put generation frame. Let's put generation frame completely unrelated. Let's do the same underwater Los City of Lantus and click Generate. As you see, let me as you can see this image is completely unrelated to our style, totally different. It's just a brand new AI's interpretation of underwater Los City of Atlanta. If you want to expand your image, make sure the generation frame overlaps with the image. Let's cancel this. You can go and expand your image up or down. But this image, I think we did a pretty good job. Of course, we can do further editing of some fish. Let me quickly do it. When you editing, make sure you return the frame back. So if you don't have your frame at the places where you erase things, it's not going to generate image here. It will generate image wherever you have the frame. I'll move it here first. Okay, This is something I like accept. And let's move the generation frame here. Okay, after a little bit of editing, I think we have a very nice image here, so we can go ahead and download it. So you can click on this button and save the image. This is what we have After all the work and editing, I think this image came out very well. That's how I would imagine the lost city of Atlantis. 7. DALL-E Outpainting: Now let's try something different. Let's applaud an image, for example. This one deal asks if you want to crop an image. This is important if you want to make variations. Because Delhi's limitation, it can only generate square images as an input, it also square images. If you don't want to make variations, then you can skip cropping and basically added the image the way it is. However, if you do want to make variations, then just what you would like and click Crop and then choose Generate Variations. Okay, This is what we've got. As you can see, the images are really bad because Ali is not meant for photorealistic styles. But let's try something different. Let's uploading an image of a famous painting of a starry night, for example. Let's crop it and generate variations compared to the ballerina. It did a way better job with this, with this style, just because it's more abstract. It looks really good here. Okay, now what I also want to show you is that you can take this image and if I go to the editing mode here, I can actually upload more images. Let's first zoom out a little bit here. I can upload some more famous art. I can upload an image like this. As you can see, it's way bigger than this one. I will resize it to match the other one. I think this looks pretty good. When you're done, click the place here, check Mark. Now let's zoom a little bit. And let's upload one more image. For example this one click place. Now I will go to the generation frame and put frame where both images overlap. For example, here I'll put an impressionist painting of, of it. Try to merge them. Quite interestingly, I like this one. I will quick accept and I will remove the sport. I'll go to razor and I'll remove the sport. I will move my frame down and repeat. I'll put impression painting of a down at night. Okay. It actually match pretty well here. Let's see what are the other ones, okay? I think this one is the best one. Let's accept. Let's accept again, let's erase the part. Let's continue like this. I think this is the best because it has street going up where you can see the town. I'll set the win in order to merge this painting. It would be best to erase some sharp ends just because it, um, separate everything. I'll just remove the sharp ends here. I will go and put my generation frame here. Presion painting of a town at night. Let's see what it's going to generate. As you can see, it detected that these two styles are very different. It made a clear separation here, but on the next one, it's pretty well done that it detected that this is a sea or an ocean. You have a little bit of a beach here. My favorite one is the second one. I'll accept it. Let's just continue and see what it'll come up with. I think this matches the style. Let's do this one. Let's turn a little bit more here, okay. Wow, that's interesting. Except some boats here. It's either number three or number one here. Definitely way more people. I think this one is more clear. Accept this one. This one, I'm not sure if it's a car or something, but why did it match it like this, kill it's cancel it. Okay. Understand. Because the part is very sharp. In order to help I merge it, let's erase the spart. Let's delete the sharp here. It will give AI more ways to merge the two images. Let's generate again, maybe a person will be noisy here, except let's do our final one again. Let's raise the sharp edges here. Let's again select, I'll move the frame a little bit here. Let's generate again. I think this one is the best. Yeah, except as you can see, Let me move the generation frame over there. As you can see here, we took three different paintings and combine them together in one image. I think this is fascinating because two of the images were Bango style. Another one is a painting of Monet. It is quite different, but somehow AI was able to join everything together and create, in my opinion, a masterpiece. In this video, I used images that were quite similar in a style. However, you can try something different. You can try completely different styles, completely different themes, and combine them together. And that's what Out Painting is for. It's now up to AI to think how to merge the two completely different things together. So here you go, Have fun with it and see you in the next video. 8. Prompt Examples: To conclude Dally's module, I would like to use different prompts to show what Ally is capable of. I chose realistic photo, logo, magical realism illustration, landscape and conceptual art. I think that is more or less a good representation or various or genres because our course covers a lot of AA or generating platforms. Each platform or model have its own strength and weaknesses. So the best way to show you the differences between the platforms and where each platform performs best is to use same prompts for all of them. So this is what we'll do, starting from Dali. Let's get going. Let's start with realistic photo. For this, I choose a portrait photograph of a young woman. So I wrote professional portrait photograph of a young British woman in a jacket with wavy blonde hair, beautiful symmetrical face. Cute natural make up, blurry, raining, city, street background, highly detailed Sharp Focus Depot field, and then this is aperture and the ****. In the next module, I will explain why I chose certain words and how I wrote this prompt, but for now, I just want to show how Daly will interpret this prompt. Let's go ahead and try this prompt with Daly. Let's paste our prompt and clear Generate. All right. On the first image, I think Dali did pretty well compared to everything else. Here, it has the most natural look. However, there are still some problems, some inconsistencies with eyes, maybe lips as well, but as you can see, because the prompt is longer and here I write that it's highly detailed. It would usually give better results compared to a shorter prompt for portrait. As you can see, Daly is not good at producing photo realistic images of people. It makes a lot of errors. Okay, let's strike a different prompt. Let's do a logo here. I wanted to make a logo for a bakery. I wrote line logo for a cupcake with a cherry on top, clean lines, simple shape, minimalist vector. Let's try this prompt with Daly. As you can see here, it did exactly what we told it to do. It's a line logo. Here I can see clear line, simple shape. I like the third one the most. Also you can see the name, I don't know, it came up with the name for my bakery. I like the font here as well. Maybe I would remove the line, this line. But overall, it did whatever we wrote in the prompt and it was creative here. It doesn't look like a cupcake, but it has a cherry on top. It looks more, I don't know, like a burger with the cherry on top here, it added those lines. I think it did a pretty good job with the logo. It generated a few interesting ideas here. Let's try something different. Let's try magical realism. Magical realism is when you take something real, for example, like a dog and place it in unreal situation. For example, dog riding a bike. In this case, I decided to do three D Render or raccoon reading a book, armchair lighting from a lamp, realistic unreal engine. As you can see here, deli captured exactly what we wanted it to. I wanted a lamp con, reading a book armchair. It perfectly did the job here. However, when we zoom in, the eyes are missing here and you can see certain artifacts here and strange lines overall. It's not great because it lacks details, it's poorly rendered, but conceptually is okay. I found that if I remove this from the prompt, the lighting from a lamp, it generates better images. I think this is a little bit better. We have a raccoon in glasses reading a book. There's a little bit more attention to details, although still it's not too smooth. But compared to this one where there are no eyes at all, I would say the more simpler one is better. And one of the reasons is that here you can see raccoon is much closer to us. Here it's further away. The closer the object, the better it's rendered, the more details it will have. Just because of the way model works overall, Deli may be a good option for simple three D renders like here, but it would still require some further editing and retouching. Let's try something different. Let's try illustration. For this one, I chose children's book illustration of a girl riding a bike in summer. And here are the names of illustrators Axel Scheffler and Non Blake. Let's try that one. I think these are quite good illustrations. If we zoom in, I don't see any problems. This is something I would see in a book illustration, for example. Very simple. It follows exactly what we wrote. It's a girl riding a bike. I like those three here. This one I think the girl is missing, and nose. With some editing, I think it would still be a good image, although we would need to fix the leg here as well. But the this one and the third one, pretty legit and could be used right away for book illustration. I think Delhi did a fantastic job here. Let's do landscape for landscape, it's digital art of magnificent medieval castle between the hills and fields. Large pornographic background with dense nature and mountains, grand fortress, epic scene fantasy. Let's try it. Oh, wow. I'm very impressed with the first image here. I think this one stands out the most just because of the lighting. See how it's dark here and the light falls on the grass, this area here. And I think this makes it magical for some reason. We can see here, the image is in different styles. This looks like a oil painting. I'm not sure about this one. This one looks like pastels. This oil might be acrylic style, but as you can see, these use different brush strokes. I want to mention most AI models are better with landscapes compared to photo realistic features like photographs of people. Our face. There are certain features like eyes, they have to be the correct proportion or hands. We only have five fingers, it's not six or seven, I Sometimes we will get those things wrong, makes mistakes. However, those mistakes you wouldn't see on landscapes because, for example, the shape of the cloud is not as important as the shape of an eye. For example, if it makes a mistake, that mistake would actually look like a creative interpretation or creative element of the artwork. Mistakes here would look aesthetically pleasing compared to the postic phase features where we would immediately spot the mistake. For this prompt, Landscapes Tally performed really well. In the next prompt, we have conceptual art. Here I wanted to challenge AI. I didn't include any subject, I just include an idea, the meaning of life. And I wanted to see how AI would interpret it and put it into art. Then I just put adjectives. The whole prompt is the meaning of life, Breathtaking art. Standing high resolution, highly detailed, inspirational Eight K. Let's copy paste our prop here. Oh, wow. I was very creative with these prompts. As you can see, all these four images are in completely different or genres. I absolutely love this one. Here we have emerged of a person's face with a sky. So I would interpret it as a God figure. I'm not sure how AI even got here, but I think this has a very deep meaning. This one nurtured. This is a planet, maybe looks very futuristic. This one is a landscape. Some shapes, abstract art, fantastic. For this prompt, I'm truly amazed how creative A I was when I played with AI before. For the same prompt, it generated this wave with light, which is absolutely beautiful. Also, this brain with neurons. It's astonishing how AI takes this prompt, this idea, the meaning of life, and makes an art out of it. Because it's usually us who were taking an idea and making it into art. But now here we have AI that creates crazy cool things. And now we can look at those images for hours and try to interpret them. So for example here, the color scheme is so interesting. Now you know how to use deli and what art it's good for, it's strength and limitations. For example, I would use it for simple three D rendering. I would use it for cartoon style images. And I would use it for landscapes because I think it does a pretty good job on them. However, I wouldn't use any portraits not photo realistic. At least because they're way better platforms that can do photo realistic images, portraits of people. On that note, I will leave you with Ali and I will see you in the next module, where we will explore prompts and actually how to write them. See the. 9. New Update: Comparison Between DALL-E 2 and DALL-E 3: Hello everyone. This is an updated module in this presentation, the images you see here, they were all generated by a Gi. But this is not mid journey, this is not, Leonado, guess what? This is Daly three. And as you can see, it has improved so much from Daly two. Okay, so let's check out the differences between dally two and Dally three. Okay, with my presentation on the right, you will see images generated with Dally three. For your reference, I also included prompt. For example, here we have a prompt, Make a logo for a coffee shop with a name Espresso Club. Let's get into Dally two versus Dally three. Well, the resolution for images is way higher, Daly three generates images with the resolution of thousand 24 by thousand 24 for a square image or for landscape in portrait, it's 1792 by thousand 24 pixels. The resolution is twice as higher than it was in two. For al two it was 512 512. So you see a significant pump in resolution. Okay, next one is the improved details. Because the resolution improved, now we get more intricate details in the images. Next is the superior image quality. Dali three, I believe, had a way more training, training images and the images, it can now generate a way better quality. Especially, it's noticeable for the portraits because when I used to, there were many artifacts with the faces. Now it's not a problem. One of the big updates is the legible text, as you can see on this image, as I've asked it to be with a name Espresso club, that's how it appeared. Espresso Club. Of course, sometimes you do get mistakes, especially it sometimes would duplicate the letters that I notice. That's the most common mistake, but then you get lucky and get the name as you've requested. Finally, it can accurately depict historical figures. Let's see, here's the prompt that I gave to Daly. Three, make an image. How would a girl from the famous A girl with pearl earring by Johannes me would look like? Now here on the left we have the original painting by Johannes Mia. On the right we have the image generated by Daly. Three definitely knows how a girl with pearl earring the original painting looks like and it made a replica but this time it put in the scenario where I wanted to be in the modern times. It added those elements. As you can see, it added the Genes Jeans jacket with the metallic pattern here. If you look closely, look at the texture of the jacket, texture of the skin, how the light falls on the skin, especially like the touch for the lips, Also the texture of the scarf. Here, it resembles the scarf on the original painting. This is something that we would never get with Daly. For that, I wanted to draw the parallel between Daly two and Daly three. Let's get into that. Here on the left we have an image that we generated with Dally two and on the right we have the image for the same prompt generated by D three. The prompt was professional portrait photograph for a young British woman for Dally Tube when we were generating images. Actually here I chose the best one because most of the images had artifacts, a lot of errors for facial features. Even here we do see that eyes are not symmetrical. That's. Artifact by Daly two, whereas for Dali three, now I do not see any problems. And that's actually the first image that I got from this prompt. We also have more details. You can see the zipper on the jacket. Look at the texture of the sweater even though the background is blurry. But you definitely see details of the buildings around buses, cars, and just the street. Just a lot of more things going on for our next prompt, children's book illustration for a girl riding a bike, the same thing for Dally Three, We get more details again for Daly two, this is also one of the best images I could get for my prompt. Others had problems and artifacts like with the legs or with the bike, maybe nose, mouth, missing eyes and so on, but with Dally three, those artifacts are more rare on this image. We actually, I think we're missing girls mouth but it's not that noticeable here. Again, we have way more details, but at the same time it used this pastel color children's book style illustration. Our next prompt was Magical realism. Read, render, or Racon reading a book, arm chair, lighting from a lamp realistic and real engine. Here, again with Ali to we have some artifacts. If you look at the eyes, the ears, maybe the arm chair here, definitely a lot of room for improvement. Whereas for Dally Three, we get a completely different image. Look at the texture of the raccoons fur. It's so realistic as well as the lighting. We have this lamp as our lighting source and everything seems proportional and correct in terms of composition. The next prompt is line logo of Cpk with a chair on top, clean lines, simple shape, minimalist vector. I pretty like the results we've got with Ally two. But of course Daly three is even better because here you can actually see that the name of your company will be Legible Cupcake here. Another great thing is that sometimes when platforms are advanced and then they can create advanced images, they try to add a lot of details. Here, I ask for a line logo of Cupcake. I want the image to be a line logo. I don't want it would be a three D cupcake or a super complicated logo. I just want very simple and what I liked about Daly Three that it actually followed my prompt and gave a line logo. The next one is the landscape, We have digital art, magnificent medieval castle. On the left we have a rough sketch of the castle which has a place to be, but on the right image by Daly Three, we have a lot of details on the castle. Look at all those windows, the houses, we can even see the windows on those houses. Again, the composition is correct on the background, the objects are light on the foreground, the objects are more saturated with Daly Tu. We could only make square images, whereas with Dali three Chagpt we are able to make a landscape size image which is great for landscapes. The final prompt was, the meaning of life, breathtaking art, standing high resolution, highly detailed, inspirational eight. I liked the image that Dali to generated. Looks very interesting with neurons here, brains, and so on. But with Dali three, the image was exceptional because phoenix is a symbol of being reborn. And also the colors of the image are fantasy like pretty cool. Dali Three was released in October 2023 and it's available in being as image creator and part of being chat. It's also integrated in Cha PT and Enterprise. What are some limitations of Dali three? Well, if you want to use Dali as part of a GPT, then you need to get a subscription and upgrade to Cha GPT. If you just want to try out Dali three, you can go to being AI and try it for free. The next limitation is time to generate. I noticed that with Cha GPT it could take around 30/42 to generate, which is pretty long for only one image. With being is a bit different, it generates four images at once. I would also say it takes around 30 seconds. But there are boosts that allow to generate images a bit faster and we'll talk about that later. Another limitation are the mistakes it makes in the text. Here you can see I asked it to make a poster of Hubburn. The nice thing, it knows historical figure, it knew the deryburn. But the problem is the text here, if you look far away, you would probably see something written like Dre burn. But if you look closely, you definitely see mistakes. There is double D, double double B U R. Again, the most common mistake is that it repeats the letter if you get the name would be written correctly for image editing right now, it's only possible in GPT and how it's done. First you write a problem to generate first image, then you write a follow up prompt. For example, saying here I can say, please improve the text and hopefully it will generate the same image but with improved text. But here is the limitation because for the follow up prompt, you may get the same image but with the edit, or you can get a completely different image. Because there are no settings, there is very little control of what's done. Your best bet is to describe the way you want the image to be edited as precise as possible and hope for the best. But that sometimes doesn't work, and for that reason, using other platforms for editing may be more easier. Lastly, we have policies. There are certain images that you cannot generate. For example, no explicit content, no copyrighted material, no offensive content. It won't generate images of modern politicians, public figures, or recent artists work. 10. New Update: DALL-E 3 with Bing vs DALL-E 3 with ChatGPT: Now let's try to use Daly three. First, I want to show you Daly three in Bing. Then I will show you Dally three in Chip. The reason for that is that there are different functionalities and I just want to show you how they differ. In here is the link. This is how you can access Dally Three. If you click here, it's going to take you to Microsoft Bing Image Creator. All you need here is to login with your Microsoft account. In the first page, you will see the images that were generated with Ali Three and questions and answers. Let me tell you a little bit more about Ally Three. In being it's free every day, you get boosts that are allowed to generate images a bit faster than usual. You can also exchange your Microsoft rewards for those boosts. Let me show you here, you can see I have 15 boosts. If I go to the questions here, how do Microsoft Rewards work with Image creator here if you run out of boosts, you have the option to use Microsoft Rewards to redeem for additional boosts and enjoy faster processing times when you run out of boosts. An image creator, you'll be reminded that you have the option to redeem Microsoft rewards points for more boosts. To be honest with you, I've never tried to redeem Microsoft rewards points for additional boosts because I generally use Cha GPT with Ali. But if you use Microsoft Rewards, that's a handy feature to know here. What else you need to know about Dali? Three in being here, it generates four images at a time. It only makes square format, there is no portrait or landscape, there are no editing capabilities, and you cannot applaud a reference image. Let's try it out. For example, here we can even start with Surprise Me. It's going to generate a sample prompt for us. Here we have Boho interior design with red accents. Let's try that. Just click Create. As you can see, it used one of the boosts. It should generate an image pretty quickly. Here are the images that we got here. I do not see any popping up mistakes or artifacts unless I look closer and maybe you can spot a few. But the adherence to the prompt is phenomenal. Let me show you a few different pront, For example, I tried a futuristic sneaker, digital art three D render. Let's take a look at those images. Look at the texture of the sneaker and the lights. Definitely futuristic. Let's see the other ones. It even incorporated these dots in the material suggesting that it's a preferable material. As N advertises, The attention to detail here is astonishing. And look at all the three D design it just and if you have a product, you can use Dally three for photoshoot inspiration or for a background, for example. Yeah, here it's very simple. There are no settings. You cannot change anything. You just write a prompt. That's it. On the right here we have history, which is neat. The only thing here you can do is click Save Images Here, we can actually customize that. But that's not the editing, it's just the Microsoft designer which puts the image into a mock up, for example, if you want it framed or you want to make a post about it here. There are some templates that you can use. I'm not too familiar with this because I'm used to canvas. But yeah, this is something you can explore. Now let me show you how you can use being chat to, to generate images. Chat here. Click on Chat. Here, we can, can choose a conversational style. For example, let's use the balanced. Let's generate an image of a cat on a piano. Piano. Why not? Let's click Tab. It will generate an image. The cool thing is that the result of this generation will be seen in this image. Create a platform history. You won't lose. Okay, so here are the results. Let's actually open it in the image creator. Here, let's go to creations. And here is our cat. Let's see, we have the cat piano. This one is pretty cool, not that. Okay, this is how you can use Do three in creator or with the Bing chat. 11. New Update: DALL-E 3 with ChatGPT: Okay, let's now move on to GPT form and Dali three integration. Dali three was natively integrated into GPT GPT enterprise. If you want to use Dali three, you'll need to have a subscription, at least for GPT. What are some features because it's four. I've included some more functionalities that are possible like analyze the images. I just want to show you all the capabilities of this union between P four and Al three with the images. Okay, let's see, generate images from a text prompt. It offers three image size options. 1020, 4,024 pixels square, landscape and portrait. Unfortunately, you cannot use a custom. But at least this is an improvement. Because in Li two, it was only a square. Here. In GPT four, at least there's landscape and portrait. Unlike in being AI, which is also only a square with GPT, you can edit images with a text bond. It may not be ideal, but you can tweak the images a bit and let's say you don't like the colors but you like the composition. Then this is something that you could do with GPT. Make images based on input image. That's also a benefit of using GPT. And Ali, because in being AI, there is no feature like that. You cannot upload an image and ask to generate an image based on your input, whereas in GPT you can. I will show you how also you can analyze input images. This is useful because sometimes I want my image to be analyzed. To know what kind of prompt I can use to make an identical or similar images. Let's explore some of those functions. For example, here I have a reference image. I uploaded an image of myself and I put a prompt, Generate an image of me in a comic book style. This is the result. It produced a comic version myself. I liked how it captured the wavy hair. My green eyes, nose, the shape of eyebrows, if you look at the image, is spot on as well as it captured the black suit. Well, the shirt design is a bit different. Also the background, the park, we captured all those small nuances which is pretty good from one image. In terms of replicating my face in the images, it didn't quite work. I tried some realistic examples. For example, again, I've uploaded the same image. Let me show you, here is the image I've uploaded and I asked, generate an image of me in the comic book style. Here is the result. Then I asked it generate an image of me, but if I were in 18th century, and here's the result again, It captured that the hair is curly, the eyebrows, but just the facial pictures do not resemble me. I guess it would be great. If you want to make yourself into a fictional character, like a comic character or a cartoon, then it would work better than the realistic style. Well, at least for now. And then I also tried more images. Here's the result. Again, no close to facial features. This is how you can upload your own image and just try out and create a Comic books of yourself, for example. Another reference image I gave was this tower in Estonia, In Talin, I gave the image and I wrote the prompt. Make an image similar to this one. Here's the result. I love how it captured the tower. Also, you can see on this the original image, this building with the spike. You can see that it's exactly the same one with the spike here, which is great. Let's try with a different image, applaud an image, and see what PT with Ally will generate. Okay, let's open a new chat here. You can attach a file here. I have a different image I took in Estonia here. It's going to upload the file. While it's uploading, let's write a prompt. First of all, I want to know if John PT knows where I was. Let's ask where is this place? This is the analysis of the image. First. We actually got an error here saying that it couldn't open the Hague format. Let's change our image to GPG. Okay. I converted the image into P, G. Let's use where is this place? Okay. Okay. This is the response, I'm not sure here showing the error, but the response is good. It says the image you've provided appears to be to show the viral gates which is part of the fortifications of the old town of Tin, the capital city of Estonia, which is correct. Okay. It completed the analysis part correctly. Now, let's ask to generate an image similar to this one. An image similar to this one. Of course, you can make some modifications. For example, you can ask, generate an image similar to this one, but during the nighttime or during the spring. Let's, But somewhere let's. Okay. Here we've got the image. Let's just try to compare. Here we have two towers. Well, they look on this image bit bigger than here, maybe that's the perspective. We have people market overall. Again, it generated a very similar image. Which is great because now we can give any image as reference and ask to generate or use elements of our reference image for example. Now let's move on to editing. The image I want to edit is this landscape. The prompt is a familiar one digital art of magnificent medieval castle between the hills and fields. Large panoramic background whose dense nature and mountains, grand fortress, epic scene fantasy. I want it in landscape size 1,792 by 1,024 pixels. I started with the prompt. This is the image I got in the chat. The next thing I felt missing in this image are more rose colors, like purplish, maybe like a sunset. That's exactly what I put in my chat. I asked it to make it with more rose colors. This is the result that Chan GPT gives here the castle looks quite different. In the previous image, it was in the way the fortress was round, whereas this one is more square. As well as we do have different elements such as this additional castle at the top here, but the composition is overall the same and this very similar angle, we got our rose colors. Purple, predominantly purple, pink. Okay, now I wanted a more zoomed in image of the castle. I just As this zoom into the castle. Here's the result I got. I wasn't satisfied with this at all because the image is completely different, even though it says here that here's the zoomed in view of medieval castle, focusing on its intricate details and the rose hues. Here we have monotone brown image. If you are unsatisfied with the result, you have a few options. First, you can regenerate the response, and sometimes that will give you the desired result on the second or third try, or you can change the prompt. Here we have a pretty short. If I want to keep the same image as here, I would put more precise description such as saying, keep the same image, but zoom into the castle. Something like that. Let's try some more editing. Make the castle, make a mistake here. Make the castle more magical and with beautiful nature. Here is the result. We go pretty magical colors. I like this image, but now I want something a little bit more realistic. I make it more realistic while keeping fantasy like elements. Here is the result which I liked, and here we still have magical colors. With the purple blue, we have a beautiful pink color of the castle, but it just looks more realistic as well as like this game with a little light. Here we have blue and in the front we have pink. Okay, let's move on then. I asked to make it look like Disney Castle. Make it look like Disney Castle. Here is the result. This is how you can write simple prompts to go and develop your image. Which I think is great because you start with one image and then through some prompts, you end up with completely different results. And maybe that's the direction you never thought of going. But this is where it took. You love the result, which I think is a way better process than just think of a huge prompt at the beginning. For me, especially simplifies the process of creating a way. You start with something short, like a castle, and then you add more details or you decide which direction you want to go. The next feature that we've already tried is the analyze here. I gave it a strange image, it gave me a description, and I just put that description into Dali three. This is the result it gave, which I think is pretty similar results. We still have the gage, we have the track, the background is different, but overall the feeling of the image is the same. Let me show you how you can do something like that with GPT. Again, let's create a new chat here. I will upload an interesting image and let it analyze the image. I'll make a description for the image. Okay, the text is pretty long so what I'm going to ask is make a description that I can use for a prompt for AI or generation. Okay, we still ended up with a huge text here. What I'm going to do is I'm just going to limit it to one. I'm going to just change it here. I'm going to put limit to to 30 words. Okay, So this is something we can work with. A ring master in a red jacket and black pants, tipping his top hat while aligned playfully bites his head on a circus pedestal with a blurred audience behind. Perfect to the point. And with all the information here, it gives a little bit more description about the gloves he's wearing, the color of the hat, and so on. But I think we can include that for this image. If you actually want to make a similar image, you could have just said make a similar image to my reference image, for example, and it would give you another image. But here I just wanted to test its reading image abilities, which it did successfully. Now as a separate step, I can use GPT or I can go to being AI here. I'll just put the description here and let's create. Okay, here are our results. We do have a line, we have a ringmaster with a hat. The only thing it didn't do is showing the lion playfully bites his head. Let's see the others. Well, this one is a bit better. Yeah, here's how chat TPT can analyze basically any image that you input. It will give you a description if you want, you can use that description in any other platform like mid journey or anything else that you're using. 12. New Update: Examples and Use Cases of DALL-E 3: What can you actually use dally three for? Well, there are so many use cases where dally three will be helpful. And we will explore those cases and the prompts that you can use. Dally Three is such a powerful tool in terms that it's too simple and is accessible to basically anyone. You just get your GPT subscription and you just generate an image of it, will generate basically everything that you need. That's why I think the AI tools are replacing stock images. Because before you just go and search for a stock image, you may find a free one or you would have to pay, I don't know, $510 per image or even more if you couldn't find the free one. Whereas with Dally three or mid journey, you just type a simple prompt of exactly the image you need and that's exactly the time you go and search for that image. In stock images, you get a free with no copyright image that can be useful for your business presentation or anywhere else. I think this is a very important part of Daly three here. Another reason Al three may become your favorite tool is that when you use it with GPT, you actually get the conversation, you give orders, you have conversations and you get the result. Let's move on to other use cases. So you can make logos with it. For a business, a company, you can make book covers, book illustrations, coloring books, card design, album covers. You can make website and product design. You can brainstorm and get inspirations from Daly three. Then you can also make posters, marketing materials, and many more things. The sky is the limit of what you can do with Daly three. Okay, let's try some use cases. So the first one is images for your presentation. Instead of stock images, you can generate an image for your presentation yourself. Okay. For example, here is the prompt, generate an image or realistic potograph. Then you put a specific description of what you want, like a happy person or people in the conference room for example. Then you put that I can use for my business presentation or class or education presentation and so on. For example, this image was created with the prompt, Make an image of a person jumping from happiness that I can use for my business presentation. Because I put the business presentation, it added the details with a formal clothing. Here let's see a different example in Microsoft being I generate a realistic photograph of people enjoying a meal at a cafe that I can use for my presentation. Here are the results that it gave me. I think the one is the best one here. We got two people using an ipad or taking pictures. Then this one is not bad, but it looks a bit fake in the background. So I would probably, if I were making a restaurant presentation, I'd probably use this image. And again, it took only 30 seconds to make this image and you can easily use for your presentation. Okay, the next one is the logo design. Here I would say you can start with being AI because it gives you more ideas. But in order to do editing or if you want to have this development, then you can use GPT here, the prompt that you can use such as design. And then you can list adjectives how you want to see the design of your logo. Like luxurious, simple, vector, colorful. Um, logo for the name of your business, like a cafe, pharmacy and with the name and then just put your name, an espresso club. Here we have the Prompt Design luxurious logo for Spasalon called Harmony. Here we got, we got the name correct. For the other images, the name was messed up with being I you would get four options. That's what I go for. The prompt here, you can see the options I've got. When I run it again and emphasize that I want the name harmony. I got three images out of four with the name harmony where those two spell it completely wrong, then you just need to create more or you can use GPT for GPT because it generates only one image. Well, right now I don't know if in the future you could generate more. But right now they limited to one image per response. In order to brainstorm and get more designs per image, you don't need to wait longer. You can ask it to four designs at once. Here, just make four logos for a construction company hold Skyline. From here you can choose which one you like more and you can ask to expand that design. Sometimes works, sometimes it doesn't. For example here I ask, I like the bottom right logo from the four designs we've created. Expand it and make it in two colors, green and red. Here is the result I've got. I think it captured a little bit from the fourth design, but still is completely different. Unfortunately, when you have multiple designs on the same image, it's a bit more hard to isolate one. What probably you can do is just crop that off, uploaded as the image. Ask GPT to analyze and create a similar design. That's probably the better way of doing that. Okay, a different strategy is that you start with one design. For example, design a minimalistic logo for a construction company called Sky Line. Let's say you don't like a few things. You say, use the image you generated, but use only dark gray and sky blue colors. White background here is the result, changed the background. But for some reason it didn't add the blue colors as I asked. As well as it separated the line, it made a space between sky and line, which I do not want here. Using the same logo, make small adjustments. Make sure Sky line is written together. Use two colors for the logo. Do gray and light blue, white background repeated myself. Let's see the result. This was the result. Again, I didn't take the space out, but it added another color blue. In the next iteration, I ask it keep the logo, but make an adjustment. Make the blue color more light and bright. Remove the space between sky and line. Here is the result. This one is a bit better. We got the blue as we wanted. The space is removed between sky and line. But the thing now, I don't like the second line, I ask. Okay, great. Use the same logo but make small adjustment. Remove one line above Skyline. It did remove it, but now it changed the design quite a bit and added those lines at the top. Let's say you like this design but you don't like the line, then I would recommend to go and use other editing tools that we've discussed, like Clive drop. O to remove the line, that would be much easier and faster than asking it here. Now let's move on to book cover here you can use a prompt like design a book cover for a and then put the genre of the book like Mystery historic children's book or any other novel about describe what the book is about. For example, flying cars or the girl is falling in the rabbit hole and so on. Titled, and just put the title because it could get the name of the title correct as we can see here, The Lost. So here the prompt was, design a book cover for children's book titled The Lost. Let's see some more examples. Here I put the design a book cover for a fantasy novel about a girl who lives in AA society titled The New World. We got different illustrations. Book covers. Those can be good inspirations for an actual book cover. This one is pretty cool. You can see Roberts attacking the world. This looks like an AI body with the Earth. Lots of details. Now next move on to website design. That's another very helpful use of Dally Three because before with Daly two, those things were not possible. Here you can put a prompt, something like design adjectives, colorful, modern minimalist, or landing page for a specific type of website, such as online pharmacy or any other business for website design. The prompt here, minimalistic home page for designer portfolio. I was pleasantly surprised when I saw the results for this prompt. Let me show you all the results. Here are the results first, second, third, and fourth. The reason why I was so pleasantly surprised is that my expectations for a minimalistic homepage matched with the results. This is something if I were looking for a minimalistic website, template would find, here we have pages. But what shocked me is that it actually named the elements it put logo here, web design, UX design. It clearly understood my assignment that I want the website to be a designer portfolio. It knows that he would, let's say my logos here. I would show my web design. Here, I'll show my UX design. This is brilliant. That's how easy you can use Dali three to plan and design your website. Now let's move on to posters. Advertising posters. Or any advertising material. Okay, here we have design a poster featuring. And then you can put person place product such as mountains here with specific text or emotion message. Let's say you want to design a poster that you can tell on C, then you can make it here. This is a poster that I made with GPT and Daly. Here was my prompt. Make a poster with a motivational message. With a few iterations, the text is legible. Here we have believe in yourself even got the correct text. The reason I got it here is because in my prompt, I asked to correct the text, and probably that's why I put it here. Let me show you. Here is the first image I got for the prompt. Make a posture with motivational message. Then what I ask, please improve the text. There are mistakes on the image. Yourself is written as yourself. Here is this result that I showed in the presentation, but it added this correct text, possibly, because here I ask it to improve the text. Then I asked it to remove correct text and remove the background here, is the result a completely different result. I didn't want to go that path. It's easier for me just to remove that line myself. For an advertisement poster, you can just put design and advertisement poster for a specific product such as Heads. Let's see that as well. If I go to being I, this is the prompt I use. Design a futuristic advertising poster for headset. I've got pretty cool images, very futuristic actually, this one. If you are making product photos, then this is something that you can incorporate. Maybe you can even like Photoshop out this part and add your product instead. As you can see with Dally Three, you can pretty much generate any image that you can use for work for. So for example, a school presentation for a family gathering, for example. Dally Three is a simple tool, but yet it is so powerful I encourage you to explore it and start creating. 13. New Update: DALL-E 3 New Parameter Gen_ID: Hello everyone. This is a small update on Ali. I recently discovered that there is a parameter that enables to do editing so much easier. That's the generative ID. You can actually ask when you generate an image to give you the generative ID of the image. The generative ID, let's see what it is and what are some use cases for it. Nid refers to the unique identifier assigned to an image. Each time an image is created, it's given a NID so it can be referenced in future and they identify ensures that if you want to make modifications, references then you can do so accurately. Basically, Jen ID is very similar to the Seed in stable diffusion for example, I ask GPT to generate an image of a couple sitting on a bench in a park and include NID with it. It generated me an image, it gave me a NID. It is what we know from the seed, from stable diffusion, is that if you give the same prompt and the seed number, it should generate the same image. Let's test it out. I'm putting the same prompt here and I'm giving it Jen ID. But as part of a prompt now, let's see. Now as you can see, we're getting the same image and the functionality of Jen ID is pretty much the same as the seed. That would allow us to make small adjustments to the image by referencing the ID. Let's see this example here. I say generate an image of a cartoon character in the children's book style, a girl with curly hair, the explorer, and it gives me an image. Then I ask, what's the end of the image? It gives me the end of the image. Now what I say is keep the image and I put the ID number the same, but make facial features more picture like. Now it knows that I want something in a similar, let's say style. Now it generated this, even though I was looking for more similarities between this and let's say this image, but close enough so we still see Explore Clothing. Okay, let's try one more. Now, this image has a different en ID. So I can ask what's the gen ID of this image? So it gives me a number. Now I say generate exactly the same image, but this time the girl wears a blue scarf. Here is the image. Now what I say is generate in exactly the same image. And then I reference again, not this one but this image. Now instead of blue scarf here I put a red scarf. Look what, Now we pretty much get the same image with the blue scarf, but now with the red scarf. We referenced the same ende, the same prompt here, girl wears a red scarf and girl wears a blue scarf. The end is the same. It has the same starting point and similar generation process. That's why we're getting pretty similar results. Now let me show you another cool thing you can do with end. It's called cross reference or combination. Basically what you do is you give end of one image of the second image and then you ask GPT to combine them in one image. Here, let me show you now I ask to generate an image of an adventure landscape. This is one version then here I like this image, but I think the style is a bit different to my character's style. I ask Jug Pit to generate an image of an adventure landscape with mountains in the style of and I give this look at this style I think matches a little bit better with the style of our character. I ask for end of this image if you want to get Jen ID with every image. Then you can make sure to include NID for all generated images in the future. That's just going to simplify everything. You wouldn't need to ask ID for every image it generates. Here's the prompt. Now generate an image that cross references here. I give the end of the mountain setting of this image here and of our character, this image here here. It generated two images. Usually it generates only one image. But this time it unexpectedly generated two images. My thoughts, the reason is that here I referenced two images and maybe that's why it now generated two results. That's my explanation of what's going on here. As you can see, the images are pretty similar to one another here. Slight modifications but pretty much the same composition. Now, I went on to say, make a full body shot. Here's the result I liked pretty much everything except the facial features here. And I've said improve facial features here. I referenced this image to be more similar. Here, referenced our first second image here. Here is the result. Now I ask GPT to put this image in tropical setting. Here is the result. Now you can see some resemblance of this character with this character, even though there are some small changes. Okay, another thing I wanted to experiment with is to put character in the setting that I give. I have uploaded this image and what I asked for the prompt, create an image that places the explored character here. I reference this image here with a similar attic setting as on the image I've uploaded. Hopefully it would this character to the attic setting like on the image that I've apploaded. Let's see the result. The character features here are quite similar to the reference character, but the details are quite different. I would say that this character is way younger than the reference image than this character. As you can see, it's not perfect. You would still get differences, but it's way better than just using the description and text without the gen ID. With gen ID, you can also experiment with some keywords such as combined blend, merge, because that may give you different results. And booky words like style, aesthetics, design element. Let's see. For example, I've asked GPT to generate an image of a cat in Impressionist style. Here is the image now for some reason it didn't give me a NID. I ask ask to generate an image of a dragon in a modern digital illustration style. Here is the image, I get the ID. Now I want to combine those images. I say use the aesthetics. Here is Gene D of the cat image and Gene D of the dragon image. Let's see the result. As you can see, it used the cat with the dragon as here, the colors I think are from the dragon. The style is the impressionist style of the cat. Okay, it's not really a fuse, but we see cat and dragon side by side. Again, here we have two images. Okay, now instead of use the aesthetics, I say combine elements. Here's my dragon image and cat image. Let's see the result. As you can see, the results are a bit different. Now, the dragon and the cat are side by side. Here the dragon is in its pallet, in its own original calpalate. Let's see the other one. And here we have also predictable style. Now it's time to experiment with different prompts and use cases of Jen ID. 14. Prompt Writing - Subject and Medium: Hello everyone. In this module you will learn how to write prompts. This will be applicable to all of the AI tools that we cover in the course, but especially to the stable diffusion based tools including images, Do Lexica dot Art, Dream Studio, blue willow, Leonardo Astra, and Automatic 11 11. I'm going to do the prompt writing in a tool called Automatic 11 11 which is a stable diffusion program. We will be covering this program at the very end of our course because it's a bit more advanced. But for prompt writing, I decided to use this one because it gives me greater flexibility. But again, these lessons could be applied anywhere. At this point, you don't need to follow along with what I'm doing because we will use all these concepts in the AA tools we cover next. However, if you'd like to, you can jump to automatic 11 11 section on how to set up and run Automatic 11, 11 so that you can follow along with what I'm doing. But again, that's not necessary at all. Before we begin, I just want to outline a very great resource for prompt writing. It's a stable diffusion prompt book and it's brought by open art. It's a guide into prompt writing and it goes into a lot of details as well. A gives a lot of interesting examples, artists names, and so on. Throughout my presentation, I will be referring to some of the content from here. Let's begin by writing a short prop, for example. We, we'll usually start with the subject. For example, a man. Let's click Generate. As you can see here, we have four different images of men. In this case they were pretty similar in style. However, if I do it a few more times, you'll see that it can be completely different. Here we do write any specific details, it's up to AI's interpretation. It has a lot of room for imagination. Here on some of the images, I notice that the head is cropped. To fix that, we can use the negative prompt. Negative prompt helps AI to know what we do not want to see in the image. In this case, I can put crop and crop head. These are some things that I do not want to see in the image. I don't want to see cropped image or crop head. Let's try with this negative prompt. As you can see here, we got another four images of men. The three of them I think are pretty fine. Those three, but this one is cropped just a heads up. Even though you put it in the negative prompt. Sometimes I would not quite do exactly what we write at the prompt. The only way to fix it is just generate lots of, lots of images. And another way is to actually write a more longer prompt, which we will do. We started with the subject. Subject can include like people, person, man, woman. It can include animal. It can be some object glass of water. For example, Castle. If you want some landscape sunset. Also we have here celebrity names. For example, I can put M. Watson and generate a few images here in the negative prompt. I would also like to put not say for work and naked, so we don't get any naked images. Here we have it. We have four images of Emma Watson. You might be wondering how does AI know about Emma Watson? Well, A I was trained on a large data set of images that were publicly available. As you know, there are tons of images of celebrities available online. Ai knows most of the celebrities pretty well and you can use it in your prompt. We can also try glass of water just to see the objects. As you can see here, we've got exactly what we ask for, a glass of water. As you can see, different styles in order to build on that. We can now specify medium and art style for medium. We have oil painting, watercolor, photograph, pencil drawing, airbrush, digital art, technical diagram, three D, illustration, vector, and much more. For art styles, it refers to a historical art style. We have abstract Renaissance. This would be a. A style of Leonardo da Vinci. Mona Lisa, for example. Impressionism is Bango cubism, contemporary pop art. And then we have surrealism and fantasy to add to glass of water, we can put water color color painting of glass of water. That would narrow down what style AI uses. Here we go. We have four images of glass of water in watercolor style. As you can see, it matches pretty well. And also the color scheme, for some reason is very similar to, we can also try photograph. I know from experience that if you put photograph of a subject, for example, of a woman, it would tend to be black and white just because the photograph is more predominantly with the older photographs. You can see here that all the images are black and white. And that's just because AI has association of the photograph with black and white photos. Let's say you want to make a modern photo of a woman. Then you would need to put some adjectives, for example, modern photo of a woman. Let's see if that does it, It may not work. You can see here that the photos we've got are still on the black and white scale. They don't seem to be modern at all. To solve that, I'll just add a little bit more words to the prompt and to the negative prompt. To start with the negative prompt, I will add black and white. I don't want to photos, I don't want monochromatic. Monochromatic one color which is black and white. For mode photo a woman, I would add woman in a T shirt. There are items that belong to a particular period in time. For example, T shirts, jeans, heels will be something very contemporary. When you list specific things that belong to a specific time period, you'll get images from that time period. Let's generate. You can see here that all the photos here contemporary, you can see a woman in a T shirt. That's exactly what I've said. If you want certain time period, make sure to include, um, items of clothing or accessories from that time frame. Okay, that's for medium. We can also try art style. I think style. Let's change the pro, let's try abstract drawing of a car. Here we have again four different images, the first two and the fourth one. We see clear drawings here in this prot, I actually combined the medium and the art style abstract drawing. And you can try out those things, You can combine, mix and match the art style and medium together. Let's do one more medium, Let's do a technical diagram. You can see how changing the medium affects the image. Technical diagram. Okay, let's see technical diagram. On the first and the second one, we see labeling different viewpoints of the car. That's what we would expect on a technical diagram. Let's do, let's do pop art art. Here you can see that the images are in completely distinct style compared to the previous batch of images. This one is the pop art and again, very big effect of the art style on the image. 15. Prompt Writing - Composition, Action and Details: In the last video, we've started writing prompts. And we've started with the subject. And then we went further to talk about medium and art style. And how you can use specific words like well, painting to define in what style you want your images in. And that has a very big effect on your output. Now I would like to even further add to our prompt and I would like to talk about composition. So there are two types, There's short type and point of view. For short type, it's basically where do you want to see your subject? Is it like a close up or further away? You want to see a full body of your subject for point of view, imagine a photographer takes photos of a castle, and this is basically where the photographer would take his photos. Is it from a low angle shot or it's from a drought? As you can see, these all have a big impact on the image itself. Let's try some of them. Let's do a close up portrait first. On these images, this is what we see. We exactly see just the face of a girl and maybe a little bit of shoulders. What would the close up shot be? Now let's also do a full body because this one is a bit of a tricky, the images we've got are quite, for the best one is the first one and the second, but the second one is not quite full body but the rest are cropped. So you can see this one, the head and the shoes are a little bit cropped here as well. In order to not have those cropped images, I can add specific words, definitely to see the hair. For example, I can add hair or a hat to make sure that I doesn't cut through the body. I'll add the hat here. Also, because I want a full body, I can add specific elements of the body that I want to see. For example, if I want to see legs, I can put legs or certain attributes of clothing. For example, for full body, I can also put shoes or boots. I'll put those things. I'll put a hat, and I will also put boots here. This one looks a little bit better if you want to see a person or your subject a little bit further away. It would be also nice to add a background and we will talk a little bit that, um, in our next slides I'll just put in a park and we'll see how that will change the images. I will also put a photo, full body photo of a woman had boots in a park. I think this is a little bit better in terms of composition. As you can see, the subject is placed a little bit further away and we can see boots and the head. And basically everything except for this image where AI just ignored the boots and didn't render anything here in terms of faces for AI. The further away the subject, the less space it gives to AI to properly render faces. For now, disregard the faces. Okay, here's the full body photo of a woman. Now let's try some point of view. Let's do, for example, we angle shot of a castle. Here's some images that you can expect. When you use wide angle, you'll see your subject in full. Usually, I found that wide angle view is somewhat similar to panoramic view. Now let's change this to low angle shot and see how that will make a difference. Low angle shot, let's do a photo here. Compared to the previous one. Most of the images we see the castle above. And that's what low angle shot means, that the camera, if we were talking about the photos, is located below the subject. Now let's compare that to a high angle. Here we can see the subject below. All of the images are consistent and some of them I would say are drone photographs of the castle. You can also use drone photo of castle. To finish this up, I would like to use fish eye photo of castle. Here we can see different images, but as you can see most of them are distorted. This is what we would expect from a fish eye. This one I would say fish of a ceiling in the castle. Maybe not sure. But these three I really like. Now you can use composition words to define how far or how close you want to see your subject and from what angle. Now let's go further and talk a bit about action. Action can be very important for certain images. For example, for dynamic images. Action images like soldiers attacking a castle, for example. But even simple images for a portrait, action can be important. For example, when a photographer takes photos of a model, he usually directs where she should look or how she should stand. Similarly here, you can direct to how you want your subject to be. For example, you can say, a portrait of a woman looking up. Let's try that. As you can see here, compared to the portraits we tried before, where the face is usually looking straight at you. Here, we can certainly see that the woman is looking up, her head is tilted, And this one as well, I would say the second and the third ones are my favorite. Here, you can see how you can use the action to impact the posture of your subject. This one was for looking up. We can do, for example, reading. Let's do a cartoon of a boy reading a book. Here you can see a boy doing certain action, which is reading a book. I certainly captured exactly what we asked. So that's it for the action. Now let's add more details to our subject that can be very important if you want your image very detailed. For example, if you make images of a woman, you can ask yourself questions. What does she, does she wear jeans? Or does she wear a dress? Or is it a historical persona, for example? Then you can specify certain attributes of clothing of that time. For example, a tunic or a corset, for example. You can even add more details to that clothing item. For example, what kind of dress is it? Is it a short sleeve, long sleeve, puff sleeve. Maybe it has embroidery or lace and stuff like that will add a lot of details to your image. Also the hair you can specify what hair does the person have, is a long wavy hair or maybe a short and dark hair. For jewelry, you can put, for example, pearl earrings, bracelets, necklace and so on. For shoes, you can put sneakers by flats, hiking boots, heels, and others and accessories. You can put things like a scarf, sunglasses, maybe a heat or handbag and so on. So let's try some of them. Let's do an oil painting of a woman and then I'll put dress. I'll put floral embroidery and lace. And then puff sleeves maybe also I'll put that she has pearl earrings. Let's try that. In these images we see a lot of frame. Actually, I don't like that. I'll put that in the negative prompt. I don't see any frames frame. However, as we specify that it's an oil painting, usually oil paintings come with frames and exactly what AI generated. If we zoom in a little bit, we have some maybe pearl earrings and you can see the puff sleeve stress place. Every detail that we've described in the prompt is presented here. As you can see here, you can use your prompt to specify the specific details you want to see in your image. 16. Prompt Writing - Negative Prompt, Stylizers & Modifiers: In the last two videos, we've talked about prompts and how you can use specific elements to get the desired image. We started with the subject and then we talked about medium and art style. We've talked about composition and action, as well as the details that you can add to your subject. Now I'd like to talk about negative prompt. Here I've compiled a list of words that I think would be very suitable for negative prompt. These are the things that I do not want to see in my image. For example, it is bad framing. Out of frame, bad anatomy, bad proportions, blurry crop staple diffusion makes mistakes with how many arms people have. For that not to happen, I also include extra arms, extra fingers, extra lex, and so on to make the face and hands more detailed. We can also put poorly drawn face or poorly drawn hands. Then we don't want any text signature waterworks and so on. I'll just copy this negative prompt to our program here. This will actually help to make our images look better because some of the arrows or mistakes can be avoided with negative prompts. Okay, now let's talk about background and environment. This is where you want to place your subject. For example, it can be a man in a park, or a dog in space, or something. Underwater. Underwater is a nice one. Let's try grocery store. Let's do a portrait of a woman in a grocery store. Okay, here we've got images of a woman on the background. You see the grocery store here, we have some fruits here that's captured our background. Sometimes it helps to add background, it knows exactly that this refers to the background and this is our subject. Let's move on to the next element, stylers and modifiers, stylizsre, words that modify the look and feel of the artwork. For example, lighting. There can be a big difference between a moon light versus a daylight or a studio light. Here I've included soft, diffused light, sharp street light, moonlight, cinematic studio lighting, morning sunlight, natural lighting. Here I'll put a portrait of a young woman, contemporary dress, neoclassical style, place embroidery in a magical part background. Now I will add the lighting, moon light. Let's check out these images. I think the first one is set in the dusk. This one has a little bit of sunlight, but the other two, this one and the next one, it does look like a moonlight, maybe also a little bit of a street light. Let's compare that to daylight. As you can see here, these images feel very different to our previous batch of images in the moonlight. Here we can see a nice day light, that's the effect of lighting. Next is the color scheme. This is what colors you want to see predominantly on your image. You can put motor black and white or vapor wave is a certain style vapor wave. You can have this style of colors. Also, you can specify maybe it's called warm colors, vivid colors. Pastel colors. For example, let's try pastel colors, daylight pastel colors. I think that will work very well with the style. Let's see these images here. We've got color scheme. As you can see, it's quite pale and soft, there's not much contrast. That's what the past color scheme is about. Okay, let's move on to resolution. These are actually super important. If you want a good quality image, you should always put any of that highly detailed, intricate HD R 64. It's basically just the resolution, we can put that in our image. 64 detailed, as you can see. By adding highly detailed and resolutions such as 64 K, it added a little bit more details in the images to look at the slays. Next we can put specific words to make our image more realistic. For example, keywords like Unreal Engine and octane render. You will usually see in other prompts to make images more realistic. Unreal Engine is a real time three Z creation platform, usually for games that makes images more detailed and realistic. Octane render is a rendering engine, specialized photo realistic rendering, realistic three D scenes and lighting stable diffusion knows about these keywords and will actually produce better images. You can also use hyperrealistic, ultra realistic, and photorealistic. Let's try to use Unreal Engine with our prompt Unreal engine. As you can see here, the girl stands out a little bit more here. And just the way the lighting works makes her figure looks more real. These words, the stylizers, highly detailed, unreal engine, and the lighting will all make small adjustments that overall make the image way better. Now we can go to emotions and adjectives. This is how you want your image to feel. So it can be like magical, romantic, or it can be gloomy horror epic. In my prompt here, I already have the magical background. Usually for adjectives they don't apply only to the specific or park background, they actually apply to the prompt and to the full image. If you put like gloomy background, then the overall feel of the image will be gloomy here. I wouldn't add any extra words, although I could put like fantasy and so on. Okay, that's for emotions. Next, if you're doing a photo or photo realistic image, you can add specific keywords that would apply to photography, such as aperture. You can put like 1.8 which is great for portraits. You can put ****, for example, 80 millimeters or macro. If you want an image of, I don't know, an insect for example. You can put specific camera, for example, phone camera would make a different image to a professional cannon photograph. You can also put long exposure if you want, um, night lighting effect. With that, let's put aperture in our prompt 0.8 and maybe 80 millimeters with the aperture of 1.8 and 80 millimeter ****. I would expect the background to be blurry and the person's figure, but very detailed. Let's see if this is what we've got here. Yes, I can see on the first two images that the background is blurry and we've got this intricate details of the dresses here. Next we can indicate websites. In our propped for example, we can say trending on Art station. Art Station is a platform for modern illustration. At the time of training, standard diffusion would know what was trending on that. Platform sieve is a Japanese anime style, this is the Platform Pit Net. If you're creating anime style images, then you can put trending on Psi. Then you can also include like Instagram and so on That will give the photo more modern feel. We can trending on Instagram here you can see that the posture just feels a little bit more modern, something that you would maybe see on Instagram. I highly recommend using stylizers and modifiers in your prompt, especially the ones like resolution and realistic keywords, because they actually make your image look way better. 17. Prompt Writing - Artists: So far for prompt writing, we've covered the subject medium and art style composition, action subject details, negative prompt background, and we've finished with stylizers and modifiers. The final element for prompt writing that I'd like to talk about is artists. Artists may have a very strong influence over the image because the influence in what style you will get your image. For example, here I've listed three artists, Alphonse Mucha. If you add that to your prompt, you will likely get a two D illustration for Frida Lo, you would get a mix of surrealism, symbolism, and modern art. For Ing, it will be an impressionist style we can try with our prompt here, I will first remove something that will conflict with the test. For example, trending on Instagram. I'll remove that. I will remove the photograph, the aperture, because that's for a photograph and I'll remove a real engine. I will keep highly detailed and I will remove as colors because I want the color scheme to be influenced by the artist here. I'll put, okay, let's see the images. As I can see here, all the images repeat the style of Van Go. Even the brush strokes, this is something that Van Go used in his paintings. Except the hands here, I think. I didn't know what to do with the hands. It just used more or less photorealistic hands. The hands look out of place. Similarly here, the face and the hands are a little bit out of place. Okay, now you know that artists have a strong effect on the image. I would also like to say that you can actually add multiple artists here. You can put let's say alpha. I'm not sure what it's going to be, probably merge of styles here. We can definitely see that the background and the clothing item on all these images actually is Bangor style. However, look at her face, neck and hands. That's in my opinion, that really looks like Alphonso style. If we go and Google Alphonso, I found some images here and look at the face. In one of his illustrations, I think A, I tried to capture some of this facial expressions and details of Alphonso style. Here's how you can combine the different artists. You can even have a third or fourth artist here if you want. Another great reason for using artists in your prompt is that certain artists will help AI generate correct proportions and correct faces and hands. Ai still struggles with faces and hands. Having an illustrator like Alphonso or digital artists where in their works they have very detailed pass, having them in your prompt will actually really guide AI into producing better faces and better facial features. Okay, where can you find those artists? If we go to open art book, the one that I showed you in the first video here, they list a few artists here for different, like portrait artists, landscape artists, horror artists, anime scifi. However, this is a very small list. I found a few resources for you where you can go and look for the artists. The first one is a prompt guide. If we go here, you'll have the name of the artist. Here are some images that were generated with stable diffusion. As you can see, different artists will have different images and styles. You can go and check it out there. Also, you can search for the specific artists here. Or if you are doing it in a specific style, you can go and choose the category, for example, painters. And you'll have all the artists that are in this category. Okay? The next one is this website. I actually like this website a little bit more because here you have more artists and it's easier to see what each artist is best for. For example, if I want more detailed pass, I would probably use this artist. You can click on it. You'll have more examples here as well. You can just copy the prompt. You just click on it and the name of the artist will be copied here. You can see there variety of styles and artists. This one is also a very beautiful one. I'll copy this one and try it with my prompt. A highly detailed I will remove the Ang, I'll put Alphonso and Emily bald. I think the Alfa. And this artists will go well together because they are both illustrators. Let's see. Look at these amazing images, look how detailed all of them are. The face looks stunning, I think because we've used two illustrators. I made faces look way better than when we tried it before. We have dress and look at the park background that AI tried to implement with these artists. I think we achieved really good quality images. Okay, a third website that you can also use is the screens notion here as well. There are a bunch of artists that you can use and check them out. Okay, now you know where to go to check artists or look up for artists. 18. Prompt Sample - Portrait: Now I would like to summarize everything that we learned so far about prompt and talk more about the order. As well as actually do the prompt from scratch to finish. And show you how I work with prompt to achieve the desired outcome. Okay, the order is actually very important because AI pays more attention to the beginning of the prompt and the ending of prompt. If your prompt is very long, it may Mrs. words or concept in the middle. It will put more emphasis in the beginning and the end. If you have certain details that you want in the image, make sure you put it at the beginning or at the end. I will also show another way how to emphasize certain words to have them in the image. But for now, let's talk about what usually should go at the beginning and what usually goes at the end. At the beginning, we have the medium because that has a big influence on the artwork. Then we have our subject, action and details. This is all about the subject. What is he or she, what are they doing and details about the subject. Then we have the background and stylizers, words that describe lighting, words that improve resolution of the image, and so on. At the end, we have artists, I've made the medium and artists in the same color, because in a way, artists do influence the medium and style of the image that you will get. Now I want to show you my process for prompt writing. I'll start from scratch and we'll improve prompt until I have the desired image. First, I want to start with young woman. I want to create an image of a woman in ancient Egypt that looks like Cleopatra. We'll try to do that here. Young woman, that's my subject. Now for medium, I want air brush painting. Airbrush portrait, a portrait of young woman. Now I want to specify the details of the clothing. She will be in tunic and she'll have gold jewelry. Let's try to generate this here. We've got some images right now. They're far away from my desired outcome. I will keep going. I want to specify gold jewelry I want, and I want gold earrings. Now I'll put the background. In ancient Egypt, temple with columns. With columns background. Let's try this. These images are a little bit closer to what I'm thinking about. However, I don't like the style here. We'll keep on working what I like that now I can clearly see more Egyptian style here. That's because we have ancient Egypt here. And that's an important keyword. Yeah, now these women look like Egyptian Pharaohs. Okay, now let's add stylazersIillutlightI. Want a dramatic light, dramatic light, high contrast. I would also like to add Unreal Engine and 64 K, highly detailed. Let's see, now we're getting images that look like a three D character. It doesn't look like a photo or painting. That's, I guess the effect of Unreal Engine because that what is used for games. And here we can definitely see like a three D character that can be used for games. What I will take Unreal Engine out of here, I'll just keep the 64 K and highly detailed. I will also put here hyper realist. Hyperrealistic. Also, I want the woman in Egyptian ethnicity. I'll put airbrush portrait over young Egyptian Egyptian woman. I could also put a historical figure like Cleopatra. I could put Cleopatra. Maybe I will specify that I want tank tone tank. Okay, let's try this. Okay, we've got some interesting images here. I like the scripture background. I actually add that as well to my prompt. See, the hand is bad here. Here we have a frame. This one, I like this image, but it's off center. Let's try again, and let's add a little bit more details. Now, I'd like to add artists. I've decided to use Greg Rutkowski which is a modern illustrator. I can go to our stable diffusion che sheet and paste the name and search. Here you can see some styles that were created with Greg Rutkowski style and you can see that it's very detailed and that's what I want in my image, I will use him. By Greg Rutkowski I also want to add Alphonso much because I really like his style as well. Let's try. As I said, I liked the scriptures on the columns. I will put that somewhere here. Columns with scriptures. The images I've got here, I've got a full body shot that was cut, cropped, I've got a picture here. None of the images here I like, I think the lighting, the gold looks a little bit fake. Instead of just gold, I'll put iridescent gold, Iridescent gold jewelry. And hopefully that will make the gold color more deep. Also, as you can see, we've got a full body shot. I will do airbrush close up portrait of a young Egyptian woman. Let's try this. Here you can see that the background is plain. That's probably because our prompt God a bit longer. This ancient Egypt template with columns and scriptures background gets lost a little bit in order to emphasize that we can use parentheses. Each keyword has a weight of one. When you put parentheses, for example, I will put temp, I'll put parentheses here. And this will make the keyword weight of 1.1 It adds extra weight and gives AI a flag that make sure to include these words. I also will highlight columns with scriptures background as well. Also, I would like to add aperture focal point of 1.8 to make sure that the background will be a little bit more blurry. Also, I would like to add some emotions. I want it to be epic. I want to highlight it, There's two ways to emphasize it. So you can put parentheses. Another way which is equivalent is you can put it in parentheses and then Cullen and put 1.1 That's basically the same as just putting it in parentheses. However, now we have an option not just to do 1.1 we put a different weight. We can put 1.2 or 1.3 which we cannot do just with parentheses. Let's put epic 1.3 okay? Let's try this. I think these images came out really well. We will need to fix the eyes a little bit, but that should be fine. Yeah, overall, it looks nice and I like the neck here, very intricate details. Overall, I'm very satisfied with this one. I will run it and improve the eyes a bit after face here. The images that we get, I think this is impressive. I would probably this image, maybe I'll correct the small artifact on the head piece. But overall, it looks astonishingly good. Okay, this one is not my favorite. This looks more Indian style. Interesting. Okay. I would keep this one. I would, I would save it, but just for fun, let's try a few others. And as you can see here, we have the columns, but we don't have the scriptures. I will remove parentheses here. I'll put parentheses on the scriptures. I will put a higher weight. I'll put 1.2 tunic. I've noticed that most of them are not wearing a tunic. I will also put a little bit more weight on the tunic. Let's put 1.3 for example. Let's see these images, these are fantastic as well. I like the third one. Let's try to generate a few more. Because none of these have columns, I will put even more emphasis. Ancient Egypt temple with columns and I'll put parentheses here. Another trick is that instead of one parentheses, you can use two parentheses here that would be equivalent to, to this one parentheses, one parents. Then we put column and then 1.21 Basically it's 1.1 times 1.1 which is 1.21 Or you can use two parentheses, that would be the same weight. We will cover weights in a little bit here to make more emphasis. You can use two parentheses or even three parentheses. Yeah, that will give it more weight. Again, some stunning results here. My favorite one is the first one. We have all these details of the scriptures in the background. She has beautiful face and earrings. Well, this one looks a little bit longer. I'm not sure if there's any symbol to that, but that for sure can be corrected. I think with this prompt, we achieved a great results. So we can stop here. There are other ways you can improve the image. But this is a little bit more advanced. And we'll cover that a little bit at the end of the course when we will talk about this program. But for now, we achieved everything here. 19. Prompt Sample - Landscape: So I quickly want to show you how to create prompt for a landscape. This was for a portrait. Let's quickly do landscape. Okay, again, I start with the subject. A castle. I want a wide angle shot. Wide angle shot of castle. Now this castle, I wanted to be a mean evil castle. I'll put evil. Now I want the castle to be in a forest. I can, in a forest I can specify what is that I want in at want a color. Flowers and trees also for the medium. I wanted to be an oil painting. Oil painting. A white angle. So maybe I'll yeah, Oil painting and then I'll put white angle shot of Castle Devil in a forest. Let's put Medieval Castle. Do a little bit of rearrangement. Medieval castle in a forest. Colorful flowers and trees. Okay, let's see what it'll generate. Here we have our images. Yeah, it's beautiful. Now, I want to make this castle feel a little bit more magical. I will put adjectives magical. I will put epic also to make it more like a fantasy castle. I will make it flow in the sky. I will put the action here, Medieval castle. And I'll put action floating in the sky above clouds. Now it's not in a forest, then I'll just move in a forest. Forest will be just an element for the image, not the background for colorful flowers, trees, magical epic. Let's also add stylish, realistic, and detailed. Let's create this now. We can see some here. Maybe I tried to put some clouds. Although here it looks like F doesn't look like the castle is floating. Yeah, none of these images look like the castle is floating. What I'll do here, I will emphasize this action. It means evil castle floating in the sky of clouds. I will emphasize this by putting a double parenthesis. Also, I want to add artists. I already chose two artists. It's Adrian Everson. Let me show you who he is. Our Chet, here are some examples of his artwork. Again, very detailed. I also have a gurney. Let's add him here as well. Here is not found here. Let's try, maybe here. Okay. Now here you can see that colors are nice and soft and this is what I want to see in my images. I will use his name as well. Let's go back to our program. I will put those two artists. James Gurney. I'm Adrian Everson. Okay. Let's generate and hopefully this will work. Okay. Let's see. Wow, this looks way more magical. It still looks like a fog but now, because we added the artists, I love all those details here. Yeah, some fog and clouds and looks like the castle is in the sky. To make it even more clear to AI that I want castle floating. You know what, I will remove the white angle shot. That might confuse AI a little bit. Beneath is forest and with colorful flowers and trees. I'll put three parentheses here. Three parentheses in the sky above clouds for details. I can also put more resolution. I can put 64 K. Remember we also mentioned that we can put things that are trending on certain websites. You can put trending on Art Station. I think that may make our cast a little bit more fantasy like because Art Station is for modern illustrations. Also I can put fantasy here. Magical and fantasy epic. Let's emphasize fantasy. Okay, let's strike this out. Okay, let's see these images as you can see here in the first one first image, the castle and a little bit of forest are floating on the Broke there above the clouds. I think AI captured our prompt very well. Here again, you can see some clouds in the forest. I think this is beautiful overall. My favorite one is the first one, and this the first one something that I would keep. If for some reason you're not getting your desired output with a long prompt, try modifying it a little bit. If that still doesn't work, try just generating a lot of images because then there's high chance that one of them will be something that you're looking for on this node. I think I've covered everything that I wanted to tell you about. The only thing left for me to explain is the keyword weight about all the parentheses. Again, there are two ways you can create higher weight for your keyword. You can put the keyword in parentheses and use keyword column, then just put the number. It has to be one point, something. This increases the keyword strength. It has to be higher than 11.2 or 1.3 Sometimes you may want to decrease the strength of a keyword. For example, colorful flowers and trees. I want less of this in the image. I can put parentheses here. We'll put a column, and I'll put a number that's less than one and higher than 00 point. Let's say eight, that should give me less of the colorful flowers. To decrease the keyword strength, you would put the keyword in parentheses and you would use column and put a number that's less than one and bigger than zero. For example, 0.9 That decreases the keyword strength. It has to be less than one. Okay? Another way you can increase the keyword strength is just using parentheses. Just putting the keyword in parentheses, that will increase the keyword strength. If you put only one parentheses, it will be 1.1 If you use two, it'll be 1.21 If you use three, it's equivalent as like putting keyword column 1.33 To decrease keyword strength, we can use brackets, one bracket. The keyword in a bracket would have the weight of 0.9 The keyword with two brackets will have a weight of 0.81 and the keyword with three brackets will have the weight of 0.73 That's for keyword weight. 20. Prompt Writing Resources: Now I want to talk about resources for prompt writing. First of all, it's an open art prompt book that you've already seen. Here are a lot of examples. They explain different keywords. For example, for cameras like drone thermal camera footage, there is a great stable diffusion art guide that also has keywords here. For example, style hyperrealistic, the word hyper realistic, and here's the node. It increases details and resolution. I think this will be very useful for you if you are just starting out and unfamiliar with some of the terms here. If you want a little bit more information about prompt writing, you can use the skid here. They actually write a prompt from the scratch and would explain every step of the process as well. Next is a list of modifiers. Here, if you go to this website, it has keywords and you can see what images can be created with these keywords. For example, dim light or light diffraction, studio light, those different things here at the top, actually, right now it's lighting keywords, but now you can change, for example, to effects. Here you can check out some keywords like bulk effect or neon light effects and see what can be done with that. Filters, lenses and so on. This is a great tool to check out. Next is mid journey styles and keywords. Even though it's for me, journey, it has a keywords, for example, artists or materials. If you click on materials, there are a lot of keywords. For example, solids. It will help you to find the vocabulary or keywords for your prompt. For example, wooden or lumber, sawdust, and so on. These images are journey generated images with stability diffusion, you may not get the same images, but it's great to help you find the right keywords for your prompt. Another great resource is Prompt Hero. Here you can see a lot of beautiful images generated by others. If you go on the top here, you can see the featured images at New Top. Then you get specific platforms. For example, mid journey, you'll get the mid journey images or Dali. These are all images that were created with Dali engine or stable diffusion will be all the images created by stable diffusion. If you want to save any images that you liked, you will need to create an account with prompt hero. Then you can browse through different images and actually save them by hitting the like. That will be in your profile. In your favorites. Here are the ones that I chose and they are in my profile. I can always go to click on this image, for example, and see what prompt did the other person. So this is the prompt, this is the negative prompt. Here's the generation parameters. We will discuss this later and model used stable diffusion 1.5 for example. For stable diffusion, I like this one. See how long this prompt is. But with the information that I gave you now, you can identify why did the person chose those words and how did he or she structured the prompt? For example, where's our subject? It's gorgeous Norwegian girl. And what is the medium? Professional portrait, photograph. And then we get the details in winter clothing with long wavy blonde hair. Look, freckles, beautiful, symmetrical face, huge natural make up. These are all details of the subject. Now we get to the background, standing outside in snowy city street. And you can see here that there are parentheses, two parentheses. Now you know that this is emphasized. And here are our stylizers. Ultra realistic concept art, elegant, highly detailed, integrate sharp focus. Here's our aperture ****, medium shot, volumetric fok trending on Instagram, our websites trending on tumbler, HDR and resolution. And we've got some negative prompt here as well. So now we just can copy this prompt and try it out in our own program. So let's go and just paste the prompt here, Let's check out the images. I think the second one came out exceptionally well, and the third one as well. Even without writing prompt yourself, you can go to Prompt Hero, choose your favorite images, Copy the prompt here, then you can change the details. For example, you like you like this image, but you don't want it to be in a winter time. We can actually change winter clothing. Let's put summer clothing standing outside in snowy CT Street. We will change that just in city street. And this should change our image a lot here for gorgeous images that were completely generated by AI. And we can see beautiful blurry CT Street background, we can save the image. Now you know that you can use prompt hero to get inspired and improve your prompt writing skills by checking out prompts created by other people. Okay, there's another good resource which is Lexica Art. We will actually cover Lexica in one of our modules. I'm not going to go into too much detail here, but again, there are lots of images and you can search for prompts. If I click on this button here, you can choose the model. The Lexica aperture is native to Lexica. If you're using stable diffusion, then select stable diffusion again. If we click on the image, for example this one, I can see what prompt was used to generate this image again, I can use that in my program here and try out this prompt again. You can check out these images and learn from them. Now, I think we've covered quite a lot on prompt writing. And you should be all set to create your own prompts. And start experimenting with prompt writing, happy writing, And see you in the next module. 21. Lexica Introduction: Hello everyone. In this module, we will cover Lexica. Lexica is another image generator, but it's also an image search engine. If we go to Lexica here, you can search for images and you'll get tons of AA images that were generated by others. If we click on one of the images here, you can check out what prompt was used to generate the image, which gets quite handy. Lexica uses its own model called Lexica Aperture. Currently, there are two versions available, version two and more advanced version three. These are both fine tuned models based on stable diffusion. Lexica was founded in 2022 by Sheriff Shamim. Okay. Now let's talk about pros and cons For pros, Lexica has simple interface and it's easy to use. If we go back here, I think that Lexica has one of the best user interfaces. The first window is home, this is our search engine. And the Generate is where we write prompts and can generate our own images. Very simple. Another great feature is that it is an image search engine. So we can go and look for images, get inspired, check out prompts and so on. It produces high quality images. If we just go in the gallery and look through the images, for example, a teapot here, it looks spotless. I don't see any artifacts for portraits as well. Look at some portraits here. And also it's photo realistic images. If we compare it to some other platforms like Ali, look at these phases. If I even zoom in the lighting, the eyes, the rendering looks spotless. It's a great model for photo realistic images. It works well with basic prompt. If we compare Lexica to just like the basic stable diffusion model, to get high quality phase or high quality portrait, we would need to add lots of stylize. Also add artists that help to make the face look better. But for Lexica, if we just write like a woman or it would create beautiful portrait right away here, for example, we see a short prompt and we already have beautiful, beautiful images here. Okay. It also has three limited credits. If we go to the account here, you can see that you've got 100 images per month. You can see how many you've already used. For example, I already used seven of 100 images. Lexica also has image to image generation. Basically, you can upload your own image and Lexica can generate AA images that look similar to the image that you've uploaded. So that's a great feature as well. Another great thing is that it allows private image generation with a paid plan. If we go here, here we can I keep my images private. Images created under the start and pro plans will show up in our search engine. If you subscribe to the max plan, then all your images will be private unless you decide to share them. To make your images private, you will need to get the plan this one, Okay? What are some disadvantages for Lexica? There are only two models available. As I said, a aperture version two, and version three. I don't know if you've already noticed, but the images that Lexica creates it, distinct recipe for all because it's a fine tuned model of stable diffusion, it uses a distinct recipe. It creates images, I would say in somewhat similar style. Just look at some lighting and colors. For example, if I want, let's look at Go. Here we have the style of being. But look at the colors. These are Lexica colors. These are not being colors. The lighting and the soft, smooth ear brush like texture. That's what lexica adds to the images. If you want to create images with white artistic variation, that may be not the best platform for you. It also has limited advanced settings compared to, for example, the platform with covered images I, where you can change like seeds and you can use other models and so on. Here, if you go to January, Advanced Settings here you can choose the dimensions of the canvas, choose the model type, Lexica, aperture, version three of version two, and you can use the guidance scale, and that's it. Limited advanced settings here. Also it requires a paid plan to use images for commercial purpose. I put that as a disadvantage because usually for, for a lot of platforms, you can use images for commercial purpose right away even without a paid plan. If we go to account here, can images for commercial purposes and they reply, you can use any image you find on Lexica for personal use. For commercial use of images created with Lexica, you must have a paid plan with some restrictions on team size. If you're a team of two to five people, then you need the pro plan. Teams of five plus need the max plan. Please see our license page for more details on allowed usage. The information I have on my slides comes from Lexica Art website. However, as a disclaimer, none of the information I tell you is a legal advice. So make sure you do your own research or consult a lawyer for legal issues. That's it for the Lexica introduction. In the next video, we'll go over some functions and we'll try different prompts and generate different images. 22. Lexica Features: Now let's check out what Lexica can offer us. First, I want to start with the search engine. If you go down here, you can see a whole gallery of AI generated images if you like any of them, you can go and put this heart, let's say I like this Capybara. I'll put a light here. Every time you put a light, it will be added to your light gallery. Here are all of my images that I've liked. You save styles, if you find something inspirational, you can save this for later. If we go back to home here, let's say you're interested in a specific style, you can look this style up. For example, pop art. Here, you'll have all the images that have this keyword. You can also look for specific objects, or for example, if you're interested in some products or looking for inspirations. For example, cream here, let's put cream product here. You'll have all different cream products. For example, this one minimalistic photo of natural cream for skin care. This one looks lovely. Again, if you liked any of the designs, you can save it for later. Or if you like this style, you can click on this button. Explore this style. That will just search for images in the similar style. For example, this one is very interesting. Let's try a different one. Not sure what this is. A herbal supplement, capsules back surrounded with nature. Okay, For example, this one, let's say you like the overall look of this. You can go and open this in Editor. Now you can add some, I don't know keywords. For example here it says iphone photo of natural cosmetics, Flower serum, Camp handmade cosmetics. I will change this a little bit. I'll put just Photo natural cosmetics. Just flowers, serum, cream, liquid soap, handmade cosmetics. Let's put by, let's click Generate. Let's check some images. As you can see, overall the shape is great. However, the lid is a little bit distorted. I think the cycle one came out the best in terms of the shapes of the product. And you can see the by symbols like the flowers and the wooden ball as well. Now also if you go to my likes gallery here I have the styles that I like and I actually want to work with this image. If I open this an editor here on this image, I actually have a few options here. First, I can download it, Basic download the image. I can make variations, I can upscale it, make it bigger, or I can out paint it. If you remember from Ali, the out painting extends the image. Let's try this first. I'll click on Out Paint here. Let's check it out compared to Dali, where we actually have frames and we need to put frame where we want to make the out paint. Here, the lexica extends all the edges. It does it beautifully. You can see the pattern is extended and repeated. However, I think in Dali, actually, when we were out painting, we were able to specify what we want to out paint. We were writing a prompt, but here, there is no function to write the prompt. It just does it from the style. That's what the out paint does. Also, you cannot choose the dimension in which you want to out paint. If you had the image in a portrait format, you cannot change it to out paint in a landscape format. For example, it will be in the same format as the original image. That has some limitations with out painting here, but the feature is great. Now let's try some other problems. We can use the same prompts as we use for and for images AI. For example, for realistic port, I can use the photograph. For introduction, I said that here we can use very simple prompts for realistic photograph. I want to try that. Instead of basing this whole prompt, I would probably just put a portrait photograph of a young British woman in a jacket. Let's try this. I'll photograph of a young British woman in a jacket with wavy blond hair. I will also put the background blurry, rainy city, street background. We can also add negative prompts, but for now, let's try just this. Here we can see some gorgeous photos of, of a young woman here. Let's say you like this one. You actually can make variations of this image. You've got some small variations. As you can see, her hair are a little bit purplish. In this one you have a little bit different signs. For example, here she's wearing pretty much the same jacket, but as you can see, her hair is merged with the jacket. This is a little bit better here by using variations, you can remove certain artifacts from the image. Okay, now let's try the whole prompt and see if using a longer prompt with Lexica will make any changes. Also, let's add the negative prompt. The images we've got with this longer prompt and negative prompt, I think they came out worse than the images with the simpler prompt. Because look at these images, The background is fabulous. It's like nice, soft and the whole composition is very harmonious. And the light, everything suits very well. Here we have some artificial feeling. I would prefer this image, let's say, because I like this image a lot. I can actually go and out paint. After out painting, I added some more background, it extended. We've got a little bit more of the jacket. This side looks beautiful, but I think the signboard is bit deformed. The sign looks a bit better here. I think the building has some problems here. The umbrella is flying in the air. Creative. I like this one the most. You can download it or make variations of it. You can like it to save it in your likes gallery. Another thing you can also do, you can load prompt into editor. Basically, it will load this prompt here. We can try that load prompt into editor as you can see it out of the prompt here. You can also load image into editor that we'll load this image here. You can generate, you can change the prompt a little bit and generate images similar to this one. But I will show you how to do this a little bit later with our own images. I think this will be more fun. 23. Lexica Image Generation: Let's strike you other prompts. Here we have the logo, as you can see with Lexica, shorter prompts sometimes work even better than the longer prompts. So let's put line logo of a cupcake with chair and top. I want to make it a square, so I'll change the size and I'll click Generate. Okay, here we have some interesting ideas I think so far compared to wit images, This is in a completely different style. Definitely. It looks like a sticker here, so you can see the white outline here. But yeah, here, some images that tried to make it colorful. Did we put colorful here somewhere? No, we haven't. But it was creative here. I want to change it a little bit so we don't get it too colorful. I'll put colorful here. Colorful in the negative, prompt. In the dance settings, I can also choose like guidance scale. That's exactly the same as prompt guidance in images and basically means how close the image follows the prompt here. I wanted to follow more. I'll put maybe nine here. The maximum is 13 that we can choose. I'll put, let's say nine here. Let's generate again. Okay, here, I think it's a little bit better. I don't know how useful that would be for a logo, but it would be great for a menu. I think. An image for a menu or sticker. Here's the style, let's try a different one. Magical realism. We have the three D render of Raccoon. I will delete the Unreal engine from here. And also negative prompt, and we'll make the guidance scale back to seven, the regular one. Let's click January here. I think the raccoon is missing the bottom half. Let's check the other ones. Okay, we have this front of raccoon on the armchair and the bag is from behind. Interesting. I don't see any legs but maybe the hidden in the book. I think this one was the best one, because here at least we see some by clicks here. I'm a little bit disappointed with how it ended up. But let's say if we improve the guidance scale a little bit, maybe also nine, and see if that will help. Okay, this didn't help. It made it even worse. But yeah, it added a little bit of details. In the sofa, you see like a small patterns and so on. Let's without just con reading a book, arm chair, and lamp and see if this will work. Let's make it eight. As you can see, this is actually a little bit better. Maybe the three D render confused it but I don't know, it looked like a tail. But now as I'm looking more on it, it's not a tail. We have some artifacts, that's definitely the problem here. Maybe this model is not that well known animals, I'm not sure. Ok, let's try other ones. Illustration children's book illustration. Let's try the illustration here again. I'll take out the artists names. Girl riding a bike. Okay, let's try that. Let's move it to seven. Okay, this is way better. Legs are deformed here, but everything else looks perfect. The background is very nice as well. Here. Yeah, not bad for illustration. It did a great job. Okay, let's move on to landscape. I paste my prompt here. I wouldn't remove anything from here because I think that's pretty descriptive. I've actually seen long prompts in Lexica. I wouldn't worry too much. I'll make the dimension, maybe this one guidance scale, let's make it smaller. Ai has more room for artistic style here. We've got our magical castle here. Again, I think I did an impressive job with the castle and the landscape. The colors look astonishing. Okay. The landscape, it did. Well, let's share the conceptual art here. The meaning of life. I will keep all of this here. Maybe I'll make it. Yeah. Let's keep the same landscape format here, Guy scale. Let's keep it a little bit lower than 55 would be good. Let's generate, wow. Look at this image, this is amazing. There's some tunnel in another world. And you have a ship, a moon, and that just looks magical. You can see a small house. There's so many details. I love that. Everything else also looks pretty good. And then there's a person standing, observing the beautiful. I don't know, looks too large for anything, but maybe it's a different planet, so who knows? And here we have a rainbow and a galaxy in a wave, I'm not sure, but a beautiful merge and a person again here observing chanting scene around him. So yeah, that's some photos we've got. I'm not sure why we cannot upscale this, but I'll save it. 24. Prompt Guidance Parameter: We've tried all the prompts that I've prepared for you with Lexica Art. Now, I would like to play around more with guidance scale and show you what effect does guidance scale have on images. For that, it's best to have a prompt that has a lot of elements in it. For this one, I chose a prompt with a lot of things going on. It's a girl holding a tiny kitten. A girl holding a tiny kitten in her arms. Waits for a bus at bus station. Here we have a lot of details. We have a girl and she's holding a Keta. The background should be a bus station. Maybe we'll see bus. Let's try guidance scale of middle, maybe a seven. Let's see what it will generate. The images we've got here depict exactly what we wrote in the prompt. So we have a girl and a small kit, and she's holding the key ten. I'm not sure if it's a bus. It looks a little bit more like a train, but it could be here. It's definitely looks like a bus. The kitten is bigger than the girl. Okay, let's count. How many fingers does she have? She has 123456 fingers. That's the problem with AI. It gets the fingers wrong. Maybe we can put that in the negative front, but I'm not sure if that will help. Extra fingers here. I think this is one of my favorite one. She's holding a kitten and I don't know if I tried to make it into a jacket or something, but that just looks like a big blanket with the hand, it got it correctly here. Five fingers here. The proportions are all messed up. This one was the best one. This is what we've got with guidance scale of seven. Let's now do the maximum. Let's do 13, that's the maximum we can put here. Let's generate again on this image we've got some past station which was missing from any of the previous images we've generated here. The egg. Got the fingers right by fingers. The cat looks a little bit bag. What can you do here? The kitten is small. I love this one. Again, problems with fingers guidance scale of 13.7 The images are aligned with whatever we wrote in the prompt. However, I would say that the images with guidance scale of seven, they feel more natural compared to the ones with the guidance scale 13. I'm not sure. Here I find that we have more artifacts not just with hands, but with the jacket. And the whole composition feels a little bit more forced compared to these ones, for example. So this is something you need to be careful with. So when you increase the guidance scale, you may get more artifacts and the composition may look a little bit more forced. Because now AI tries to integrate all the details that we've included in our prompt. Including the bus station. It tried to integrate as much details as possible in one image. Let's now try the guidance scale of the low guide scale. Let's two and see how that compare to everything else. Again, we have a girl holding tiny kitten and negative prompt with extra fingers. Here we've got the images way more darker. If we compare these images to the ones we did with the guidance scale of 13 or number seven here, it feels more cheerful, bright, and even though we didn't specify any lighting here, but with the guidance scale of two, the colors feel dull and also the whole atmosphere feels gloomy. I want to emphasize this point when you increase. Guidance scale, the contrast and color saturation will increase with the higher guidance scale number. If it's a lower number, then you would get more foggy and less saturated colors as we can see in these images. Also, I want to focus your attention that in these images, in some cases, we've got exactly what we asked in the prompt. So we have a girl here, she's holding a kitten. And that does look like a bus station. However, on others here, this is not a kitten, this is some other animal. And here the background is not clear that it is a bus station here as well. When your prompt guidance is on the lower side, then some elements of your prompt may not be reflected in the images and that's something that you need to be aware of. If we go to stable diffusion guide, this is a guide to guidance scale parameter. Here we have a panda playing a guitar. Here's the guidance scale of eight and this is the image that it has. If we make it small, let's the smallest one. Guidance scale of one. Here we've got something quite random. It doesn't look like a panda anymore. But when we increase this, now our image starts to look like Panda playing a guitar. Again, look at the colors here we still have some foggy, more unsaturated colors. As we increase, we've got more contrast and more saturation going on. I think guidance scale of ten is pretty good here. Number 12 works. We get a little bit more details here now. Let's 17, 18. I would say 13-18 It feels quite similar. But then let's zoom in a little bit so you can see better at number 20. We are here. It was to move here. Now if you look, we're starting to get more artifacts. Look at the guitar, look at the eyes. So the whole image starts to look worse. As we move the guidance scale even further, you can see the image is deteriorating at the guidance scale 30. Here we have oversaturated image, the quality is very poor. You can see that the whole image is pixelated and the quality has deteriorated a lot. Here we can read that the most creative and artistic results are usually generated around a guidance skill of seven. But using a skill up to 20 still produced results with little to no artifacts here. For this image, the best guidance skill value is, in my opinion, between number 8.18 and then just the quality is getting worse. But it all depends on your image and what you're trying to achieve. If your prompt is longer with many elements at it, maybe it's worth trying a little bit higher guidance scale to make sure that the image incorporates all those elements. But sometimes it's worth trying to do a smaller guidance scale if you're doing more abstract art. So it really depends on your artistic vision here. For guidance scale, that's all that I wanted to show you. In the next video, I would like to go over how you can upload your image and how to do image to image generation with Lexica. See you in the next video. 25. Lexica Image to Image Generation: In this video, I want to finally show you how you can do image to image generations with Lexica. It's very simple. All you need to is click on this button, Upload Image Here. You can choose any image from your computer, or if you found image from Lexica, you can click on this button and click Load Image into Editor. And that will load the image here, but for now I want to use my own image. So I will click Pod Image and choose this ballerina that we used with Ali. As you remember, that was a catastrophe with Ali. Once you have the image here, you need to write a prop. Basically, you should describe your subject and what you want to generate here, I want a ballerina dancing. Instead of this white background, I want a magical forest. A ballerina dancing in a forest. Okay, let's try this. In these images, we can see that the background is more or less that we have in our source image, which is pretty much white here. It got the hands wrong. Here we are getting a few more elements in the background. Here we get butterflies and a few stories. This is a little bit better, but I want more of the force. I want to see trees, I want to see leaves and so on. This is what I'll put trees and leaves. To make sure that we do not get this white background, I will put white background here. Let's try this. Okay, this is a little bit better. We get some problems with still with the leg here, but the background is a little bit more detailed. Okay, here we've got quite good legs. Okay, to improve this. To improve the background details, I can use the guidance scale. And I can make it, instead of seven, I will make it ten. Because now it's forced to use the word. It's forced to have magical forests in the background. As you can see, for dimensions, we cannot change the dimensions. The image that will be created will be the same dimension as our original image. Okay, let's try the guidance scale of ten. As you can see, increasing the guidance scale here actually improve the background a lot. Now we have way more elements of the forests in the background. This one looks pretty. Let's try one more time and let's make it 12. I also want to put fantasy fantasy. In order to avoid poorly drawn legs. I will also add extra limbs. I'll put extra in the negative, prompt. Extra extra hands. I have white background, extra limbs, extra legs, extra hands, Poorly drawn feet, poorly drawn face. This is something that I don't want to see here. Okay, here are some images, let's check them out. The legs are a little bit better here, but we do not get the back around three legs. I think this is the best image so far in terms of the legs, the hands, and the correct facial features here, the posture is quite similar to the original image we have. Here. We have this beautiful, magical forest background. Okay. As you can see, she's standing on some, a lake that's beautiful. Now, I would like to explain a little bit, how does AI generate these images from our image? Basically, the generator doesn't use a single pixel from this image. What it does, it analyzes this image and then converts it into code. It then uses this code. Input to generate all other images, you won't be able to get exactly the same image, but you'll only get the variation. As you can see here, the posture looks quite similar, but not exactly the same one. Again, I will try to capture as much detail from your original image and integrated in the new images. But again, some details or composition may be quite different. Or it may not capture, for example, facial expression. Or just facial features may be quite different because it may not capture well the information from the source image. Now I would like to create image to image generation with my own photo. Let's try that. This is my photo myself. Here I will put, I'll describe myself. A girl with curly hair. Now I want to make images in anime style. Yeah, I'll keep this. And then maybe in the CT Street background, let's make the scale maybe ten. Let's see here, the AA actually captured my blue and white striped dress quite well. The overall posture of the girls is similar to my image here. However, none of the girls here look like me. The reason is that we've provided AI only one image for certain things like posture colors. It's certainly easier to give a description when it encoded. It's easier compared to facial features. With facial features, it needs a little bit more extra information. If it had more images of me, then it would be easier to compare and see what are my facial features. However, here, because I provide only one photo, there is not much I can expect from AI. Let's try to generate a few more and maybe I'll change the guidance scale to seven, back to seven, let's see. Okay, as you can see, it captured the wavy hair, but everything else, again, the face is very different as you can see. If you were to upload a photo of yourself, you would get a completely different person. But the posture, the colors we built, the clothing items look quite similar. I would recommend using maybe a full body images, because image to image generation does the posture quite well. And here you can experiment and try all different backgrounds. You can get really creative here compared to Dali. These are way better. That's it for Lexica. In the next module, we will cover more AI image generators. See you soon. 26. DreamStudio.ai Introduction: Hello everyone. In this module we will cover another AA image generator. But before I begin, I wanted to cover a little bit more about stable diffusion because I feel like that we didn't get a chance to properly introduce it. Stable diffusion is a deep learning text to image diffusion model, and you might be wondering who developed it. It was developed by the start up company called Stability AI in collaboration with academic researchers and nonprofit organizations. One of the collaborators is Runway ML. This is actually an AI platform right now for AI image generations, for AA editing, and for video editing and video generations. We will be covering Runway ML in our course as well. Stable diffusion was released pretty recently in August 2022. It's open source model compared to Dali and Mid Journey that have their models closed source. That means nobody can access them. Stable, stability. I actually made their model open source. That means everybody can access it and use it as they wish. It has free license for commercial and noncommercial use. Because of that, you can actually write it on your personal computer for deli and mid journey. Of course, you get some free credits. But after those free credits, if you want to generate more images, you have to pay. But here you can, you can use table diffusion as you want, and if you run it on your personal computer, you don't have to pay anything. It's free. You can generate as many images as you want. So that's the beauty of the open source model, stability. Now that you know that table diffusion was developed by Stability, I, I want to show you their website. This is stability website. And they actually have a few products here. One of them is Dream Studio. And Dream Studio is an AI image generator similar to images. However, here you actually need to pay for image generations. You may be wondering if stable diffusion is open source. Why should they pay for Dream Studio? That's basically for example, for some reason you cannot run stable diffusion on your computer. Such as if your computer has low compute power, then you can use stability is compute power to generate your images. You'll use Dream Studio in that case. They also have other great products. Clip drop, we will also cover that in our course. That's for image editing. There's also Photoshop pin and blender pin. We were not covering pins in the scores. Okay, that's for stability for Dream Studio. This is what will be covered in this module. It's an image generator, it's stable diffusion. It's a web app hosted by stability. Let's go to Dream Studio. This is basically studio interface here. Now let's talk about some advantages. They give you free limited credits, which is great. And after that, you actually need to pay based on your usage. They don't have the subscription, it's based on the usage. You can generate images in different styles. As with any stable diffusion, the images can be generated in various styles. It has advanced settings. If we go here, you can change the dimensions of the image. You can change how many image you want to see generated, as well as you give a prompt guidance. Prompt strength here is the same as prompt guidance. You can put generation steps and seed number. We'll talk about seed later as well. You can choose the model here, okay. It also has a image editor. If we go back here here, you can click on the edit and you will be able to upload your image here. And we'll do in painting and out painting as well. In terms of disadvantages, Dream Studio doesn't have user friendly interface. Actually, I was surprised because Dream Studio is a product of stability AI, which is one of the leading companies in AI and all the other products are quite good. But the Dream Studio, I found that it can be a little bit buggy and just the whole interface doesn't feel that good. Another problem is that it's beginner friendly. If you want to use some advanced settings here, you actually need to put everything here yourself. You have to know exactly all the terminology here. Compared to, for example, images I, where they have lots of images where you can choose like styles. They give you hints in terms of which prompt guidance to choose or like steps, for example. They use like draft or detailed words to help guide you. Here you have to know everything yourself. But at this stage, I think we've covered this terminology. You should be pretty good. Another thing is that it requires detailed prompts, because Dream Studio is basically stable diffusion, as we talked in our prompt writing module. In order to achieve good results, good image results, you actually need long prompt. Here you go. Great way to practice your prompt here. Another disadvantage is that it has only a few stable diffusion models here. Put SD, that's an abbreviation of stable diffusion. If we go here here we only have the three new stable diffusion model. It's stable diffusion version 2.12 0.1 768 and the better trial model of SDx L. As you remember from images I, there are a lot of stable diffusion, fine tuned model and also their previous base models like 1.5 and so on. Actually, I personally prefer working with other stable diffusion models. This is a big disadvantage for me, that here I can only choose these three. That's it for Dream Studio. In the next video, we're going to go and explore some features here and try out some prompts with this Dream Studio. See you soon. 27. DreamStudio Features and Models: Now I would like to show you Dream Studio first when you sign up. All again, this is the interface that you will see if you click on this generate pattern. Here are a few parameters that will go over. The first one is style and what style you want your image in. Here are some options. For example, anime comic book, digital art, fantasy art, Neon Punk. Then we have some isometric low poly origami line art craft, clay, cinematic D model in pixel art. If you want to generate in one of these styles, choose the style. Otherwise you can just keep the default one here. We can write a prompt here, you can randomize, it will just give you a random prompt here. In the negative prompt, you can write some negative prompt, something that you don't want to see in the image. Then you can upload an image if you want to do image to image generation similar to what we did with Lexica. Another thing here, if you go to Settings here, you can change the dimension of the image. This will be vertical, If you go here, it will be horizontal. Here you choose how many images you want to see generated every time you click Dream. By default it's four. But if you want to save some credits, you can put maybe two or even one. You can see here, if you want to generate this image that's horizontal with one image count, that's 2.6 credits. If we go to advance, here's our width and height, that's proportional to our dimensions here. Then we have prompt strength, which is the same as the prompt guidance. We have the generation steps. We also have a suit and we will talk about sit a little bit later. Okay, as you can see, as you change the dimension, the number of credits also changes. If you choose some, either horizontal or very vertical, it will be the highest number of credits. But if you choose a square, it's the cheapest number of credits for model. Let's login for model. There are three models, these are all most up to date models. Stable diffusion version 2.1 is up to date. Stable diffusion model available publicly. There is another one called SDL and that's just in a better mode. It's not public yet, so you can only try it in drip studio. This is something that I wanted to talk about here because models are updated regularly. In a few months, you'll likely see maybe some different models. That's why it's important to check out what is the model about what kind of prompt you need to use for that model and so on. So for example, for stable diffusion version 2.1 from the sources, I've read that the negative prompt for this model is super important, which may not be for some other stable diffusion model. It's always good before starting to use any model to read a little bit about the model. Here are some articles about the stable diffusion version. 0.2 0.1 by stability I, yeah, they describe a model a little bit. They say what they added, how it works, and so on. Another source that I really like for checking out the different models is the stable diffusion. And they have great guides here. For example, for 2.1 model, for example, as you can see here in Dream Studio, you have the version 2.1 and version 2.1 768. If you're wondering what is the difference, if we go here and here, it says that there are two text to image models available. 2.1 base model, which has default image size of 512, 512 pixels. Or. The 2.1 model, 768, which has the default image size of 768 by 768 pixels. The 768 model is capable of generating larger images. It's especially useful for generating larger scenes with small characters. Here's some description of these models. Now we have this SDX. Again, you can go to the source Stability AI. Let's see what they say about the model. Highlights of SD L capabilities include next level photo realism capabilities, enhanced image composition and phase generation, reach visuals and jaw dropping aesthetics, use of shorter prompts to create descriptive imagery and create a capability to produce legible text. From all of these, I would say the most important one is that this model can produce legible text because all other models were not good with text. We can actually try it out here. If we go back to Dream Studio, let's choose the SD Cel beta, Let's keep the square, and I'll choose image count. Yeah, let's do four here. Now, I'll choose a prompt that has some text in it. The style. Yeah, let's do enhance here. The first prompt we can try is a photo of a man holding a sign that says thank you and I want it highly detailed. Okay. As you can see here, the I got thank sign very well legible, no artifacts create. So let's compare that to the previous model. If we choose stable diffusion 2.1 again, let's generate it. As you can see, there is a huge difference even though it tried to write. Thank you. But all of these are just artifacts. Now we know that this model is great with text. Let's try something more difficult. Photo of a bus stop advertisement displaying a burger in a text, Hungry Close a View, highly detailed, 64 K. Let's try that. Let's do the Dl beta model. Okay? As you can see here, the first image is missing the hungry sign. However, the other three you can see here clearly, the hungry and the burger. Here are a few artifacts, but this one is one of the best ones. I would say maybe we can try one more time and see if we can generate better images. Okay, here it actually disregarded the message completely out of these images. I think this one is the best. And I would probably change it a little bit. I will change the prom strength to ten. I will also display an image of a burger and a text, Hungry. Hopefully that will help. Let's check it out here. I think only the first image depicts my prompt correctly, although I don't like that. It's black and white. Here's actually, you have an editor. You can go to Editor here. You can click Edit Image on the right hand side, you'll have the image. You'll have frames. You can add as many frames as you want basically. But for now, let's remove all the frames. Let's add a new one. Let's say I don't want it black and white. I will erase this whole advertisement here, and I want it to make colorful. Let's do that. Now, since we have this frame here, it captures it. I found a little bit hard to work with editor here because it's baggy. For example, here I cannot move the frame. Sometimes I cannot move the frame. I usually need to restart it. Let's try to restart it. When I restarted, my image disappears. Let's go back here. You can click to edit this image. Finally, we can move our frame around again, I have to erase this image. Now, I'll move my frame to the place where I want generation to happen, which is here. I will line it with my image. Now I will write a prompt. I will use the same prompt. I'll put an advertisement, tisementoardsplaying hamburger and text hungry. I'll put that highly detailed. In the negative prompt, I'll put black and white image count. Let's try, let's put prom strength at 12 to make sure it aligns with our prompt better. Let's see. Okay, not bad. Yeah, it edited the part that I've raised. It didn't add the text here, but on the next one it did put the burger and the text exactly as I wanted. This is the tool that you can use as well. The interface is not great. Just heads up. 28. DreamStudio Image Generation & Seed Parameter: Okay, let's go back to generate a few more image. S Let's see what are some other improvements of the model. If we go to Stable diffusion Art guide about the SD Excel model, here we have a person writing their own experience about this model. For example, let's go to Improvements, legible text then. It's better human anatomy, the postures, we'll try that as well as you can see the differences between a yoga practitioner here and the images with the previous stable diffusion model and more aesthetic images. Here we have a house, and here's our indoor setting, as well as the style you can see is a little bit different in the images. More accurate images. The ability to understand the prompt improves over version on E models. Here we can see or tone portrait of a woman here in the previous version we got the black and white, and in the newer version it actually used a variety of colors. Let's try something else for Dream Studio, we've got this burger and now I fought off a modern bakery with minimalistic interior design, with clean lines. History showed in glass, displayed contemporary environment, highly detailed. I want to make sure that the bakery has a sign with the text bakery displayed on the wall. Let's paste this prompt and try it out with the new version. For the negative prompt, I will put poor proportions blurry. I don't want it to be unclear. Okay, let's try that. Okay, on the first image, we do not get anything here. Looks like bakery, but there are too many artifacts. I'll try again. Every time you make a new generation, your advanced settings reset. Make sure you change the advanced settings before you make a new generation. For prom strength, I'll put 12 here. I will use Dream again. As you can see here, I still try to incorporate bakery. Here we have two Science, the second one is more legible. However, my takeaway is that this model still struggles with text, especially when it's a little bit more complicated here, because it has to be three D and it has to have the right proportions Compare to, for example, the first images that we generated where it's nice and flat here, Here it did a great job. Still needs some improvement in this area. Let's strike one more prompt. In the article that we've read, it says that this model is way better with postures and I want to check it out. I designed a prompt, a wide overhead, short yoga practitioner in a tree pose, mountain setting, a soft morning light by Thomas Moran, Highly detailed, let's try that. In the negative prompt, I'll also base my negative prompt, bad framing, out of frame, deformed, and so on. Make sure to choose the style. For example, here I wanted a little bit more photo realistic. Actually, I'll choose the cinematic in the settings. I will make the prompt strength even further. Maybe 14 generation steps. I'll keep that the same. And the newest model, let's try that. Okay here. Actually it does somehow looks like a tree pose. If you're not sure what is the yoga tree pose? Tree pose. Yoga. That's how it should look like. As you can see, the images that were generated have this exact pose. We can use the other model, the 2.1 version, and see how this model compares to the S D L one. Here with the 2.1 model, we got this image which is not bad. This one? Yeah. Here, the proportions are messed up. Yeah, I would say that the newer version has better proportions. Okay. For one last thing, I want to explain to you what is seed, so you can already start using it. A Ed is a randomly generated number assigned to an image. Every time you generate an image, it will have a different number. For example, this image here has a number. This one. This number tells AI how to generate the image. What it's great for is that if you use the same prompt and you use the same settings, the seed number, you'll get exactly the same image. What's even better is that you can make small changes to the prompt by using the same seed, you'll get almost same image with slight variations. Basically, you can make small variations to the image. That's very important for artists. Let me show you what I mean if we go here. This is, by the way, an article about the seeds, but it has great examples. I wanted to show you that as well here. This is the first prompt when the person generated this image. This was the number when they use the same prompt and settings and added this seed number. Also, they added smiling to their prompt. Here they've got the same girl, but now her mouth corners are left up. Then instead of smiling, they'd be added angry. And now you can see also basically the same girl, but now her expression seems like she's angry and here's excited. The same thing you can do with the landscape. This was the prompt of a park and they used the same seeds and they only changed the time of a year. Here's the spring, now it's summer, autumn, winter. As you can see here, the composition is exactly the same. It's just the color of the trees is different. Here's another example of Elon musk. Again, same composition. However, here they changed the medium. Now here it's by Vincent Bango, here by Pablo Picasso, Salvador Dali, and so on. This is how you can modify or improve your image. Let's try that. Let's go back to Dream Studio to try out how to see work. I've prepared another prompt, and that's a portrait of a young woman with a Asian market background. For the negative prompt, I'll add the basic negative prompt here. Now let's choose the style. I want it to be cinematic advanced. Make sure the DX model is selected. Let's make the prom strength. Let's put ten. Right now, we don't put anything for the set. We first need to generate something. Let's try that. The images I've got, I don't quite like any of them. I think some of them have artifacts and the other just the face is too dark. I will change the style from cinematic to enhance and try again. Okay, here, it's way better. It's either this image. Or this one. Okay. I'll choose this image to work with. Here you can see it's seed number. Let's now we can paste it. Let's paste our seed here. Now let me show you that you can actually generate the same image. I will make the image count to one using the same settings. We will generate our prompt with the seed number. You can see here we've got exactly the same image as before. Exactly the same. Now what I can do is to add a few modifications to the prompt. They should not be big modifications because with big modifications, the whole image will be completely different. But I want to keep my subject the same. Now I'll add smiling a portrait of a young woman smiling. Let's try that, make sure the seed is the same one. Let's try that. In this image, you can see that the composition is almost the same as the image before. And if we zoom in a little bit, so this is the smiling one, you can see her math corners are lifted up here. Yeah. Basically we have the same person here, the same subject, the same hair and background. However. And a add, try to add the hand. Not successfully, unfortunately. Okay, now we've got the smiling one. Let's try frowning. Here, you can see that we've got the same person here. Same hair, clothing item, and facial features, as well as the background is quite similar, the composition is different. This is the front view. Here we have the side view. You definitely can see different expressions. That's the beauty of using the set. Now you can use the set to generate a character and make different images of that character with different emotions or different postures. You can play around with the set. Now you know all those advanced parameters that dream studio have. Because we've already talked about the prompt guidance, the generation steps, and now we've also talked about the set should be all set and try it out. Personally, I don't usually use Dream Studio because I'm using the stable diffusion on my computer, which is free. It also gives me a little bit more freedom because I can use any model that I want. And there are more advanced settings, but Dream Studio has its own advantage, is that this newest model is the Excel is not released yet and you can only try it with Dream Studio. So here you'll be able to try any new models that stability AI is planning to release. So try it and see you in the next module. 29. BlueWillow Introduction: Hello everyone. In this module, we will talk about another AA image generator, and it's called Blue Willow. If I go to gallery here, here are some images that were generated using Blue Willow. As you can see, there are some high quality images. Okay. Blue Willow was founded by a group of AA engineers. It was launched in January 2023, and it operates on Discord. If you haven't heard about Discord, Discord is a messaging platform. I would say quite similar to telegram. For you to be able to use Blue Willow, you need to have an account with Discord. But don't worry, I'll show you how you can set it up. What is unique about blue willow is that it's an aggregator of multiple AA models, including models like stable diffusion. What it does, it picks the best model based on your propped. For example, if you write a cartoon image of a dog for example. It will choose a model that's best for cartoon style images. That's basically what it will do. So if I go to their questions and answers here, what makes it unique from other AA text to image generators? Blue Willow is like a Google flights for AI models. It enables users to find the right model depending on their goals. Unlike other AI text to image generators, Blue Willow is an aggregator of multiple AA models, including models like Stable Diffusion. Who owns the rights to images produced by Blue Willow? You own the rights to your creations. You're free to use them in your art for commercial gain. Here's some information and you can further read about blue willow here. Okay. What are some advantages for beginners? It has some free limited generations. Currently, it gives ten generations per day, but of course you can buy a subscription with higher number of generations. It's beginner friendly if you have a Discord account. Let me show you here. All you need to do here is go and click Imagine, Write your prompt, and it will generate images for you. You can upscale, create variations, and do out painting with your images. You can also do image to image generation. This is not trivial how to do it in discord, but don't worry. I'll also show you how to do that for disadvantages that it needs a Discord account. It's not like a website where you can go and try it out. No, you need a Discord account. There are limited settings here, even though it's stable diffusion. For now, Blue willow doesn't have any way where you can add the seed number or alter the prompt guidance steps, number of steps. Or choose your own model. Because it chooses the model in the settings for your image based on your prompt that's handled automatically. Also, of course, because the models they use are stable diffusion models. So detailed prompts work better with stable diffusion, and that's the blue willow. In the next video, I will show you how you can create a discord account and how you can add the blue willow there. And we'll go from there on. 30. BlueWillow Overview and Discord Setup: Here's where we left off in our previous video. Now I want to try some prompts I get. Let's try our props, the realistic photo, and so on. Now, I actually made some modifications. It's still a photo, but I've changed the British woman to Indian woman and also changed the background. I modified my prompts a little bit so we get a little bit more different spectrum of images here. Okay, let's do that here again. You need to put, you can click on the skin here, or you can type whatever you want. Imagine, and then you need to click space. Now we can it be our prompt here. Here I have the professional portrait photograph of a young Indian woman with long hair, beautiful symmetric face. Cute natural make up colorful street market background. Highly detailed sharp focus, deba field, and aperture. And okay, let's try that. Just click Enter. We will get our results promptly. Okay, let's check this out. Okay, as you can see here, the phase looks good. The eyes are not messed up, the nose is not messed up. Facial features are correct, which is great. Here we have the four images here. Let's say if you like any of them, you can upscale it. For example, the first looks good. I can go ahead and upscale U One, It's short for upscale. That refers to the first image. This is the second image, will be upscale two. This one is the third image, three, and this one is the fourth image, four. Let's upscale the first image here. Here's our image. The eyes are not that great as we look on the upscaled version. These buttons you can use if you want to out paint this image. If you want to out paint to the left, you can press this left arrow if you want it to the right, then right arrow up and then or down. If you want out painting in all the directions, you can press this button here in the bottom, we have Mogi also cross. If you don't want the image to appear here, you can click this Cross button and it will disappear. Also, as we've talked about, blue willow has feedback, don't like the image. Then you can click on the Emoji. Or if you love the image, you can also give them feedback. Actually, they also right here, rate your image. After upscaling your image, you'll see new emoji buttons that allow you to rate the image from worst to awesome. This helps us a lot in improving our trading data. You can rate your image and help Blue Willow improve here. For example, I would say that it's okay. I'll put maybe this emoji here also. I will click out painting so you can see what that does. Okay, let's go here. This is our out painted image as you can see it out painted in all the directions here. And again, you can choose the image that you like the most. Go ahead and upscale it. Okay. Now I also want to talk about parameters because as you can see, you can just put, imagine, then it's only your prompt here, where can you put negative prompt or how you can change the dimensions of the image? All of this are in parameters. If we go to blue willow dogs, they should be here are blue willow dogs. Here you can see this prompt and parameters here, all the parameters that blue willow has. For example, they have a negative command. This negative command is basically a negative prompt. Now you can imagine. Here's your prompt painting of a cute cat. Then if you want to put a negative prompt, then you put two dashes. Then no, you put anything that you do not want to see in your image. For example, here you don't want to see the three D or cartoon. Let's try this out with some simpler prompt here. For example, I have magical realism. As you remember, we've had the **** reading a bog. Now I've changed it to three D Render of a panda playing chess with a rabbit in the campy home. Dim, lighting realistic, unreal engine. If I go back and paste it in my imagine prompt again, I will paste my prompt here. Now all I need to do is to put two dashes, no, then something that I do not want to see here. This is a three D render. I do not want to see any cartoon. I also don't want to see any extra legs or extra arms. That should do now. We just click enter and see. Okay, let's see if I zoom in here. The panda has rabbit ears. For some reason, I don't like that here. This is a little bit better. Again, rabbit panda with rabbit ears. Playing chess with a human. Again, panda with rabbit ears. Okay, that's not too good. Then let's try something different. Let's add that in the negative prompt. Here I want to put, imagine the same prompt. Now I don't want cartoon extra legs, extra arms. I also don't want to see Panda with rabbit ear, rare bits ears. Panda with the rabbit ears. Okay. Hopefully that will fix it. Okay, Let's see, the first one, we got some animals. Actually it does look like a panda and a rabbit. Again, I'm not sure if this is a panda here, but on the third image, I think that looks good, actually. Again, you can upscale the image that you like or you can make variations of the image. Again, the number corresponds to the image number will be this, one will be 23.4 because I want to make variations of the image number three, then I will click on three here. Also, while it's rendering, this button is also the same cross. If you don't want to see these results, you just click on this button. Or if you want to redo this prompt, you can click on this button and that will just redo and give you more images for the same prompt. Let's see what we've got. Here are some variations of the images. Here are some different postures. Again, we are getting a little bit of rabbit ears. But I think number two, actually 12 or four works for number four. You can see that the rabbit has double ear here. Let's try to upscale it and see if that will fix it. Also, if you've noticed, there is an image of the rabbit which is quite neat. Here we have some human portraits, but here is a rabbit. Okay, let's upscale the number four. As you can see here, the upscaled version didn't remove the defect with the ear. Sometimes the upscale helps to remove certain defects here. It didn't just to let you know that whatever you'll see in the small image will be in the upscaled version, Okay, now again, you can rate this image, let's say not good because of the problems with the ear. 31. BlueWillow Image Generation Part 1: In this view, I want to introduce you to Discord. And again, it's a platform for messaging and it's widely popular for programmers and crypto community. And now it's also becoming very popular for a community. If you're wondering why Blue Willow is on Discord, they actually have an answer to that. So why does it operate on Discord? Discord is a community platform that allows members to share and discuss the images they're creating, as well as participate in contests, discussions, rewards, and events. Discord also enables blue willow to gather feedback and improve the platform quickly. We plan to lodge our service outside of Discord soon, so stay tuned. Discord is this community platform where you can do a lot of things. How does it work? Basically, you have two options here. You can download Discord on your computer, or you can use it in the browser. I will use it in the browser. I will click on Open Discord in browser Here, I'm already logged in, but I will log out. If you don't have a Discord account, then you'll need to register. You will need to click this Register button and put your e mail address, create a username and password, also the date of birth, and they also ask for phone number verification. Okay. After that you can login. So let me log in. After you login, you will have something like this on the left hand panel. I have a lot of servers here. But in order to add Blue Willow to your Discord account, all you need to do is to go to the Blue Willow website. Here they have this Join the Free Better button. Just click on it will take you to Willow Discord. And you just need to accept the invite. Then it's asking if I want to open the Discord on my computer, I'll call counsel and I'll continue on Discord. Now after this, you should see blue willow on your left panel. Here is their logo and this is what you will see. I know it looks intimidating. There are a lot of things going on, but don't worry, we will go one by one. Here is the blue willow server. Here's some information. Let's start here. Getting started. Here's the information about blue willow. You can read it in different languages if you want. The here are questions and answers rules. You can sign up to their newsletter gallery. Okay, now we're going to more interesting stuff. Here are a few chat groups where you can generate your art. If you go to any of these ones, let's say maybe number 23. It doesn't matter which one here. Now put dash and write. Imagine then space and radio prompt here, for example, a t in a box. Now you just click Enter. That's all. Now you will see your image is being generated because there are other people using the same group chat. You will also get a lot of other art. It's easy to lose your prompt. Here we go. This is our prompt, So we have a cat in the blocks. As I was saying that, it's quite easy to lose your prompt because every second someone else is using and generating their own art. For that reason, I always recommend to use the direct message in order to do that. Once you see the blue willow here, just click on it here, you have an option to add it to server here. More experience, you can do that. Otherwise, you can just message the blue willow. Let's put Hello. Now this is the direct message and now we have blue willow in our direct messages. Now I can use the same imagine prompt and I can put a cat in a park. And then I can enter here. Only my images will be visible even though the images I generate are public. However, here at, I don't lose them and it's in one place. Okay? Here. In order to get to direct messages, you just need to click on this Discord logo and you'll be brought to direct messages here. And this is the Blue Willow. Chat here. Okay. Now, I recommend going back to Blue Willow and I wanted to show a few more things that they have. They do have some support here. People can ask questions. Then we have announcements. These are announcements by Blue Willow if something is changing or for example, if prices are changing and so on. Then we have the prompt questions and answers. Which I think is great because for example, if you're looking for coloring pages, you want to create color pages. Here. Some tips and tricks. How to get what you want. For example, background removers or how to generate this one letters and images. Here people are writing the tips on how to create this image with text. For example, here you can see legible text. And then we here you can give feedback to blue willow, you can connect with others. There's also daily contest, this is the description of the contest. Then there are daily themes and you can take part of it and enter the contest in terms of this blue willow server. I think I've wet through a little bit here. Let's go back to the direct messages and let's start exploring blue willow and what kind of things does it have? Blue willow is a bot, Any interaction with the bot will require a slash. I think I previously misspoke it. It's not a dash, it's going to be a slash. Now, when we put a slash here, some commands that we can use with blue willow. The first one is, imagine this is when you want to generate art. Then we have info if we want to read more information about our own accounts. Here I have the information, my username plan I am at how many prompts remaining? I have seven remaining prompts, and this is time to reset these prompts. Then I can go to subscribe. Let's say I want to buy the subscription. Let's go ahead and subscribe. Now I can choose my subscription plan. We have this $510.20 dollar. The $5 gives me early access to version 450 prompts per day, five concurrent images and member badge. The 101 gives me 100 points per day and like member batch exclusive access to VAP contents and so on. For now I'm going to go with the $5 per month Willow. Let's go on here, you'll see a typical payment information. Just add your payment information and subscribe. Now I have subscribed to Blue Willow. And in the next video, we can go ahead and try out some images. 32. BlueWillow Image Generation Part 2: Now let's talk about a different parameter. If we go back here. You can use aspect ratio. If you want a horizontal or vertical image, you can put the and then R. Then it's either three, column two, or two column three. If it's a landscapes three column two, or if it's a portrait that it's two column three. If there are no aspect ratio, then it's going to generate a square here. Let's try with another prompt. For example, here I have anime. And it's a portrait of a skin me boy classes listening to music in the street of a rural Japanese city. Anime boy, high detailed, a sunset, relaxed pink and purple cloud stars, soft light realistic eight K. Unreal engine. I'll change the Japanese city. Okay, here, let's again put the slash, Imagine, I'll paste my prompt here and change the Japanese city here. Japanese. Okay, now we can put no limbs, extra arms. Make sure that you put the no at the very end of your prompt. Because if you put the no somewhere at the beginning, that it will treat everything as your negative prompt. Sure, it's at the end. Now we can also add another parameter. Because it's another parameter, it's fine. It's not going to treat it as the negative prompt. Here we put a R, Let's say we wanted a portrait, so we'll do two, column three, Let's generate that. As you can see here, all the images are in the portrait aspect ratio. That's what we've asked. We have this dime boy here and right now the generation uses version three. We've got some advertisement here as you can see that this is the version three. The version four is the improved version. In order to be able to use the version four, you need to be subscribed, otherwise, it will automatically use the version three. Here you can choose different models. For example, if you want to generate with the first model, you can put V and then space one. Here it will be. Imagine watercolor painting of a cat. Then at the end, you put the space one. If you want to use the second model, that it will be number two. If it's a third model, then this is a default model. You don't need to put anything. If you want to use the four version, the newest version, then you need to put the four and it's only available for subscribers. This is what we will do now. We'll go back to our discord. I will copy this whole prompt here. Just copy it here again. I'll imagine I'll paste my prompt at the end, I'll put version space, and I'll put number four here. Then I'll click Enter. As you can see, these images are way better, especially I like the number one, Even the number two, and the number four looks very realistic. Let me upscale the number one. I'll just use the one here. The color palette is amazing. I love this purple pink sky and how it matches with his hootie here. Overall, I think this is amazing if this is version four. If we compare this to the images we got with the version three, these look way more simplistic. Okay, let's try a few more images. Again. I'll imagine for the next prompt I have the landscape and this is something I took from prompt hero. I've changed the prompt a little bit here, but the images at prompt hero were super good. I want to try this out and see if it's going to be good as well with blue willow. For now I'm not going to put any negative prompt, but I will put the aspect ratio again. I'll use the portrait one and then we will try also the landscape, 223. Let's do version four. Version four, the images we've got look amazing. The rock, I don't know, in the small river looks interesting here we even have the small waterfall. My prompt was actually the Lost Valley rock arch vegetation, exotic forest and plants landscape concept art. And then there are a lot of stylists as well. Here we can zoom in and try to see which image is better. I think I would choose number two. I can either create the variations or scale. Let's upscale it. It's image number two. Okay, looks impressive. Let's try to out paint it. We can out paint the whole thing or into a specific direction. For example, let's try a specific dimension. First, I want to extend my image to the left. Let's click on the left arrow. As you can see here, we've got four different variations in the left panel. It added this little section here. It added different elements. For example, here, it actually tried to add a completely different image and it even put the line here. On the other ones, it looks more realistic. For example, number 3.4 looks very natural here. We can keep any one of them. Let's, for example do number three. Now this is the app scale version. As you can see here, we cannot actually do more out painting. We can only rate the image. I think it looks nice. So I'll put this loving mog here, since we cannot out paint the image further. Which I think is a pity because it would be nice to have the image extended to the right as well. We'll just have to leave the image as is. Okay, let's finish with our prompt. I have a logo here. I've also changed the logo a little bit. It's a tree inside, a water droplet, slick and nalmalistic logo. To graphics, we color white background, contemporary style, perfect for a modern eco friendly business. Tailed eight K. Let's imagine and I don't want to see any three D image, I'll put three D, the aspect ratio I want to square, so I'm not going to be adding anything here. I wanted to be a version four. I'll put version four. Let's see these images. I think for our prompt, it done a pretty good job because this one for logo, it's a bit more complicated compared to our cupcake with cherry Here, it's a tree inside a water droplet. As you can see, we have this tree inside Water draw pled. I like the reflection here. I think for logo, the best one would be more simple ones, probably number three. However, we would need to move this reflection down here. Definitely for logo, we would need to work more for images. We cannot just use it straight away from here, actually. Now I'm curious how. Would blue willow picture our cupcake with cherry on top? So let's try that one as well. Imagine as you can see here for the first and the second one, we didn't quite get the line logo of a cupcake. Probably Blue Willow chose the wrong model for us. It looks like cartoon style cupcake, but number 3.4 looks good to me still. I wouldn't say it's a line logo, just has too many colors. Maybe it can be used for a menu or a website, but not for a logo. Okay, we have one more prompt. That's the conceptual art, the meaning of life. Let's see how blue willow will work with that. Let's make it a landscape. I'll put A R and then three, column two. I'll also put the version four. This looks fabulous, even without upscaling. Let's zoom in. Yeah, all those details. I love the color choices here. Here we have a lot of details. Beautiful landscape here, the trees amazing here. We even have this, I don't know, a town or a futuristic city. Let's choose the best one. Oh, here, it's actually a tree house, that's fun. On the third one, we have a few people walking or moving towards the sun. Let's make the third one bigger. I'll scale it number three. Here we have this beautiful light and the reflection of it in the river. We have these people walking and look at those trees, look at all those details and lines. Beautiful. We have the mountains in the background. I really like this one. I'll save it for this prompt, the meaning of life. I'm impressed with blue willow. We can actually out paint this image even further and I think that would be interesting. Let's see here. I'm disappointed because as you can see it, it didn't continue with the element here, with the trees or with the grass. It just added some frames, even some text. This out painting wasn't successful. Okay, I've tried another one. I clicked twice on the out painting. It generated the second one again. Let's check this out. Maybe this one is better. Again, as you can see here, we've got some frame on the fore front as well. We've got the frame, the second one and the number three here. It actually added more details, Not too many, but it expanded the image. I don't think this looks natural here. If we go here, this is better. I will choose number three here. And let's expand it. Let's expand number three. Here we have it. 33. BlueWillow Image to Image Generation: This is where we left off from the last video. Now I want to show you how to do image to image generation with blue willow. Here, there's no extra button to do to image generation. You'll still have to start with the slash image here, where you write your prompt, you will add the link to your image. For example, if we just go to Google, let's search for cats images here, for example, let's choose some good image of a cat. This one is pretty here. I can copy image address. Let's make sure that it's working. I'll paste my address here. And it should lead me only to the, not to the website, but to the image. This is something that we can past. Our prompt here, can image address here. Now we can write a prompt. A small it, a small kitten for example. As you can see, it took the information from the image. If we open this image again, you can see a kitten here. Now I pay attention to details. Look at the fur coloring and eyes. If we go back, you can see that I tried to use the same colors. The eyes, you can see like green, bluish tones here. It actually used the information from the link to create these images. Now you ask me, how can I use my own images? This is super simple. Basically, you just need to convert image to a link. There are different ways you can do that. One of the ways you can just upload image here, then here you choose the image that you want to upload. For example, let's use the ballerina and just click Enter. It's going to upload the image to the Discord server. Now if we click on it here, if we click on the right button here, we can have copy image or copy image address. This is what we want. We want the image address. Let's copy image address. Let's try it out. Instead of this kitten. Let's paste our image here. As you can see, we've got address to this image now. Again, put the imagine, put our link to the image. We can put something that we want now, ballerina in a magical forest. Let's add the negative prompt. Because I don't want any extra limbs. No extra extra extra for aspect ratio. Basically, it can be either horizontal, vertical, or square. By default, it will be a square, as we can see with this kitten. It's not affected by the source image dimension. For example, for this one I think the portrait would be the best one. So I'll put the aspect ratio of two to three. I will use version four, version 3.4 They can generate any aspect ratio from the source image. However, versions 1.2 they will be the same format as the source image. Okay, lets zoom in. As you can see here, proportions are not too bad. The background needs some more emphasis. However, overall, it's fine. These shoes are not drawn properly. This prompt is a bit short. Let's make it a little bit longer and add all our stylize, Usually stable diffusion likes those words. I will, I'll just copy this whole thing and add extra information. So again, imagine, now we want to emphasize the magical forest. I'll put the parentheses here. I will also describe colorful trees, leaves, flowers, grass background. I will add styliz, highly detailed. I will also add Unreal engine. And then eight K, and then again, no extra limbs and same aspect ratio and the same version. Okay, here again we're getting this white background. I think the importance of the image is way higher here because we have this white background again. Let's try one more time and we'll put the white background in the negative. Prompt again. Let's try it, no white background. Let's see if this will fix anything here. Here as you can see, we're still getting this white greyish background. The effect of the image is way higher than of the prompt. In that case, Lexica was way better because we were able to generate the magical force background here. We can't Blue willow still needs to work on those features in the future, there will be ways that we can modify the prevalence of the image. Okay, let's do one more image and let's try my portrait. I will upload my image. This one. Let's upload it now. Let's copy the image address again. Imagine I'll put an oil painting of a young woman curly hair. And then I'll add a symmetrical face cubed make up and stylizi detailed realistic eight K. Let's try that. Let's no poorly drawn faced the aspect ratio. Let's have a portrait aspect ratio. I'll put 23. Let's do the version four. Here we have the image. Actually I forgot our link here. Let's do it again. Don't forget the link. Okay, the images we've got here are not too bad. All quite similar. It's just the facial features are a bit different. But given that we only gave Blue Willow one image, like one photo of myself, that's pretty good quality here. Especially the number four though they're not much resemblance with my face. But I think it's just because it's only one image. There's not much information to work with in this task. It did pretty good here. Okay, that's it for blue willow. I think we've covered pretty much everything here. We've started with the blue willow server, We've talked about all those different things in the left column here, and then we've talked about how you can make it work in the direct messages. We've talked about different commands, they can imagine info and subscribed. We've talked about different parameters. Parameters. You can find them in blue willow. They're pretty much quite simple. Negative command, aspect ratio and versions. Maybe there will be more parameters in the future and you'll fight it here. We've tried different art styles here. Try it out, and if you don't have a Discord account, it's worth creating one and trying it out. In the next module, we will cover another platform on Discord. And I think you should be familiar with that because it's mid journey. See you. 34. Midjourney Introduction: Hello everyone. Have you seen a viral photo of Pop Francis in a stylish, white puffy coat? Or maybe Trump being arrested or jive basis cleaning a hotel room. Well, all of these photos have two things common in them. One, they're fake photos and second of all these, all photos were generated by the same AA image generator that I'm going to talk about today. And it's Mid Journey. Okay, mid Journey. Mid Journey was developed by a company called Mid Journey, Inc, which is a San Francisco based independent research lab. It was launched in open beta mode on July 12, 2022, Not quite a long time ago, It only operates on Discord. Now you would need to have a Discord account to be able to use Mid Journey. Already, the company released multiple versions of its algorithm, and the latest version is 5.1 Mid Journey gained a large popularity, and it's already been used for magazine covers, including a famous magazine like The Economist. It's also been used for book illustrations, comics and much more. And now I want to tell you a little story. You probably know this painting, a famous painting of a girl with a pearl earring by Johannes Premier. It's located in a museum in Hague. What happened was the museum, they loaned this painting to a different museum for the time being, they decided to launch a competition to replace with other artworks painting they called this competition. There were about 3,500 submissions, there were only five winners. And imagine what one of the winners was. An image generated by Mid Journey, and it was sent by AI artist, and he submitted it with the title, A Girl with glowing earrings. Out of the 3,500 submissions, an image generated by Mid Journey was chosen. So now you would see this image in the Hague Museum. And as you can see, the image quality is incredible. And that's what makes it my favorite program platform for image generation. And I'm very enthusiastic to tell you all about journey and show you the tips and tricks, how to use it. Let's talk about why I like journey so much. Let's talk about prose. As you've seen already, Journey generates very high quality images that are also realistic. It's hard, or you can say, impossible to distinguish between a real photo and journey generated image. That's how realistic it is. Also, journey is great for beginners. You can use prompts and it will images. It's not going to have cropped images or things like that. If you use short prompt with stable diffusion, it's likely you'll have a head crop. The facial features would be all incorrect. That's why you need longer prompts. You need to add stylizi, highly detailed, all those smaller words, and maybe also add artists to make sure the facial features look good. But with mid journey, you don't need to do any of that. You can just put a girl and mid journey will generate amazing images of a girl. Of course, if you're looking for a specific images, then it's best to kind of elaborate what you're looking for. So in that case, the image that's generated by mid journey will be more aligned. With your own vision. But if you are thinking or if you're looking for some concept, just ideas you want to brainstorm, then you can just put short prompts and that will help in your brainstorming. Okay. It also has many parameters and settings. If you are advanced with mid journey, you can generate and get the results that you're looking for. Some things that you can do with journey is you can upscale images, you can create variations of images, and you can blend images together. You can also do image to image generation. What I think the advantage with mid journey is that you can use many images in your prompt. Also, you can generate images in a private mode if you have a pro plan, which I think is important for some AA artists. Another cool feature about mid journey is that you can use M in your prompt, just modes, and it will make images based on your modes. Okay, now let's talk about some limitations of mid journey or some things that journey may not have. Well, first of all, it requires a Discord account. But hopefully from the Blue Willow module, you've already got your Discord account and you're all set up. But for some people, getting a Discord account may be a challenge. Another thing is that recently the free trial was disabled. And they explained that because a lot of people abused the platform, they tried to find loopholes to generate many free images, and so they closed this free trial. Another thing that I find annoying with mid journey is that it can be quite challenging to generate consistent images or characters. What I mean by this, for example, stable diffusion. If you use a seed, then the images you get are quite consistent. Or you can also train your own models and your models. If you train models with your face or with the character that you want, then you would get consistent images with that character or your personal images. That's impossible with mid journey because you cannot train anything here. I think that's a big limitation with mid journey. Hopefully in the future, they can add this feature so you can train with your images. Of course, you can try to create consistent images and characters with mid journey, but it just takes a lot of time effort. And also you need to know all those tips and tricks, how to do that. We'll talk about that as well. There is another app called, Let me Show You Inside Face. I also want to cover this in our module, because here you can actually put your face to journey generated images. Again, you would need to use another app to be able to get consistent images of yourself, for example. Another limitation is that there is no image editing, so you cannot do in painting or out painting, which I think is a pity because sometimes when you have a nice image, maybe there are certain things like maybe hands that needs improvement. And it will be nice and easy to do within the same program. Just like in painting, however, mid journey doesn't have that. That's for mid journey. In the next videos, we will cover mid journey. Go over how to start using it and all the cool stuff with it. See you soon. 35. Midjourney Overview, Setup, and Basic Commands: In this video, we will start exploring Journey. And I will show you how to get started with Journey, how to set it up as well as I will show you basic commands that you can do with journey. So let's get started. First of all, you will need to sign up or sign in to your Discord account. So hopefully by now you already have a Discord account, so all you need to do is sign in, then go to the official Mid Journey account, Journey.com Here you'll see a button called Join the Beta. This will redirect you to Discord. Here you'll need to click this, accept invite, click this, continue to Discord. Since you're already signed into your Discord account, mid journey server will get added to your Discord account automatically. This is how it will look like. This is their logo here, you will see this is the channel of course. On the left hand panel, we have a lot of different things going on. First it's announcements here, you can check out the announcements. Then we have recent changes, for example, like the changes in prices or changes in the algorithm or maybe a new version that's coming up. It's good to be up to date with. Then status rules for example. Here are some high level guidelines and so on. Terms of service, if you want to read on that a little bit more and so on. Getting Started Guide, but this is what I will show you now. Don't worry about that too much. Okay, here's a lot of information, some support, and so on. But what you're mainly interested is this newcomer rooms. You can click on any of them. For example, new B. Here you can see different images that were generated by others. You can try to generate your own image by going to the window and writing Imagine. But probably because they ended the free trial, you won't be able to do that if you click Imagine. First of all, you probably will need to accept their terms and conditions. That's first, and then they will ask you to subscribe. Let's do that. Let's first Subscribe, and then you can Subscribe Subscribe button here and then click Enter. Okay. Here, Mid Journey will generate a personal link. We can open this page. Yes, it will open a page. Here in this window, you can see your plan if you have any. For example, I have this basic plan. I can see some of the details. For example, how many hours are included, how I've already used up information about the billing and payment. If you probably will see this as the first thing, because you don't have a plan yet, You can choose between three different plans, Basic plan, Standard Plan, and Pro Plan. For Basic Plan, you have limited generations. That's around 200 per month. Again, here for example, the Standard Plan has the generations in hours. The Basic Plan for example, the Standard Plan has 15 hours. The Basic Plans, 200 generations per month is about 3.5 hours. Just to give you the rough idea here in the basic plan, we also have general commercial terms, access to member gallery, optional credit, top ups, and three concurrent fast jobs. Okay, how is that different to standard plan? So here we have way more fast generations. We have unlimited relax generations. The difference between fast and relax generations is that the relax generations, it takes way longer to generate the image. Okay, so here it's unlimited here. Then we have the same general terms, commercial terms, and so on. In the pro plan, we have even more fast generations, 30 hours. Again, we have the unlimited relaxed generations. We have this stealth image generation. You can generate images in a private mode, which I think can be important to some people. Then we also have the 12 concurrent fast jobs. It will generate 12 images at the same time. That's basically what this means, okay? So after you choose your plan, you just click to buy the plan, for example, here I can upgrade my plan and so on. Then you just fill the payment information and pay. Okay, once you have this plan, let's go back to our Discord account. Once you have your plan, you will be able to generate images in this new B group. But what I would recommend you is, again, the same thing that we did with Blue willow is to add the journey bought to your direct messages. How to do that? Again, we just need to click on this mid journey pot. Here we can write a message to this journey bot. For example, High. This will bring me to direct messages with this mid journey bot. Since I've already been using mid journey with my direct messages, I have a lot of images here. Another way you can add mid journey bot to your messages. Direct messages is okay. Let's go back to this journey server. Here you can choose any image. All you need to do is to write. Click on it, you can add Reaction. Click a reaction here. There's a lot of different reactions you can add. But what we're interested in is this envelope mog. How you can find it, you can search for envelope here, you have all those different envelope mog. What you're interested in is this basic envelope. Just click on this. When you add the envelope mog here, add this image directly to your direct messages. As you can see, I went to my direct messages. Here it is. The image that I reacted with this envelope is now in my direct messages. In this same scenario. For example, if you like any images that were generated by others in the same way, you can save them by basically clicking this envelope, MOG. Okay, now let's try to generate something. For example, let's do slash. We can either choose slash, imagine, or we can write it ourselves. Let's write it, Imagine. Now here we need to write our prompt for jury. It can be very simple. I will start with a basic prompt. Like a girl, a girl. And let's click Enter. Now we have absolutely beautiful images of different girls. I think these few are quite dark in terms of the background they have and so on. But if you try writing the same prompt every time, you'll have different, different backgrounds, different facial expressions and so on. So in this case, journey is amazing. As you can see, none of the images have any problems with facial features. For example, or the head is cropped no compared to stable diffusion where if you have a very short prompt, it's likely to give you some problems with eyes, with other or maybe hands also, it will maybe give you the subject off center. So here you can see the girl, like on all of these images is in the center. This is how the portrait should be, but in stable diffusion, as you remember, we were getting a lot of images that were off center were cropped, maybe head was cropped and so on. Here in my journey, it doesn't have flaws with basic prompts which is incredible. Okay, so now that you have this imagined command, I want to show you a few other commands. So here's a list of basic commands. I've already showed you the subscribe command and the imagined command. Now let's check out a few others. Let's do info slash info, for example. Here you will get information about your account. You can see here I have this basic subscription here. The information about how I'm generating the images, I'm using the fast mode. It can be either fast or relaxed in some plans. Relaxed in the standard and the pro plan, the relaxed mode is unlimited. Okay? The visibility mode is public only. The pro plan allows you to do private image generations and then I have this fast time remaining lifetime usage and so on. So it's a basic command to find out about your subscription, what kind of mode you're using, and so on. And maybe when it expires and renews. All right, and this is for the slash info command. Then I want to show you the slash settings, the settings command. This is the one that allows you subsetting. This command actually allows you to change the mode or visibility mode. If you have either the standard plan or the pro plan, instead of this fast mode, you can switch it to the relaxed mode. If you just click on it, you'll be able to switch. If you have this appropriate plan for me, I'm on the basic plan. It gives me this message. Your current membership plan doesn't include relaxed mode, so I would need to upgrade my plan to be able to use this. Similarly, if you have the pro plan, you will be able to switch from public mode, private mode again. You'll be able just to click on it and it will switch again. I cannot do that because I don't have a pro plan in the settings. There are other things that you can change, we will talk about that a bit later. Here you can change the version. Currently I'm using the 5.1 version, which is the latest one. Then there's also different styles. The G is the anime model and so on. The Stylize, we will talk about that as well. Now I want to show you how you can use the help command. If we go back here and write help here, you'll have all the information to get you started with mid journey. It has great resources. For example, the first link is to Mid Journey Docs. If you open it here, and let's click on this quick start guide, here is the information how to set up your account. Similarly, here, you will find all the information about the parameters, settings, and basically everything that you need to know about mid journey. Now I want to go about to subscriptions. Subscription plans, if you are thinking which plan to buy, then you can go to the subscription plans. Here it shows you in more details what are the differences. For example, how much extra is the GPU time. For example, all of these plans, it's $4 per hour and so on. For example here, stealth mode, you can read more about the stealth mode. It's journey is an open by default community and all image generations are visible at Journey.com Including images created in private, discord, servers, direct messages, and on the Journey web app. Right now, the images that I'm generating are all public. They can be accessed by other people. If you don't want your images to be public that you need to buy pro plan and use this private mode. If we go back here here you'll find more useful information. For example, the Mid Journey app. Here you'll see the gallery, the community gallery, and images that were generated by others, for example, let's try that. This is all the images that I've generated. If you go to explore, here are the images generated by community. You can find some inspirations here. Basic commands, we've already talked about that. Imagine info and subscribe the direct messages. How you can add the journey bought to direct messages. We've talked about this envelop emoji, but you can also react with other mom. If you react with this cross, then it will cancel or delete a generation basically. For example, here I have a girl, let's say I want to remove it for some reason. I can go and right click here again. I choose Add Reaction. Okay, let's try Cross here. I don't see the Red Cross Mogi, I'll just try the X. Okay, here we go. This is the X emoji. If I click on this generation got deleted again. You can use these ones if you like. The image, you can react with the star. That's it for this video, I've showed you different commands. The subscribe command allows you to check out plant subscribe and manage your plan. Imagine we'll generate images Info. Allows you to check out your plan information. Settings allows you to configure your settings including to change the model version, Stylized Value. Here we've talked about how to change between the public to private mode and from fast to relaxed mode. Of course, if you have any more questions or need extra resources, you can enter the help to get extra information. 36. Midjourney Text to Image Generation: In this video, we will continue exploring journey. Here I want to start with basic image generation and show you some features, parameters, and also more basic commands. Okay, first of all, again, let's write imagine slash imagine. For example, here I decided to use a very simple prompt and its universe in a bottle. Here we can see four different creative decisions for universe in a bottle. They pretty much look quite similar here. Here as you can see, you have these four images, You have the upscaling, you can upscale any image. This is again similar to the blue willow here. The first image is assigned with the one, this is the second image, third and fourth here. You can upscale them in the bottom row, you can generate versions, You can generate a different version. For example, here, none of the images are quite what I want. What I'll do, I will redo this prompt again. I will click on this pattern. This allows me to change prompt if I want to. I can put a new universe in a perfume bottle. This looks more interesting, I would say number two. The perfume bottle is strange here. And the same with the number four. I'm not sure what this item is, but number 2.3 looks good to me, and I want to upscale the number three. So you can upscale number three, for example. Okay, here we go. The upscaling in this version is actually very fast. And that's because the images that were generated here with the version 5.1 they are already fully rendered. They just need to be separated. This upscaling command basically separates this image from the other ones. However, for other models, upscaling is a little bit different, and for that you need to go to the mid journey dogs. Let's go and explore that because I think this is important because models will change. But just the scale of using mid journey dogs will still be relevant if we go to do journey.com Here's the getting Started here, user guide. Here we have the scalars commands, parameters. We will talk about the commands and parameters later, but for now we're interested in the up scalars. Let's click on this here. As you can see, these are different models for version five. There images that you're getting are already the full size images. Thousand 24 by thousand 24. Version four, for example, the grid images are half of that, only 500, 1,212 The upscaling actually makes the images bigger and it can add some details. Okay, now that you know that we can go back and try something else, let's use, imagine, let's put a portrait of a ring called Tame Farmer. Let's put elderly, okay, hopefully that's correct. A portrait of wrinkled elderly Vietnamese farmer. And let's click Enter. Here we have these four slightly different photorealistic images of Vietnamese farmer and we see this typical Vietnamese rice field hat on all of them. If you don't want specific items in your prompt, then you can use the negative prompt for that. Let me show you. Let's say you don't want the hat, all you need to do is to click on this Remix button. You don't want to see, for example, this hat. All you need to do is to put no and write your negative prompt. For example you don't want to see at. Let's put hat here. Let's write it again. Here you can see that none of the images have a hat in them. And that's because we've added this no hat, which is a negative prompt. If you don't want to see some element in the image, then you should use this negative prompt. Let's say if you liked any of the images, but you want slight adjustment or little change. For example, I think number two looks very interesting. But maybe I would look at other versions, other variations. I can click on V two and that will give me different variations of this image. Let's try that. You can add some more information if you want. For example, smiling a portrait of wrinkled elderly farmer. You can add smiling for example, here as you can see, we've got slight variations. For example, on the first one the farmer wears a hat. And on the fourth one as well, I think the best one is the number two just looks more natural to me. I would say that the number two is very similar to this image, apart from the person is smiling on the other one. Again, because we made the variation of this number two, this clothing also the facial features were kept very similar in the variations that were created. You can see the clothing style, the hair style is pretty much the same. Now I would like to talk about aspect ratio here. By default we're getting this square. But what if you want a portrait or landscape? Let's say for example here I want to. Let's do variations of number two. I don't want a square, I want a portrait. What I can do is I'll put the R for portrait. It's going to be nine, column 16, or you can use some other aspect ratio. If we go back to the, it's going to be in parameters. Here we have this parameter list. Here are all the parameters that our support we've already talked about this negative prompting. No now the aspect ratio. Here you have this brief explanation. If you want to learn more about the parameter, you just need to click on it. Here you'll have a full explanation of this parameter. Again, we have different support with different versions. For example, for version five you have ratio. You can use pretty much anything that you want. For version four, you have a limitation of one by two to two by one. Again, the G five has a new ratio. Here are some examples. For example, this is four to 54 to 774. I also like the nine to 16 or 16 to nine. This is a common video format or wallpaper format. Okay, let's use the nine to 16 because I think that will look nice here. Nine to 16. And let's submit that. As you can see here, we've got distorted result. That's because in this first image it's been a square. And when we tried to use the variation, it relied on this original size here, it just squeezed it in order to not have the problem. You cannot use the variation. You'll have to write the prompt again. Let's try to write a prompt again. Again, we will to imagine, I'll copy this whole prompt In this way we should not have any problems. You can see that here, even though the aspect ratio is the same as in the previous one here, this image is distorted. But this one looks normal. For some reason, not our negative problem didn't work. Maybe we should add more weight on it. And we will talk about how you can add extra weight for your negative prompt in the future video. For now, let's just try a few more generations again. For example, here the number two is quite nice. I'm going to upscale it upscale, number two. For upscaled version, here are a few things that you can use. You can make variations of this image. Again, it should be the same aspect ratio because as we've seen here, any different aspect Ct will result in distortions here. Again, you can make variations or you can check out this image in the Journey app. If I just click here, it will open in mid Journey app. And here is my prompt. Okay, also you can put it in your favorite if you want. If you mark it with favorite. If you go to Journey app here, go to home. Here we have hot new top and favorited. The images that you've marked with favorite will be listed in the favorited and here as you can see, the image that we've liked. Okay, so just to summarize, in this video, we've talked about simple image generation. For now, we've used the simple prompt and we've covered basic parameters. We've covered the negative prompt that you can use the no if you don't want to see certain elements. Although as you can see in the last example that didn't quite work here, we'll talk about that a little bit later. How you can add extra weight to the negative prompt. We've also talked about aspect ratio. That you can change the aspect ratio of the image. For example, you can use the landscape or the portrait mode. Or if you don't add any aspect ratio, then it'll be square by default. We've also talked about different functions, for example, the upscaling functions and how you can make variations and about the remix pattern that you can generate the prompt again and maybe add some more details if you want. 37. Midjourney Image to Image Generation: In this video, we're continuing exploring journey functions. And here I want to show you how you can do image to image generations. So here, similar to the blue willow. First of all, in order to add images to your prompt, you need to get an image address, for example. Okay, let's try something. For example, imagine here you can paste the image link. If I go to let's say Google, and here I found this image, which I think is quite nice. What I'll do is right click here, You can copy image address. Make sure you don't copy link address because the link address is likely to be to the article or somewhere where this image is part of, but you actually want the image address. Let's try it. Make sure to try the image before using it. Okay, let's write this link. As you can see, this is the link to the image. Now we can use it in discord. Again, we have this, imagine now we can baste the image here. Let's write a few words or something that we want to be in the prompt, for example. So this is an image of a wolf. I'll put wolf, wolf. And I want a special effect. I will add phantasml iridescent of wolf. And let's try this here. As you can see, it actually show displayed the image from the link. Now it's generating the images here, we've got the school iridescent effect. All of these images, they resemble the photo that we've provided. All of them have a similar look to our original image. If I go back our original image, you can see this is zoomed in image of a wolf in the images that were generated. We have very similar wolf here as well. That's how you can use images in your prop. Okay, now I will show you how you can upload your own images. For example, let's upload a file. Let's try using the same plena that we've tried with other platforms. Here's the image of the ballerina. Now I just need to click Enter. Here we have it. Now let's write a prompt again. Again, you start with here, you just need to click on it first will expand the image, then right click and copy image address. Let's paste it here. Now we can describe, for example, a beautiful ballerina dancing in a magical forest. Then let's add some details, Trees and Leif again, we've got this image that we've attached in our product. Let's zoom in and see these images. I can see some problems with proportions. For example, here, I'm not sure if the leg is here, and I can see her feet in a wrong place. Here we have some problem with the arm. This one is pretty good. The second one is also not bad. Just the rotation of the arm. Actually, this is not good because look how long this arm is, and this hand is unnaturally rotated from all of them. The best one is in terms of the proportions of ballerina is the number four. And let's upscale it to check it out even further. Upscale, number four. Let's see. Yeah. As you can see, the proportions are overall correct. We have this beautiful ballerina and we actually can see. Forest in the background. As you remember, it was pretty hard to do in Lexica. You had to use different negative prompt and so on. Here, we didn't use any negative prompt, we actually did it for the first time and we got pretty good results. Then we also tried to do it in other platforms where it wasn't successful at all. And here from the first time it was a success. Now I want to show you a little t, let's say you're happy with this image, but you want to add more details or something. You can actually copy this image address and include that in your prompt. Another thing is when you write, imagine your prompt here, you're limited to only one link. You can include more than one link. For example, you can include 34 and as many as you want, basically. Let's say we've copied the image address of this. Let's paste it here. Let's say you have a different vision for the background. You can go to Google, for example. Here I have a wolf. But now you want to search for this magical forest, magical forest background. Look for the images that resonate with you. For example, this one is beautiful, maybe this one. Now we can copy this image address. Let's copy image address of this. Let's add that to our ballerina image. Address the link. Okay, now let's also add our prompt again. A beautiful ballerina dancing in the magical forest. Let's put some colors, so like her pal. Let's try that. Okay, this is something you will see if there are problems with your link. As you can see, I haven't checked the link before I've pasted it. It's always go to check the image address. Before you paste it, let's go back and copy image address. Let's try to paste it. Okay, here we have this image. Okay, we're getting this arrow. And it's possible because of the extension of that image, let's try to find something else. Or alternatively, we can save that image. If we go back here, we can save image, save image, save it on our, in our download file. Now we can upload it. Upload this image and copy image address. Hopefully that works better. As you can see, indeed it was a problem of this image when we saved it and reloaded and use this image address. Now it works. Sometimes you will have to go through this process. As you can see here, I used these two images. It used the image of our ballerina and of this magical forest. And combine it in these images. Here I find that we're not getting those magical forest results. But of course, you can work on it and try to generate more images. And when we go to the advanced parameters, we'll talk about how you can influence the image that you're generating. Now you know that you can use many different images and add it to your prompt to guide AI in. For example, what subject or what style you want to make the images. It also works if you have a specific character, for example. Now I'm not only limited to one image, I can applaud more photos of myself, for example. Let's do that. When I upload a file, here are three photos of myself with totally different background, totally different make up. This one is without make up very different images. Once we've uploaded them, I can write my prompt. Imagine I will now start adding the image addresses. I will zoom and corporate image address. First one, we have these three images, now I will just describe myself and what I want in the background. For example, a girl with curly hair, green eyes, a modern jacket in a park, for example. Let's try that here. In, none of these images captured my facial features. The hair looks quite similar. Sometimes this would work, other times you would just need to add way more images. Also here, as you can see, the images are all very different. If we actually app the very similar images, then it would capture the face a little bit better. But again, it wouldn't be 100% for that. In the next module, we will talk about how you can swap your face with the images that you get here. For now, you can play around and see how close you can get to your own phase. Let's say for example, you can try uploading more images, maybe six or even seven. Yeah, just play around with it. To summarize this video here, I've showed you how you can do image to image generation with mid journey. And basically you would need image address and you can just paste it to your prompt. You can upload more than one image address and it will try to combine different things together. Also, make sure that your prompt describes the images you're getting, something that you want. 38. Midjourney Basic Commands - Blend: In the last video, we've talked about image to image generations. Here, I would like to continue this topic and show you a different feature last time in order, for example, to create this plena in a forest. What we did, we added two images. We added a link to ballerina image, and we also added a link to the magical forest. Now I'd like to talk about a different feature. It's called blend. What it does, it basically blends two or more images together. Here we can write slash and then blend. Here we have it. All we need to do is to upload two images here, or you can drag and drop your images from your desktop directly. I'll upload them. Let's do the same ballerina and the same forest that you know. What's the difference between the regular image to image generation versus the blend function? Let's use this Bellona and the same forest and see how that compares to the regular image to image generation. For now, let's keep it simple. Just upload the two images and click Enter. Here are some images that we've got. Basically, the blending function will just blend two images together, combine certain features or lights, clothing items, and so on. Here we have this nice purplish and bluish background with trees and ballerina in the front. Okay, let's compare that to what we did here. Here we actually wrote a prop. A beautiful ballerina dancing in a magical forest. Trees, leaves, purple, pink, magical atmosphere. In the blend function, you cannot add prompt. Basically, you just rely on the images that you've uploaded. And let me journey, figure out how it combines them together. Here we've got a little different images. What's the difference between this adding the image addresses in the prompt versus blending them? One of them is that you cannot add prompt to your blend function, which is quite important. For example, sometimes you want to combine the image with the prompt. Want to emphasize the prompt, it also have image four reference. In that case, you would use this prompt image generation. If you only one blend of different images, then you should use the blend function. For blend function, there's actually a little bit of information on mid journey dogs. If we go to mid journey here it's a command command list. Then here we have blend. Here is some information. We can upload up to five images here and we can write our dimensions. We can add the dimensions as well. That's basically all the information here. Let's go back here again. Let's try a few more examples for blend function. You can combine images of different styles so you create a new style. Or you can combine a subject and a background, like we did here, for example. There's a lot of things that you can combine together and create really cool images. Let's try something here, for example. Well, I have this image of a beautiful lady that we generated from our writing prompt module. Let's try that. I have this image of a line with interesting effect. We can also use it. Let's try that. Let me show you. You could upload image three. Image four. If I click image three, I can now upload here if you change your mind, you can delete it. You can add the dimensions. For example, here we have portrait square or landscape. Let's keep it square, Let's try that. It's always fun to see how images blended together. Again, to remind you, we have this lion with cool effect. This effect now we have on the girl, which is pretty cool. Here is one of the examples that we can do with blend function. Let's try something else. Let's again use the. I want to use the same photo, but now I want to combine it with an image of a robot. I will applaud this image first and the robot second. Again, I found this on Internet for dimension. Let's use the landscape. Landscape. And let's go again, we're getting these amazing results. Here we have a highly attractive woman here. It tried to add some details maybe from the roboto. The costume looks very futuristic. Just the whole, everything looks very futuristic here. I particularly like the number four. I think the robotic thread in her hair looks amazing. I will actually upscale it. Let's upscale number four. This is just amazing. For example, if you want to make variations of this image, you can click to make variations. And let's see what it will produce. It should be highly similar to this image here. Here we've got some variations. For example, here the probot threats are beside her ear. Here, here we actually, it goes into her head. Not a fun of this one. I think this is random. Again, it's a little bit not structured here overall. The best one in my opinion, is this one. I love how it repeats this hair curve. And just overall composition is astonishing. And this is a very high quality image that you can use for presentations. For example, if you talk about some futuristic stuff, maybe you can put that in one of your presentations. Blend is a great tool to create the images. Let's try one more time. Let's do blend. Again, here I've prepared image of Emma Watson, also a cartoon image. Here I want to show you how you can combine a style with the portrait. Let's use a dimensions, let's use a portrait or square. Let's use a square because I think this is nice and square. It's always surprising how AI would combine the images together. Because for example, here we can see on the first one that it looks more realistic and maybe the eyes were taken from the cartoon. It's always unpredictable. What would you see in the final results? Just worth trying and playing around with. For example, here we didn't quite get cartoonish Watson, but instead we realistic half cartoon style of things. Let's try to generated one more time and see if anything will change or it will create similar style images. Here we've got quite similar style. If we look here, there is not much difference. The facial features will change, but overall, it's very similar style in order to emphasize certain image more. For example, if I want to emphasize the cartoon image here, we cannot write a prompt. We can just add images. In that case, my tip here to add more images of a cartoon. For example, if you want to emphasize cartoon, then add more images of cartoons. For example, here Bland. Let's again add Emma Watson here. Let's add this cartoon image here. I will add one more image. Let's add image number three, for example, this one. Let's try this again. Now here we've got a cartoon style images. If we compare this to the previous images where this is maybe a realistic half cartoonish, Somewhere in between, because we've included one more cartoon image. The cartoon images, We're driving this process now. We are getting the cartoon images. You can experiment with this plant function, you can add more images and just try it around and see how adding more affects the generated images. This is very cool because you don't know what you're going to get. It's always a mystery. For example, for prompts you have the prom to guide your image, so you kind of have an idea where it's going to end. But with blend function, you don't know how it's going to combine those things together. And sometimes it has very creative, unique solutions that can spark your imagination. Play around, experiment. Try different styles. Try combining different totally different images and see what are you're going to get. 39. Midjourney Basic Commands - Describe: In this video, I want to show you a new command. It's describe. Basically it generates prompts based on your image. Let's try that. All you need to do is put describe, you will need to upload your image for which you want to generate the prompts. For example, here I've selected a few famous arts. The first one is a painting by Ali. Let's try. This is a surrealism. Let's see if Journey knows that. Okay, here we have four different prompts here. Surrealistic, grotesque, I think all of the first prompt tells me that it's surrealism. This is very interesting. Number three is the journey that started with the death of a woman for the loss of her son. In the style of visionary surrealism, distinctive noses, surrealist influences. Okay, after we've got these four prompts, you can get the keywords, for example, surrealistic, grotesque, or illusionistic detail, and add it to your prompt. Or you can basically just copy this whole prompt, the one that you choose, and paste it in this window. Let's choose something interesting. Okay, let's try the first one, because I think the first one is the most accurate description. Let's see what journey can generate with this prompt and how close will it be to our original image. Let's copy that. Imagine, and let's paste our prompt. As you can see here, we've got some surrealistic elements here. Overall, it does feel a little bit like deli in terms of the colors and the overall composition. I think here the prompt was to the point, my only concern is that it didn't identify, that this is time. Maybe it should have added time or clock in the prompt. Okay, let's try a different image again, Let's put the describe now I have the Mona Lisa by Leonardo da Vinci. Let's see how Jeri will handle that. Okay, here Jeri spotted on that, this is Mona Lisa by Leonardo da Vinci. In all of the problems, we have the Mona Lisa by Leonardo da Vinci. Then here we have style of oil painting, the style of Leonardo da Vinci. Oil and smooth brush work. Monalisa in Italian painting, in the style of women artists. Here it added more details to it. Other artists as well. Style of Leonardo Da Vinci, Mans Marcel Ucome, classical academic painting. You can use some keywords. For example, if you want to recreate some images that look like from Monalisa, you can look at those prompts and just find the keywords that they're using. For example, the classical academic painting, oil painting and style of Leonardo Da Vinci and so on. Let's actually check out how mid journey would portray Mona Lisa. Let's choose maybe the second one, the longer prompt here. As you can see, it even gives me the aspect ratio. Let's copy that slash, imagine paste the prompt Here we've got images that are in this classical painting style, which is very close to the original style. You can use this prompt. For example, you can replace Mona Lisa with a celebrity name. And use the same prompt to generate a subject with this style. Let's try that. For example, I'll copy this prompt and I'll put imagine. Then I'll write a celebrity name, for example, Zenda. And then I'll copy this whole prompt and paste it here. Okay, I didn't change anything, I just added Zendaya. Let's see if we will see any resemblance with Zenda in the next images. In these images, we can see that the composition, lighting, and the overall classical painting style of the image is very similar to the original Mona Lisa painting. However, because we've added Zenda, it tried to combine the, her facial features with Monalisa. I think here we've got some merge, it doesn't look like either Monalisa or Zendaya. That's what we've got here. You can try to experiment for that as well. Again, let's try another image. Let's again describe as a final image. I chose the art installation by David Tuna. It's the banana with the duck tape. So let's see what journey will come up with. Okay, here we have the first one, a banana with gray tape around it. Very accurate. Okay, and in the style of Jamie, not sure who this is, symbolic object. So I think the first prompt is the simplest and most descriptive one. Okay, let's see. Other ones that we have huge and rose taped banana on a white surface in the style of Mcdonald Punk and so on. Okay, a painted banana with some tape on it in the style. And then a plaster banana with a banana tape to it. In the style of kinetic mixed media, dark gray, conceptual installations. Okay, let's try the fourth. And it has it with the grocery art. Let's very curious what it's going to be generated from the prompt. Again, imagine I'll just base the prompt. Let's check this out. As you can see in most of the images we'll, except I think the second one we've got the realistic image of a banana. It's an art installation, so here we have the wall. It does look like an exhibit. Yeah. Here are some ideas for your next art exhibition. Okay. To sum up, in this video we've talked about described command that allows you to generate prompt from your image. If you are wondering what style your image is, then you may want to use this described command to help you out with the style and also with the keywords. Play around for that and see you in the next module. 40. Midjourney Prompt Writing - Keyword: Hello, hello. In this module, we will continue exploring mid journey. I will show you how to write prompts in mid journey, as well as show you more advanced parameters and commands. Okay, let's get started. The first thing is the prompt writing. We've actually dedicated the whole module to prompt writing with stable diffusion. And we've talked about the organization as well as different parameters like seed and steps. With mid journey, all the parameters will be completely different. It's best to separate stable diffusion with mid journey so that you don't get confused in terms of structure and organization. It can be more or less the same. You can start with medium, then talk about subject action details, background include lots of stylizers, then artists. In terms of the organization, you can keep the same structure even though you can organize prompt any way you want. However, in my opinion, it's nice to have the structure. It's easier for you to write prompts. Let's now talk about keyword weight in staple diffusion. In order to emphasize a certain keyword, you used parentheses. To de emphasize a keyword, you used brackets. Here, it's a little bit different concept. Mid journey uses double column to separate concepts and assign weight to them. For example, we have a hot, do we have the double column dog? What do you think is the difference here? Let's go to mid journey dogs to check if your answer was correct. For hot dog you would have a hot dog, but for double column dog, you would get a dog with some elements of heat, maybe sweat or just the colors are more bright, bright orange. As you can see, the difference is huge because in the first scenario, mid journey treats hot dog as one concept. But the hot double column dog, it's treated as two separate things. This is how you can use double column, two separate concepts. For example, same with ice cream and ice double column cream, because ice cream is a dessert. But double column cream are two different concepts. It's ice and separately a cream. We can try that actually out. Let's use our meat journey to try those two concepts. In the first one, I'll write ice cream. In the second prompt, I will write double column cream. We'll see what are the differences. As you can see when I write ice cream, we are getting different ice creams here. However, on the double column cream, we're getting quite different images. Here I can see maybe two images of the ice cream, but the other ones here we have a woman with dripping frozen cream. And in the first one, it's also a frozen river. We have these miniature balls and dripping cream of some sort. You can see just by adding double column how big of a difference it makes. Okay, If we go back to our presentation here now, we've talked that this double column is used to separate contents, but we haven't talked about the weights. What is it about the weights? If you just use the double column, then you don't assign any weight. It's just going to be a default one, the double column has a weight of one and the dog also has a weight of one. However, you can also increase the weight. For example, here we have cat double column six. The keyword cat has the weight of six. The roof has the weight of one. You can put it in the opposite proof has weight of six, cat has the weight of one. When we talk about weights or emphasis, because cat has way higher weight, that we would see more of the cat in the image. By using higher weight, you indicate that you want this subject predominantly in the image. For example, you should see more of the cat here and a little bit of the roof. In this prompt, you would see a lot of the roof and maybe a small cat. By assigning weights, you can create composition. Okay, let's try that. Let's try these two prompts and see what's the difference. The first prompt is a double column six on a roof Here I'll put one, let's try this one. And the other one is a cat here, we'll put just one on a roof here, we'll put six. When we have a cat double column six or a roof with the weight of one. Here we have a close up of the cat, so we can see that the cat is the main subject of the image. However, if we have a cat with the weight of one or a roof with the weight of six, now the cat is really small. You can see that it just very small portion of this whole composition. And the roof is being the primary object in this image. That's how weights will affect the images. Also, I want to mention that weight, for example here six. The weight of six is applied to everything that's after the separation. Here we have a cat with a double column. One double column is a separator. Everything after the separator has the weight of six. On a roof has the weight of six. The keyword weight is also normalized. For example, double column dog is the same as double column one dog just because if you don't put any weight then it automatically is one. It's also the same as hot double column 20, double column 20 because it doesn't use the specific number, but it uses the ratio between the keywords. Here we have 20.20 the ratio is one. That's why all of these are the same. Keep that in mind when you use the keyword weight. Okay, lastly to de emphasize the keyword, you can use negative number. For example, if you don't want certain things in your image, then you can put double column and then negative in the number. For example, negative two. You don't want to have deformed elements in your prompt, you put deformed double column negative two. That's emphasis on the negative prompt here when we talk about the negative prompt using the parameter no, it's the same as if you used the double column negative 0.5 If you want to de emphasize a specific keyword even more than the no allows, then you can put a higher number and that will de emphasize it even further. Okay, let's strike an example. Let's write a prompt for a portrait, for example. Imagine, let's start with the medium. It's going to be a portrait photography. Then we have our subject. It's a girl with freckles. Then we have the action, she's holding a fox. Then we have a background in a beautiful forest. Now we can add the styliz, such as high quality, award winning photography, aperture, lighting, and so on. Let's put some of them here. Here put high quality, artistic, award winning photography, aperture of 1.8 and natural light, here I put high quality. You don't have to put high quality because mid journey already produces high quality images. But why not? Lighting is important and angles are important. These are the elements that specify how you want your image. Okay, now let's put things that you don't want to see in the image. For example, I don't want black and white. I don't want to have it deformed, and I don't want any watermarks. Now, let's put the weights because right now everything is together. The portrait photography and our subject should have the highest weight. Let's put portrait photography with double column and let's put a weight off to then a girl with freckles. Let's put that maybe a four, double column four, holding a fox in a beautiful forest that also should have quite high weight. Maybe I'll put a three here. Then we have our stylizers and negative prompt. We have to separate the stylizers with the negative prompt Even if you don't want to assign any weight to your stylizersn, need to put the double column. Let's put the double column here, then space. Now our negative prompt is separated. Then for negative prompt, we need to assign the weight. Again, double column. I'll put minus three because I don't want to see these elements. Now we have a prompt orchard photography with the weight of two. A girl with freckles with a weight of four. Holding a fox in a beautiful forest has weight three. Then we have our stylizersn. We have this double column that separates the prompt with the negative prompt. And for the negative prompt, we have black and white deformed water colors with double column and the weight of negative three. Okay, let's try this out. Let's see, we've got some gorgeous images here. As you can see in all of the images we have our girl, I'm not sure if she has freckles here. Maybe we should put more emphasis on the freckles because maybe slight ones but not apparent ones. Then we have a fox here and everything that we put in the prompt. 41. Midjourney Prompt Writing - Option Set: In mid journey. Actually, you can save a part or all of your prompt if you would. It's mainly useful if you generate images with the same style, Such is portrait photography, let's say if we want to generate another subject in the portrait photography medium, Let's copy this whole prompt. Let's paste it here. We would change a girl with freckles. Who? Something else? I don't know. Let's put a boy with freckles, for example. Let's remove the holding folks in a beautiful forest, let's say. Or in a gym, let's put it in a gym. As you can see, you've changed the subject, the action, and the background, but everything else stays the same. All the stylizersre pretty much the same because these stylizers apply to the medium, to the portrait photography also. The negative prompt also works here as well. In order to save time, you can actually save these things in a command. Instead of adding all these words, you can add something as simple as photography. You would get all these styles and negative from within this command. Okay, let me show you how you can set it up. There is a feature called Prefer option set that allows you to set options. This is the command name that you want to give to your specifications. For example, here we can put photography. The value will be all the words that you want to be part of it. Let's try it out. Okay then and here we have prefer option set option will be the name you want some short name and maybe descriptive. These stylizatrait photography you can put like photography for example, photography. Okay. The option is here and then we click on plus one more. We choose value for the value. We choose everything that will be repeated when we use the portrait photography, which are the stylizers, the negative prompt, here we have it. And you can also add parameters, for example, if you want, every time you generate portrait photography, if you want a specific aspect ratio, you can also add the aspect ratio, for example, four to five. After you're satisfied with the value, you click Enter. Now this option was saved. Now you can use the command Photography that will add all of these words automatically to your prompt. Let's say again, we want the portrait photography. Let's copy that. Portrait photography, a boy with freckles in a gym. Now, instead of adding all these Sty lasers, we can add the command photography. Again, you'll need to use then the name that you gave. In my case, I give it photography, so I'll use the photography. Okay, let's see. As you can see, it automatically changed the photography command to the words that we've used. Here we've got the images and as you can see, all the stylizers, negative prompts were added. Similarly, you can save other stylizers and parameters for example. Another example is if you want a character then you can use the stylizers. Digital Art three D rendering real engine. And then for negative prompts you can put deformed or simple. Then you can use the character command to add all of these words to the end of your prompt. For example, here if I put fantasy L and then character, it will add all these words to the end here. In this way, you can create different stylizersfferent negative prompts for specific mediums or specific compositions. For example, when you set up options, you can check them out by using preferred option list. This will show you all the options that you have. If you have a lot of options, this is quite handy if we go back and we put Preferred Option List. Now we need to click Enter. Here, we'll have. Options that we set. For example, here we have only one. Because we've set only one option, we have this photography, here are all the words that are part of it. In a similar way, you can add more options. If you want to delete option, all you need to do is again use the preferred option set here. Instead of setting a new option, you can choose the existing options. For example, you can choose the photography here. You don't choose a value, but you just click Enter. As you can see now it says that our custom option photography was removed. Another thing that I wanted to talk about here is an alternative prom structure. Instead of using the medium subject, action, details, background and stylizers, artists. Here we can change our prom structure a little bit. It's more convenient to save everything including the medium right now. For example here we have only stylizers can also include the portrait photography. We can re arrange our prompt a little bit here. Let's change it up. I'll just copy this whole prompt here. I'll paste it in this window. Let's rearrange it. Let's put our subject first. Move the portrait photography. We have our subject, We have a description in the background that will probably change with every image you make. However, the stylize would be in one place. We have this medium stylers and negative prompt and then parameter. Now let's just remove the subject and the background. Now we can save as the option, let's copy that and let's set it up. Prefer option set, let's call it again. Maybe photography, let's put our value and let's based everything here. Again, we have the medium stylizers, negative prompt, then parameters. And now click Enter to set it up. Okay, now it's set up. We can use it with any image. For example, we can imagine, now I can put any subject here. For example, a mice in a house. I want the photography setting. I will use photography. Then click Enter, because all of the words are ready there. Here we got a photograph of a house, maybe there is a small mice somewhere here as well. Overall, this is how you can save the styles with the stylizers and negative problem together. Anytime you want to, for example, create portrait photography, all you need to do is to write a subject and then use the command that you set up. In my case it was photography and mid journey will automatically add everything else. 42. Midjourney Prompt Writing Resources: Here I've prepared a few resources that can be helpful for prompt writing. For example, this journey styles and keywords. We've discussed this website when we talked about stable diffusion, prompt writing, but I think now it's more relevant. So this is a Github website. Here you can see a lot of keywords. For example, if you're interested in lighting keywords and you just want to explore what kind of lights are there and what would be most useful for my image. You can click on it once it's loaded. Here are different versions. You can select the version that you're using, for example, four V five or Ige for anime. And illustrations here are also different lights, for example, types of lights, lamps and tubes, types of lasers and so on. Find something that's more relevant, for example, types of lights. Here you have different keywords and you can see what image can be generated with this keyword. Right now we're using the version for these images were generated with the version four, for example, air glow or Alp. And glow here, there's a lot of different lights. Okay, if we go back here, there are lots of different styles, colors and palettes, Lots of different stuff that can be used for your prompt writing. Okay, let's go back. Here is a medium you prompt tool, and it's pretty cool, so we just click on it. You can start by typing your main idea, Maybe a subject action and details. Then here are all the different stylized artists that you can use. For example, let's for example, try now. Let's choose a style. The good thing about this website is that it has the example images, for example, charcoal style. Now you have an idea how it can look like. For example, if you're looking for a specific effect, you can browse through all these options. Let's choose futuristic. For example, here you can actually add the weight. The default one is one, but let's say you want a higher weight, so you can move to two, let's say. Then continue. You can repeat that. For all the other Stylis artists, for example, you can choose Gusto Clem. The only limitation here is that these artists are limited. You can see there's not that many. You cannot add your own here. But at the end you actually can add it to your prompt to yourself basically. But you just cannot choose the artist that's not part of this template, okay? Once you choose all these parameters like size, for example, aspect ratio, vertical quality, deol, let's say colors blue. Now you can copy this prompt. Let's paste it to mid journey. Imagine some strange randomly prompt here. Let's try it out. Here it, the version four. If you want to change, you can put version five or you can just delete this parameter altogether. Okay, let's try this out. We've got very interesting results here. I don't see any cats here, but in our prompt, the futuristic has the weight of two. Maybe it interprets as the name of a woman, for example. Okay, now you know how you can use this prompt tool to help you generate prompts. 43. Mijourney Parameters - Image Weight, Quality and Stop: Now let's talk about some parameters that you can use in mid journey. The first one is the image to prompt weight, as I call it in mid journey. It's just called image weight. It's a parameter with the values range 0-2 The default one is one. Basically, lower image weight values less than one and bigger than zero means that the text prompt has more impact. The higher image weight values bigger than one and less than two, that means that image prompt has more impact. If we see this example, this is the image prompt. This is the prompt example. This is the image birthday cake. This is the 0.5 image weight. As we can see here, this looks more like a cake rather than flowers, as we can see here in the image. But as the image weight increases now with the image weight of two, which is the highest value, we're getting something that really similar to our image prompt. This is how you can use image weight to manipulate how much of the image you want to see in final results. Let's try this. Here I have an image of Monalisa by La Da Vinci. I will use that as the image prompt. Let's copy the image address. Here I have my image address and I'll put Zendaya. Now let's try the image weight of 0.5 This is the first prompt. Then in the second one, we'll copy this whole thing, but we'll use image weight of two. We'll see how that compares in the first prompt here, where the image weight is 0.5 We can see quite contemporary images or even photographs of Zenda because here the emphasis is on the text pro. On the second example here the image weight is two. Here we can see that the whole composition, the hairstyle, the clothing, looks very similar to our Mona Lisa image, that what the image weight does. Okay, let's move on to the next parameter, quality. Quality, I would say is somewhat similar to steps, number of steps in stable diffusion. Because the lower the quality, the less details the image has. When you increase the quality, the more developed is the image. However, compared to steps in my journey, you can only have the three values here. It's 0.25 0.5 or one. The default one is one that basically means, for example, if you're doing maybe abstract art and you want less details, then maybe you should decrease the quality, make it 0.25 An important thing to say here is that quality doesn't impact the resolution. You'll have the same resolution as the other image grids as the default image grid, for example. It also affects how long the image will be generated. The lower the quality, the faster the image is generated and less GPU it uses. If you want to read a little bit more on quality, you can go to Parameters here. You can go to Quality and read more about quality Here, let's try something out with Quality. For example, imagine let's put robotic arm. And Quality is, let's use the lowest, the default, 1.25 If you want the default, you don't need to put any quality. Let's just use that one. When the quality is 0.25 the images we're getting, it's pretty detailed, but you can see this little bit. Noise It's not as sharp, but if we look, when we use the default parameter, the quality is one. Here, you can see all the details. If we zoom in, you can see outlines of the wires and all very clear and sharp. Okay, so we've discussed quality. Let's move on to the next parameter. The next parameter is Stop. What it allows to do is to stop the generation part way. For example, halfway. If you use Stop 50, for example, it accepts values 10-100 and default is 100. Stop 100 will give you the default image. Anything less than 100 will give you underdeveloped image. If I can say it that way, we'll have more smooth lines. If you use low stop values, then it can be blurry. Let's try it out in our journey. Let's imagine I want to try something fun. So we can put a Safari hat in a jungle. Now we can use the stop parameter, stop. Let's use ten. Let's now copy this whole prompt. Paste this prompt, and let's use 50.80 Let's see, the first one is the stop with the ten. As you can see here, we've got these very blurry images and doesn't resemble anything. Maybe because you know the prompt, you can imagine this is a dog and a hat here. But as we move up, for example, stop 50. We already have our Chihuahua in a hat. But as you can see, the lines are smooth and the background is blurry when we move to stop 80. Now the background gets more detailed. We've got, we're getting more and more details here. This is what you can do with the stop. If you are trying to create maybe more smoother lines, then you might want to use the stop parameter. 44. Midjourney Models: Now let's talk about models. You can get more information by going to jury dogs for example. You go to versions here, it describes different versions and what things they're good at or maybe not good at. For example, version five here, it says it produces more porter graphic generations than the default 5.1 model. This model produces images that closely match the prompt, but may require longer prompts to achieve your desired aesthetic. Each model has its specificity and you can read about H a little bit more in the docs, and especially if there are new versions that says where you should go and check them out. Okay, the version 5.1 currently has a default and raw style version five. As we read, it produces more photographic generations, but it requires longer prompts. The G five is a fine tuned model for anime and illustration styles. The G five has five different versions. It has the default original, cute, expressive, and scenic. If we go again back to our journey dogs, if we scroll down, this is Ng model five, this is the default image. This is the original cute, expressive, and scenic. As you can see, there's slight differences between these versions. Then you also have older versions like version 43.2 okay, where we can change our model. First of all, we can use that in a parameter. For example, if you write, imagine for example, let's use the same juju in a Safari hat. Here you can version, you can put, for example, five if you are interested in the version five, If you're interested in older versions, for example four, you can put version four and so on. Another way you can do that, if you go to Settings Settings, let's click Enter. Okay, here we have different versions. Version 12345, currently I'm using the 5.2 I'm quite surprised because they've updated this version just today. The 5.1 version is now odd. It's important to go to the journeys and read about different models because they update the versions very quickly. Make sure you can follow up with all the different models. Here, I'm curious, let's try version 5.2 Let's compare that to 5.15 and use different raw mode and so on. I will use a prompt. Imagine character mixed race girl in stylish clothing. I will first try the latest version, 5.2 and then we'll try others as well. Right now I'm using the 5.2 When you click on the settings here we can see the suffix, the parameter 5.2 Here we go. Let's try this out. Okay, now I will use the same prompt with a different model. Again, imagine now I don't want the 5.2 I want the 5.1 change the 5.1 Let's enter the 5.1 also has the role mode if you ever get confused how to write it up. So for example, version 5.2 and row mode, here you can see you'll use the modal number 5.2 and then you'll add the style raw. Let's add the raw style. Imagine we have 5.2 Let's add the raw style raw. Let's now try the version five. Let's try the G version five. Again, we'll put the imagine or prompt again. You can go back here. As you can see now it's G four. If we want the G five, it's going to be G five. Let's put that in. Okay, let's check out some models. This is the G four. As you can see, it's more simplistic compared to the G five, but again, it's very different styles. Then we have our version five. Well, let's start with the version four, This is the regular version four, this is version five. We're getting these characters in full body size 5.2 style is in a way similar to the modal five because here we are having full size characters. Then this is model 5.15 0.2 As you can see, the 5.2 has really nice lighting and background compared to 5.1 where it's more simplistic or even white background. Here you can see that with all these models we've got quite different images. It's worth reading some information about the models and trying different things or different styles. 45. Midjourney Parameters - Stylize: Okay, let's go to Stylized Values. The stylized values are also part of the settings. If we again go to Settings here you can see the versions. Let's switch back to the latest 5.2 version. Here you can see different stylized values. You can see a stylize, stylize medium, stylize high, and stylize very high. What are the stylized values? Let's go back here. The low stylization means that images closely match the prompt but are less artistic. High stylization means the images are more artistic, but they can be less connected to the prompt. This is somewhat reminds me of prompt guidance for stable diffusion, But with the prompt guidance, it's opposite. The lower the value, the less it matches to the prompt. However, here, the lower the stylization, the more closely it matches to the prompt, and less artistic, the high it is, then less connected it is to the prompt. Different journey versions have different stylized ranges. How can you check which version supports what range? Well, you go to journey dogs here. Again, if we go to parameters parameter list here. If we look at the stylize here, you've got some ranges right now, version five, version four and G five have the same stylize range, 0-1 thousand. The default number is 100. Let's look at some images. For example, here we have a prompt colorful risograph off. If you don't know what's risograph, it's basically a printing technique for the style zero. We're getting this basic Gu form that does look like a risograph. Here we're getting this basic background, yellow, white, and so on. As we increase the style parameter, now we're getting more details. This is 100 is the default parameter. As we move even further, 250 here, the images start to look less like a Risograph but more as a realistic drawing. For example, here as we increase the styles to 1,000 here we have so many things going on. Very rich backgrounds, very rich details, and so on. As you can see for style zero, we're getting these basic images that are aligned with the prompt. However, as you increase the styles parameter to high values to maybe 750 or 1,000 you start to get more detailed and reach images that may not align with your prompt. Sometimes you want to generate more basic images that correspond to your prompt, and other times you're looking for more creative and artistic images, then you can use a higher stylized value in the settings. Here we have stylized low, stylized medium, Stylize high, and styliz very high. What do these keywords correspond to here? The stylize low is the same as the style 50, Stylus medium is the same as the default Stylus 100 Stylize high is the same as the Stylus 250, Stylus very high is the same as Styles 750. You can set up stylized parameter here. All of your images will be generated with the same stylized parameter. For example, if you want more artistic images, you can use the stylus very high, or you can use the default and write your dial parameter in your prompt. For example, imagine I will use bike infographic illustration, for example, I will put the stylize you need to use style, let's use zero. You can also use, instead of stylize, you can also use the. Then let's put, for example, default 100. Let's check this out with the style zero, we're getting this basic background this big, doesn't even have any keywords. Yeah, as you can see, very basic composition, just the two D illustration. As we move to 100 here, we're getting a little bit more creative. As you can see, images have texts. The first one is interesting, it has the forest background really cool. And then we move to 400, and now you can see that the bicycle is now more three, you can see some cool things inside of it, I'm not sure what they are, but it's definitely more than just a regular bicycle. This is the stylized 400, and when we move to 1,000 now we're getting really crazy things. A bicycle with lots of electronics, wires, a lot of things that were added to bicycle. And on the fourth one we can see some information about house trees and so on. A lot of things. That one is 1,000 and we've got another image with 1,000 Again, too much things going on here. Now you know how you can use the stylus function to change how much of creativity you want to add to your image. Or if you just want to generate basic images that align with your prompt. 46. Midjourney Parameters - Chaos, Tile, Seed and Remix: Let's move on to chaos. Chaos is an interesting parameter to call chaos. You just need to put the chaos or the chaos affects how varied are the initial image grids? It ranges 0-100 and default is zero. This is the default image. As you can see, the images within the grid are quite similar. We have this pink owl with green eyes and green background. When we move to Chaos 80, now we're getting different images in different styles, different medium. Here we have carved watermelon. Here's maybe a plastic owl. This is a tattoo or cool illustration. As you can see, each image is very different. That's what chaos variability. Low chaos produces images within the grids similar to each other. High chaos produces images that are varied and have unexpected composition or artistic mediums. Let's try this out. Let's try a fun prompt, A cartoon lizard in a raincoat walking in a forest. To add chaos parameter, begin using chaos or simply C. Then we can use value 0-100 Let's try with zero, that's the default one. Let's generate a few more. Let's do 50.100 with the default chaos of zero. This is a default image. We're getting quite similar images between each other. Of course the style is a little bit different. For example, here we are getting more illustration here, the background and lighting, it's more three D. But overall, the image and composition is overall similar. When we go to Chaos 50, now we're starting to see that all the images within the greed are quite different. Here we have a cartoon here. Also cartoon but completely different composition. Here we have a moon, a lizard is in the puff jacket. Here. Again, completely different composition. And this is some realistic, a bit scary thing. Okay, let's move on to Chaos 100. Here, I don't quite see a lizard on the second image. I don't see a lizard on the third one. I see a person on the fourth one. This is not a lizard, this is some random cartoon character. The first one, well maybe this character looks like a lizard a little bit with a butterfly ears, I don't know. But as you can see the images very a lot, that's the whole point of chaos. It increases the variability. You can use chaos to look for different composition. If the first images that you generate are not something that you're looking for, you can increase the chaos a little bit, but maybe not too much. Otherwise, you couldn't get like very weird things. Okay, let's move on to tiles. That's a fun one. If we add a tile to our prompt, for example, we can use a simple one, like Music Now we can put the tile, what it does, it will create a similes pattern that you can use for clothing Merge and so on. Here we've got four results. You can choose the style that you liked, for example, the first one is quite nice. Let's upscale the first image. Okay, now let's save the image. We can actually check the seamless pattern. I found the seamless checker here. You can upload your file. As you can see, it works. There are no problems with it at all. It nicely adds to each other. This is how you can use the tile parameter, okay. Let's move on to seed. Here would be quite similar to the stable diffusion seed. However, in mid journey seed numbers are not static and should not be relied upon between sessions. They're not quite reliable before people use them to create consistent images, but right now there are other ways you can do them. Let me just talk about a little bit here. Using the same seed number in prompt will produce similar images. That's the same as with stable diffusion. Let's try something. For example, imagine a fox in, in a forest. Let's generate this here. In order to get the seed number of these images, you would need to go to reactions and add an envelope reaction to the image. Once you add the envelope reaction, you would get the job ID and the seed number. Now you can copy the seed number again to get the same prompt, I will use the same prompt. In a forest, I will add the seed. It's again, it's seed. And the seed number here, it should produce the same result here. As you can see, by running the prompt with the same seed, we got the same results. But the problem is, is not quite reliable in my journey because it can be different between sessions. There is another way you can create consistent characters or modify your pro, and that's the button that we've talked a little bit about. The remix button is a new feature that allows to change subject lighting, add remove elements, and adjust settings while keeping the overall similarity to the starting image intact. For example, let's try a character. Let's, for example, imagine a portrait of a girl with freckles. Let's generate tests here. For example, let's say you're interested in number three. I want to add a different reaction. For example, smiling or a different background. I can add that in the remix. For example, a girl freckles, let's put smiling. I also want to change a background, green background here. As you can see, I think the first image is very similar to the bird girl here. The remix is in a way like a seed in stable diffusion that allows us to modify our prompt a little bit and add different emotions like smiling or frowning. And also change the lighting background and other parameters. I want to show you another really cool thing about remix. Let's use a different prompt. Imagine a dozen eggs in carton illustration. Now we've got some X here. Well, I think all of them are quite similar. Let's use number two for example. You can use the variation or remix. It doesn't quite matter, but if you like a particular style of the image or composition better then use the variation. In this case, I like the number two. I use the variation two. Here I can remix the prompt. I will change the subject. Instead of x, I'll put hamsters, a dozen of hamsters in carton illustration. Everything was kept the same except of the subject. Here we can see that we've got the same composition as we have with the x. Here we have this carton with x. Now the ****** that were for x are taken by the small he hamsters. As you can see, the composition is the same. But now we change the subject to hamsters, and instead of x, it put the hamsters to these places. This is something that I think is really fun because it allows you to use the composition from one image and use it completely different things. Okay, Maybe, let's try a different image. For example, the fourth one here, I'll change eggs to happy owls. A dozen of happy owls. Here we've got some owls. I think number two is the best one. In terms of details, I like how journey tried to make all the eggs here into owls. As you remember, those three eggs were flying not in the carton. These ones mid journey made it into owls. Happy owls. This is something very fun that you can do with the remix function. I think what it also really good is if you have a certain image with the composition that you like, you can use it with a different subject, but keep the same composition. Yeah, the remix takes the general composition of the starting image and incorporates it into the new generation. You can also add or remove parameters such as no style or stop and much more. Also if you remember from the first module on mid journey that when we tried to do remix and change the aspect ratio, it stretched the image basically. If you are changing the aspect ratio, that's what it's going to do. 47. Midjourney Emojis: I decided to finish this presentation with modes. Basically, with mid journey, you can only use modes. For example, you can use this microscope to symbolize the microscope photography. Let's try to use modes with mid journey. I can again use the image and I will use the microscope emoji and the strawberry mode. Here we've got the straw brace, but we didn't quite get this micro photography. Let's try a different prompt. Here I have the microscope pug and a mushroom here. I want to try to use the version four. What I found is that with the version four, if you add the microscope, then it would give a micro photography better than other versions. For example, we can see here maybe inside of the mushroom with some fruits. I'm not sure, but it definitely looks cool. Okay, let's try one more. For example, imagine I'm again going to use the version four, and this time I want to use very different modes. I'll use the rocket and a person in the yoga pause, a man in lotus pose. Let's try that. I'm curious what it's going to generate. Okay, let's add the version four here. We've got an astronaut with really beautiful background. However, I don't see anything to do with the, the lotus position or yoga and you were here. But it's something to do with the rocket and space, that's for sure. This is how you can play around with Mo. You can add emoji and prompt text. Prompt. You can add both things together and see what it's going to generate. It's a fun way to play around with mid journey. 48. Midjourney Image Generation Example- Portrait: In this video, we will try our prompts with stable diffusion, the same problems that we've used with other platforms. Let's get started. Okay, first is the photo realistic portrait. Let's use the imagine. The first one is professional portrait photograph of a young British woman here. As you can see, we don't have any weights. I will add some weights here. Professional portrait photography, that will have a weight of two of a young British woman in a jacket. Let's put young British woman in a jacket with wavy blond hair. Let's put that if four. Then we have beautiful symmetrical phase, cute, natural make up that. Let's put the weight of three. Then blurry, rainy city, street background, that's also important. Let's put the weight of three as well here. Then we have our stylizers, we can keep that the same for the aspect ratio. Let's make it four by five also because I want the images a little bit different from each other. Within the grid, I will add the chaos again. Chaos, maybe not too high, Let's put ten. There's a little bit variation between the images. Let's also use the stylizer will be a little bit more artistic. Let's put the stylize, let's use High Stylize 250. In these images, I actually don't see any street background. And that's because I put the space here between the weight city street background was treated with the weight of one. Because here we have this space. Let's run it again and not have that mistake here, The blurry, rainy city street background. Let's remove the space here while it's generating the image, let's try the other portrait. Prompt again we have, imagine, let's portrait photograph of a young Indian woman with long hair, beautiful symmetrical face. So basically very similar. The background is different, colorful street market background. Let's also assign some weight here. Again, portrait photograph I would the weight of Indian woman. Let's use that as cute, natural make up. Let's put that with the weight of three. And then we have colorful street market background. This is very important. I'll also use the same weight as with the woman. Let's put three here. Let's also add the Chaos. Chaos. Let's use the same ten and stylize, let's use maybe 200. Also I want to use a different model. I want to use the version five. I'll put the five because it's more photo realistic. Let's use that one. Okay, we have these images of a young British woman here. As you can see, we've got this nice street background. The space here is very important. Make sure you don't have the space between column and the number when you're citing the weight. As you can see, now we are getting this background here. Overall, the images are quite nice. I like number two here. If you want to upscale a specific image, you can upscale it, but because it just separates the images. If you want to, let's say upscale all the images, the easiest thing to do is to put an envelope Emoji here, add reaction and find your envelope Mogi. When you put the envelope Emoji, you'll have a new message. Now you'll have the full size images separated from one another. We've also got the images for the young Indian woman, here we. Get a girl here. I think the more appropriate images were third and the fourth one, but the background is not colorful enough for me. Let's try to change this a little bit. I'll copy this prompt again. Let's use this prompt now. I'll put the weight on a young Indian woman with long hair. I'll put the weight here with four colorful street market background. That four, I also want to emphasize colorful. I'll put the color, I will put that also. Four, highly detailed. Okay, let's try with the version five. The version five has less of this journey aesthetic. Let's use that one. Let's use the newest version here, 5.2 for the version five here. Well, some colors, definitely with the image number two and number three. Number four looks a little bit more plain. With the version 5.2 This is what I'm thinking about is having these very colorful clothing items. Maybe like here, but we've got wrong ethnicity here. Let's try again. Let's put five with long hair, that's not too important. Long hair, beautiful symmetrical face, colorful street market background. With weight of four, Colorful, highly detailed, and so on. I'll put the chaos lower. Let's put chaos, for example, two style. I'll remove the stylized parameter at all. It is more aligned with my prompt here. Here we're getting colorful images, but I was looking for Indian as the person from India. Now we're getting Indian as the Native American feathers and stuff like that. In order to direct me journey to the right ethnicity, let's add some Indian traditional clothing like say here. I'll add the clothing attribute wearing. Say here I also specify South India and the woodland. Also, I'll add lower stylize. The default stylize is 100. Here, I'll put stylize 80. It aligns with my prompt a little bit better. The results here are way better. This is something that I was looking for. We have this market in the background and overall, just very colorful palette. We have an Indian woman here in Sari, very beautiful. I would also probably emphasize the background a little bit more because in these two we have quite plain background. But the other ones, I think the number two and number three are pretty good. 49. Midjourney Image Generation Example - Logo: Now let's try some logos. The first one is line logo of cup Kick with a tear and top clean line, simple shape, minimalist vector. As you can see we're getting these very simple images that what I'm looking for in the logo. Very little detail, quite plain background. You can see we've got different colors here. If you want white background, then you can put white background. Let's use it again. But now let's increase the chaos. For example, if you're looking for some logo ideas, I think it's important to add a little bit more chaos. You get different results. Let's put chaos. I think we can put 30. Yeah, let's put white background. Now we're getting these a little bit different results. You can choose the one that you like the most. Also with Journey is nice that you can actually add the text here as well. For example, if you're creating a logo for a bakery, for example, you can put a line logo for a bakery and then portraying cap with a tear in top. If you want a name you can put also at then we want simple shape vector, white background and all use the same caves. Now we're getting not only the image of the cupcake, but also the name of bakery, for example. As you can see here, the words are quite legible, but they don't mean anything. But I think it's a good guide for fonts. For example, this image, a phone like this one, would be very suitable. And here we have this cursive font. This gives you ideas and I think this is a great resource to try it out. For example, I really like this number four. I think this simple and very cute. Let's use a different logo. Let's imagine tree inside a water droplet, slick and minimalistic logo. Ecto graphics. One color white background. Let's remove the color. Let's put two color, two color palette, temper style, eco friendly business details. I would actually put a higher weight here just because we have lots of words. I'll put the weight of two, weight of two, no ****** and space here three inside of water droplet double column two. Then we have logo vector graphics and so on. Okay, let's try this out. Actually let's use sleek and menus logo, let's also put that width two. Now for our parameters we can put chaos. We have some variation. Let's put 20. We don't want too much stylize. We can make stylize a little bit smaller than the default stylize. The default is 100, let's make it 80. Let's try this out. Let's see, this is not bad. However, I think this is too much details for a logo. I will actually change the prop a little bit again. I'll put, imagine here inside a water droplet. I'll, I'll just use the minimalistic logo. Minimalistic and line logo, then vector then that's with the weight of two. We have two color palette, white background taper and so on. Let's make the style even lower. Yeah, let's put 50 here. I think the number three is the best one in terms of the Vector logo here. However, we are not getting this nice. Let's try again. Inside a Water D. Let's use just the line logo because sometimes too many words is not that good as well. Then we have two color palette. Let's try one more time. I'll use the, imagine I will use the tree inside of water droplet. I'll keep that same. I'll change the quality to 0.25 I'll make the chaos a little bit higher so there are more variation. Let's try this out here. The images are a little bit more on what I'm looking for. However, the best way to let me join you know what you want is by having a reference image. Let me find a reference image on Internet. Here I put the loba try in water droplet and here I've got different company logos. I can use something for a reference. For example, this one is quite nice. I'll copy the image address here. And I'll put to my prompt. So I'll use the same probed, but now I will add the image address and we'll paste my probed as a side node. You need to be very careful using other images as you reference just for copyright or legal issues. Let's see these images. I think the second one is nice. We've got this quite simplistic tree, even though we didn't get this water droplet shape. But we can create variations and see if number two, I want to also add what a draw put shape, maybe that will help. Here we've got some variations and as you can see, I think on the fore front we're getting this, what a draw put shape. And the tree, of course, you would need to change it a little bit, but overall this image is not too bad. 50. Midjourney Image Generation Example - 3D Render, Anime, Characters, Landscape and Concept Art: Now let's move on to magical realism and create some three D renders. Again, let's imagine our favorite three D render of the raccoon reading a book. Yeah, let's keep that the same in terms of the aspheric ratio. Let's use a square for chaos. Let's be chaos a little bit higher chaos, Let's use ten. Here we've got really cool images of the raccoon. In terms of proportions, everything looks really good. Here we have this dim light cozy background, cozy setting, and we actually can see a lot of details. If for example, upscale number four here, to upscale the number four here, we can see a lot of objects like shelves, candle box, then we have a lot of other things in the background. It does feel like home. Okay, now let's move on to illustrations and characters. Here I have portrait of skinny anime boy with glasses listening to music in the street of rural Japanese city. Here, because this is anime, the keg model works better with anime. I'll use the keg, I'll put a Eg five if we go quickly go to the Journey Dogs. Here you can see all the different versions. There's the default, the five, but also you have different styles. It can be original cute, expressive sat. Let's try first with the default and also try with different styles. Here I have the keg five. Let's use the same one with the styles G five here I can put cute. Let's do another one with scenic again five and then style set. Here we've got different styles. The first one was the default. We have this Anime Boy, as you can see, change when we add the style style cute, now it's a different style. Then when we add the scenic, then it's also a different style. Depending on what style you want to create Anime, you can choose the corresponding style. Okay, now let's do some characters. For example, a girl riding a bike by our, I have here a girl riding a bike by artists. I wanted to be a character sheet, I'll put the sheet. And then a girl riding a bike. Let's try this out here. You can see that we've got this white background. When you put the character sheet or the character concept art, that will usually give you white background and more of the character. If this is something that you want, then you should use these keywords. Okay, let's try something different. Again, I'll use the, imagine I will, I'll use the character sheet, then I will put a mixed race girl in stylish clothing. I would also add keywords that would be applicable to characters. For example, three D and I'll put concept character art with front and back view. Also, we can add Unreal Engine, that's good for characters overall Unreal Engine. Okay, let's try this out here. We've got characters from different angles as you can see. So we've got the clothing, and now it really looks like a character from a game. If you're developing a character for a game or for a book, you can use Journey for inspiration for example. Or if you're developing character for a book, for example, for book illustration, then you can use these remix patterns to place the same character in different positions with different emotions and so on. So let's move on to landscape. We have here digital art of magnificent medieval castle between the hills and fields, large panoramic background with dense nature and mountains. Grand fortress, Epoced fantasy. Let's try what it will generate and then maybe we can add more parameters for Asper ratio. I want it to be a landscape. Let's put three by two here. We've got some epic images with the panoramic view of the castle. I think for landscapes, if you want to make landscape a little bit more dreamy, then you can use the stop parameter. Let me copy the prompt again. If you want more soft lines or more soft, for example, sky or trees and so on, then we can use the stop parameter. Let's add it here. We have the aspect ratio, let's put it stop. If you remember, stop is 0-100100 is the default and fully developed image. If you stop the generation process halfway or part way, then it's going to have more smoother lines and bluer things. Maybe not make it too low, maybe somewhere around 75. Let's check this out here, we're getting more dreamy atmosphere. And I like this more because with fantasy it works really well compared to the previous image where the lines and outlines are very sharp. Here we're having the more smoother shapes and especially the sky or the background looks more enchanting and light. Okay, let's move on to our last prompt. And it's the meaning of life. The meaning of life. Breathtaking art, stunning high resolution, highly detailed, inspirational, and eight K. We don't need that with mid journey. Yeah, let's try that out without adding any parameters first. Okay, here we've got some amazing artworks. Look at all these details. In three of them, we have a person and just lots of things going on. I want to upscale the images and look at all these details. Let's do that. Let's upscale maybe 1.2 Let's take a look. Here we have a back view of a person. We have all those different elements. We have contemporary elements like cars, sculptures, a lot of things. I don't know if it's a theater or some cool architectures combined together. Very fine concept. If we look at the other image, that's also very interesting. This is in more surrealism style. Again, we're getting all the details. Also I want to add that with the new update that happened while I was recording the scores, We've got new features that allow us to zoom out, basically do out painting, let's try that. Here the image was extended, the boundaries of the image were extended. It's a bit hard to see all the elements in order to see them better. Let's scale. Let's upscale the first one. As you can see, now we have even more details and things going on. Yeah, you can use the zoom out hinting feature. One thing I want to try with this concept art here, we're using the default stylize. I'm wondering what's going to be generated if we increase the stylize parameter to 1,000 This will generate the most creative and artistic images. Let's try that. We'll put the stylize the highest, 11000. Let's see, Again, we're getting so many details. Yeah, this is, the third image, reminds me of the artworks by Dutch painter Bosch. So I'll show you what I mean. This is one of the works by Boss. As you can see, there are a lot of elements here and we're getting also a lot of elements in our images here. The second one, I think this is deep. The fourth one is a fun, dreamy image. Okay, so we've tried all our prompts with mid journey, and for every prompt we wrote, we got beautiful, amazing images. And we've also tried different parameters, such as stylized parameters Aspherio. We've talked about the stop parameter and many more. So now it's time for you to try it out, play around and bring your vision to light with my journey. In the next bonus video, I will show you how you can put your face on any image that you create with my journey. 51. Midjourney Bonus Video - Faceswap with InisghtFace: In this bonus video, I've decided to show you how you can put your face into one of the mid journey generated images. For that, you would need to create a server. So pay attention because it's a little bit advanced, but it is fun. So let's do it. Okay. When you sign in to Discord account on the left panel, you'll see this plus. Click on this plus here, you can create a server. You can create your own, or you can choose from a template. I usually like to choose from the template because the set up is much faster. Once you choose one of the templates, then let's choose for me and my friends here. You can put some image, for example, something like this. Or you don't have to put any image. Now we can name our server, for example, my face swap. Let's create it here. You've got some channels for information, then you have the text channels. You usually want to use the general one. And you can create more channels if you want to. What we'll need to do, we'll need to invite mid journey board as well as the inside face bot. This is the bot that allows you to put your face to mid journey generated images. Let's start by inviting mid journey bot. We've created our server. Now let's go to mid journey here. Right now we're in new B group. You can go to the Mid Journey server and you'll need to find the Mid Journey bought here. And you can click on this ad to server. Or you can go back to your direct messages here in the mid journey board. Also, you can click on this journey board and then click Add to Server. Once you click Add to Server, you can select which server you want to add. We've named our server my face swap. Let's add the Mid Journey bought here, then let's continue. Let's give authorization to Journey bought for all the following. Let's authorize it. Let's confirm that I'm a human. Now we can go to my face swap and that will bring us to our server. Okay, if we go to general here, you can see that journey bought was added to our server. How do we know that? If we go to now we're getting all the commands that we can use with mid journey. For example, we can imagine here such as a grass hopper for example. You can see that we feed our server. We can use mid journey board the same as we used with direct messages. Here we're using the general text channel. However, you can also add more channels and create private channels if you want. Okay, now we have this mid journey board here. We will also need to add the inside phase board if we go here. This is just the link to inside phase I. And here they have all the different projects and information. However, what you really need is this inside phase discord. But if you click on the link here, you can add the bot to your server. Here we've already got our server selected. Make sure this is the correct server. My face swap. Then let's continue. Give authorization, send messages, attached files, and so on. Authorize, confirm. Now inside phase has been authorized and added to our server. Then let's click to go to our server. Now you can see that the inside phase swap was added. Now we have two bots, the journey board and the inside phase swap. In order to use inside phase swap, you'll need to upload your own image. Let's do that. For that you'll use the command here on the left panel. You can choose what bot you want to use. For now we want to use inside phase swap. Here you can see what commands a part of the spot here for example, it has the safe ID. Said ID swap ID and so on. What we're interested here is the saved here. It takes the ID name and the image here. You can upload image of yourself or anyone that you want. Let's use my image here for example, this one. The better the quality of your image, the better the results will be. Here we've uploaded our image here. You need to write the name, the ID name. You can put whatever you want. For example, one. Let's click Enter. Now our ID name is set up. What we want to do now is to generate some image with which we can swap our face. For example, I will use the Mid Journey. This is the Mid Journey command. Imagine here I can put a girl superhero with blond curly hair. Here we've got some superhero curls. Let's choose one image. For example, number two, upscale it. Now we've got this image. If we write, click here, now we select Apps and swapper, choose the swapper. That's it. Here you go. As you can see, the inside face used, put my face on top of the superhero girl face. As you can see, the proportions are quite all right. It captured the features. However, what I found out is that you have a straight looking face, the feature works a little bit better. And also, if you find the image that looks more like, in order to make an image that looks more like you, you can actually use your images when you're creating mid journey image. For example, let's again use this one. I will copy the image address here. Now I will use mid journey. I'll use the image, I'll paste the address of this image. I will describe myself. A girl with dark blond curly hair, green eyes, business portrait photo linked in avatar photo professional. Let's try that here on all the images, the model is looking straight. That's because we've used the reference image. My photo where I'm looking straight at the camera. Here, we've got nice and straight looking photos. That's exactly what we need. We can swap faces with one of them. For example, let's use number four. Now we need to double click, choose Apps, and choose Swapper. Here we go. We've got an image, a professional image, that resembles my photo. That's how you can use your photo on any image that you generate with mid journey. I think this is a fun tool to play with. Try it out on this node. We finish our mid journey module. In the next modules, we will talk about AI photo editing. See you then. 52. New Update: Midjourney New Features: Hello everyone. This is a mid journey update. We will see what's new in mid journey in November 2023, Okay? So let's see what features have changed, improved, or are just new. Okay, now we have a tune command that allows to choose a particular style of images to generate. In, in a sense it's fine tuning and enables us to generate images in a specific style, which is pretty cool because they didn't have any fine tune functions before. Then we have an up scaler, so now we can upscale Journey by two x or four x. Then we have editing, which is also very exciting. Now we can edit specific elements in the picture. I will show you how. Finally we've got this weird parameter that makes images weird and you can try and experiment with it. Let's get started with the tune command and see what it does. Okay, let's go to mid journey Do here. Just put in style tuner here what it says here. It personalize the appearance of your mid journey images using the style tutor. Use the tune command to generate a range of sample images showing different visual styles. Based on your prompt, choose your favorite image and you'll receive a unique code you can use to customize the look of future jobs. Okay, you just put tune and then your prompt if another user has previously generated a style tuner. With your prompt, you will receive a link to that tuner. Click the link to access it. We'll try just that. Let's go to Mid Journey. Here I am in Mid Journey board. I'll just put tune here. We can put either a very simple prompt, something that someone already has done before. Let's do that. For example, photography. Hopefully someone else did the tuner. As we can see here, we have prompt photography. We can see that someone else has created a style tuner using this prompt. Here you can see different number of images, 32, 6,428, It's best to use the biggest number of images because that gives you a wider range to choose from. But we'll start with the 32, just for you to understand how it works, let's just click this, okay. From here you can choose the style of images that you like. For example, I want to generate my images in this minimalistic style. I will just choose this, and it gives me this code. If I use this code, for example, I imagine, and then I'll just put a boy with a balloon. And now I will paste the style. This is the code. It should generate the image in this style. Let's try it out. The images are in the same style as we chose here. This code corresponds to the specific style. Let's say if you chose not this one, we can put that back to the center and choose a different style. As you can see, the code has changed. Now if we use this code, it's going to generate in this new style, but you can also choose multiple styles and it's going to combine them. The more you choose, the more generic it will become. If we go back to the numbers here, the more images it generates in the style tuner, there images you can choose from. Let's say you want to create a specific style that you're looking for. You generate let's say 32 images and the style you're looking for is just not there. Then your best bet is going to 128 images. That's going to give you a wider range of styles to choose from. But say you want something more specific, then you're going to go and tune here. You're going to put a specific prompt, For example, a minimalistic logo. Of angry cat, something like that. And let's press Enter. Now you can see that no one else has the same prompt, hence there are no links for that reason. Here you will need to choose how many images you want to create. Let's choose the simplest 116 style directions you can choose or Default. Now let's use Default and click Submit to start generating. Click. Are you sure it's going to cost this number of hours? Let's do that. It's going to notify us in around 2 minutes while it's generating. Let's check out other features that's new to Mid Journey. Okay, here I have a realistic photograph of Burster creating later art in a cozy coffee shop. Wo Ambience, let's upscale an image, for example, this one. Okay, the image is upscaled. Now here you can see that we have way more options that we can do. The very region is the added in painting, which allows us to edit certain specific parts of the image. Let's do that. It's going to open a window here. Now we can actually change a specific part of the image. Let's highlight this. Now I want to say put hands holding a cup of coffee. Let's generate. Okay, now we got four images with the edited part. As you can see, the woman is holding a cup of coffee, but that cup is quite huge. I think the best one is this one. Let's zoom into the hands seems okay. But what I find about these journey edits is that sometimes they're quite out of place. For example, like this one, there is still a room for improvement for Journey in editing. Okay, that's the editing here, for example, let's choose number four. Now we can upscale and upscale four times. Basically it's just going to improve the resolution. Let's just do two X. Okay, while it's upscaling, we actually have our style tuner ready. We've got the link here. Here was our prompt and minimalistic logo of an Angry Cat. We can compare styles. As you can see, even 16 images is pretty good to make a decision about a style. Yeah, I liked the first, the third one here. We can choose it here and it's going to give us the code here. There is actually another method to choose between styles. We are choosing between two styles, this one or this one, or neither. If you don't like neither, another way is to pick your favorite from a grid. Here you get a big grid and you can choose between two images. This one for example, or this one. In this case, this image or this image in this big grid. It might be easier just to see different styles. It's up to you which one you use. At the end, the code will be the same. If you choose the same styles, for example, if I just choose this one, it's exactly the same as on the other method. Okay, here we can try to combine two styles. Let's say this one and that one. They are pretty similar, so let's copy it. Let's go back to at Journey. And let's imagine here we have a manual sic logo of an anger cat. Here we can change it to, for example, a dog. Let's try. Here are the images. There are slight differences between the logos here, but the overall simplicity and style is quite similar. This is how you can use tune command to generate many images for. Your specific prompt, choose the style in which you want your images. Now with this code, you can generate consistent style which was not previously possible before. Okay, now let's get back to upscaled version. Here we have the two x upscaled version. As you can see, the resolution is way higher here. I don't quite like this cup here. You know what we're going to do? We're going to go back to the original image, the one that was edited, and we'll upscale this image by a factor of two. Okay, here we've got our original image upscaled. What I wanted to show is that even though the resolution is higher, but the problem is the elements and details, it doesn't improve them while upscaling here we do have a problem with the hands here. It still persists in the upscaled by factor of two version. This is just something that you need to be aware of. The last feature that I want to show you is the weird parameter. Let's see what Jeri has to say about it. The optimal weird value is dependent on the prompt and requires experimentation. Try starting with smaller values such as 250 or 500, and then go up or down from there. If you want a generation to be conventionally attractive and weird, try mixing higher stylized values with weird. Try starting with similar values for both. For example, a cat stylized 250, weird 250, here are, this is the weird zero result and weird 1,000 Again, this is weird zero, looks pretty normal those two. Let's see, the weird 1,000 Okay, what's the difference between weird chaos and Stylize? Chaos controls how diverse the initial grid images are from each other. Stylize controls how strongly mid journey default aesthetic is applied. And weird controls how unusual an image is compared to previous mid journey images. This is something you just need to experiment with and we'll try something basic here. Let's imagine a cat in at, let's weird zero, then let's go common a cat in the hat. The 500 for the last one, let's make it crazy. 1,000 weirded. Okay, let's see, this is weird, zero, weird 500, the same as zero. Maybe the prompt is not the best one for trying the weird stuff, weird cat and had 1,000 Here we definitely see more interesting things going on on the hat as well as the cat is now a little bit different. Reminds me of a header from Alice in Wonderland, this one. These were the updated features in my journey. I especially like the tune command. So go ahead, play around and experiment. 53. Introduction to Basic AI Photo Editing Tools - bigjpg.com and vectorizer.ai: Hello. In this module as well as the next few modules, we will be covering photo editing AI platforms. So what's that? Well, before when we used mid journey Dali staple diffusion, we used text to image generator. So basically you provide a text and AI generates an image from that prompt. The photo editing, you can take that image and improve it. You can change the lighting, change the background, maybe cut out the image, or convert the image to a different format. All those things now we can do with AI as well. In this module, we will cover the simplest tools. These are very easy to use, they are for a specific purpose. For example, here have the biggbgt com that enlarges images without losing quality. Then there is vectorizer that converts PG and PNG images to SVG vectors that's useful when you want to create a logo. For example, you generated logo with Journey and you've got the p image. But now in order to make it into a logo, you actually would need to convert to SVG vector. This is where this app will be handy. Then we have the segment, Anything.com That's a research demo by Meta and that allows to cut out any object from the image. Another app that I've decided to include here is Creator.com This platform is mainly for e commerce purposes. It helps to place your product in a nice background. It helps to generate background for your product. Okay, let's start with the big.com here. When we go to the website, it's pretty basic. All you need to is click on the Select Images. And select Image here I'll go with the concept art that we've generated with Mid Journey. Then when you upload the image, all you need to do is to click the Start here. You can choose the image, type artwork, or photo. You can upscale up to four X for free. If you want a higher upscale, then you would need to upgrade for noise reduction. Noise reduction, basically it fixes a grainy area of the image and smoothens out the lines. If you find that your image has too much noise, then you can select more higher noise reduction. I think here let's select a medium. Let's click okay. Let's also select a few other images. Here I have a low resolution photo image. Let me show you, this is the low resolution photo image, because if you zoom in, you can see pixels and just Noise. I will upload this photo photo. I'll click Start here again. I can choose the image type. First I want to try the artwork, then I will also do the same, but with the photo you can see the differences noise reduction. Let's do the highest, and let's again use the same image. Now I also like to photo for X and highest. Let's check out these images here. The first image is the original one. It's the image that was generated by mid journey, and that's the highest quality that we could get from mid journey. We zoom in. As we zoom in, you can see that we start seeing those pixels and more noise when we scaled this image here is the upscaled version. If we zoom in here, you can see that the lines And overall the shapes are way more smoother. If you are trying to print a poster, then using a tool like this one that enlarges the image even further may be very useful because now it's nice, clear and sharp. However, one thing I want to say is that if there are any artifacts on a mid journey, for example, with these R or sleeves, then you would have to manually fix it. Because when you enlarge it, those things will be the same as the original image. If there are certain things that you don't like in the original image, then you would have to fix it yourself. Okay, let's move on to the photo. Here is the original photo image. As you can see, the quality is not great. You can see you have these grainy areas. Okay, If we compare this original image to the upscaled versions. This is version where I chose photo as the image type. Here I artwork as the image type. See, there is quite a difference when we choose the artwork type. Then it smoothened everything here. Now we don't see any noises, but just the whole photo now looks more like a drawing or painting, perhaps it doesn't have that photo realistic effect in the photo type here, we still have some noise. It didn't get from the overall noise, but it did improve the quality a little bit here. But overall, I like the artwork type better here. Okay, so this is the big Pg.com There are plenty of other tools that enlarge images, and they use different algorithms and stuff, so you can try finding the one you like the most and stick to it. Okay, let's move on to vectorizer. Here again, very simple interface. You just need to drag images. Let's try with some of our logos, for example, this one. It processes quite fast. This is the original image. If we zoom in a little bit, you can see that the original image gets the pig cells. But the vectorized result, it doesn't. You can see that the lines are nice and smooth. This is very important for logos because with vector image, you can make it into any image resolution you want. For example, if you want to make your signboard where it has to be a very big image, then this vector would work as well as a small menu image. That's what's great about the vectorized result. And you can download it very easily here. You can choose the Pig or EPS, these are both the vector formats. And then you can choose the version and other settings. And then just click Download. 54. Basic Photo Editing Tools - Segment-anything.com: Okay, let's move on to our next tool, the segment, Anything.com by Meta. Here, when we go to the website, we see this landing page with information about the model. If we want to use the tool, we need to click on the stride. The demo here, we will need to agree to the terms and conditions. Here it says that this is a research demo and may not be used for any commercial purpose. Any images uploaded will be used solely to demonstrate this segment. Anything model, all images and any data derived them will be deleted at the end of the session. Any images uploaded should not violate any intellectual property rights or Facebook community standards. We need to agree that here we have a large gallery with images. You can select and try this model on these images or you can upload your own image. Let's upload our own image here. For example, let's use some con we got from mid journey. Here we have con. Now the segment, anything will process our image. Here we've got our image. If we point to any element of the image, you can see that the whole element gets selected. For example, if we point to the ****, the whole **** is selected. If we point to the book, then the whole book is selected. Let's try that. I point, then I just need to click. Now the recon is selected and I can cut it out. Here I have the cut out object option. I just need to click this. It will cut out my ****. And as you can see, it just cutted out the portion of the ****. It didn't put the book or anything or any of the background, just the **** here. If I want to use this image, I can copy the image, or I can save the image. Let's save. And here is our image. Okay, what if I want the **** with the book? Let's go back here. I just need to select the **** first. So you can see this blue dot when you select something gets added to the element. And then I also want the book, so I'll click on the book. Now we've selected two elements. Here. Again, I can go to cut out object. Now we have this raccoon with the book. Let's save the image. Now we have the raccoon with the book. If you want to use that image anywhere you have this PNG image. Okay, In a similar way, I can add multiple elements. For example, add the ****. I can add the book. I can add the lamp. I can add the armchair, and so on. Maybe a window or something. If I don't want a specific element in the image. For example, it selected something that I didn't want, then I can click on this remove area and click on the element that I don't want to see. For example, I don't want to see this element here. As you can see, it removed it from the selection, or for example, I don't want to see this now, it removed that element. Okay, Another way you can select elements is by using the box. In the first one, we just hover and click on the element. Here, you can use the box around the element you want to see. For example, if I want a raccoon with the box, I just make the box around the raccoon. As you can see, again, only the **** is selected. If I want the book, then I need to click on it as well. I will need to click on the book. Now we have two items selected. I can cut out the object and it will be in my cut out gallery. Another thing you can do is Use Everything button that basically will scan your image for all the elements. If you want, you can cut out everything. You can cut out all objects. Now you can see that every element is cat out. We have the raccoon, we have the orange chair, we have strange objects and so on. I think segment anything model works great on product photos. For example, if you have a product photo with some background, maybe a black background, and you want to just have image of your product without any background, then you can use this tool that will be very handy. For example, let's try that. Here I have an image of sneakers that I've generated with mid journey. Here we have the sneakers. Let's say I want to just have the image of the sneakers again. I need to select one here. Let's select it. We've selected one, then let's select the other one. If you made a mistake for some reason, you can always undo or reset, or you can redo. For example, here we have selected our two sneakers. Let's cut out the object. Here we have it. Let's save this image. Now we have a very nice cut out image of our product. This is the tool that quickly allows you to cut out any object or element from your image before. What I was doing is, for example, this is the image I'm using. Preview. Preview. Here I have this magic wand and I was selecting it and trying to remove. However, with this, as you can see the selection, it just used the similar color. It's pretty difficult here to cut out it nicely. If you compare this to what meta cut out here, it's just in one click, you have the full object. Very easy. And a great improvement over previous tools. Okay, so that was segment anything.com 55. Basic Photo Editing Tools - Creatorkit.com: Now let's go to Create Kit.com The Created Kit is mostly for E Commerce and it's used to help you place your product in a nice background. Let's try it out. Okay. First you'll need to sign up or sign in. You can sign up with Shopify. I already have an account, I'll sign in. Okay, once you sign in, you'll have this button here created with AI. And here you can upload your product photo or you can try with one of these images. I think it's fun to try it out with your own product photo. Let's use the sneakers image that I've generated with Mid Journey, and then I made a cut out with the segment. Anything. And if you look here, it says that using transparent PNGs gives you a better result. Because otherwise they'll need to automatically remove the background. That may not be as good as the segment anything model. Let's use the PNG image that we have here. Okay, here you can make the image a little bit bigger or smaller. Sometimes that doesn't work and you may need to reload the website. That just helps you to choose a composition for your product. Okay, let's continue here in the settings. First, you'll choose your product category. What is your product? This is a footwear. This will help I place your product in a correct manner. Okay, Then you can choose styles. There's a bunch of different styles. Flowers, wooden floor, and so on. And also for example, here you can also see what it can be with the shoes. Here they have different examples of different products. Jackets and so on. Sofas, yeah, cool stuff. You can choose the style or you can choose to write your own prompt for the background. For example, here you have a chair inside and minimalistic studio with plants. Now we can write, you put sneakers. Sneakers in a beautiful mountain with snow can be something like that. The negative prompt is we don't want reflections. Maybe we can try that here. We can also select the model for created kit diffusion model, version 0.8 This is the model specific to the created kit. And here's also an option for the 1.0 but you'll have to have the enterprise plan to be able to use that maybe later. Let's try with the 0.8 version. Okay, this is pretty cool. I like the first one. If we go back here, I think this one is a little bit strange. The shoe was hanging in the air. But the other ones, yeah, it's all right. I think the first one is the best we can say image and check it out here. The background is quite nice. The only downfall here is that the image resolution is way worse than the original image that I gave. This is the image that I gave the app. If we zoom in here, you can see that our product is now pixilated. Okay, let's try something else. Let's go back and maybe this time choose something from the styles. I think I like the paint one. Let's see what other things they have. They have the forest, they have tulip field. The grading back drop also looks fun. Yeah, let's stick with the paint again. Let's select the footwear here, let's generate it. We've got some cool backgrounds here. I think number three is my favorite one here, just because it aligns with the shoes and we have a nice reflection shade here. Everything looks pretty good compared to the other ones where it's floating around. Okay, yeah, we can save that as well. Here we have this nice background. The only problem again, is that the resolution is pretty small here. If you want a higher resolution image, then you would probably use a different tool or different method. For example, creating the background in mid journey. However, what's nice about this tool is that it created those shades. If we go and see our original product, the original image is this one. As you can see, there are no reflections. And here the created it nicely added those reflections and integrated the product into our background. Okay, in this module we've covered basic and simple tools that help to edit your images. For example, we've covered the big GPG, which helps to enlarge your images. The vectorizer that converts your images to vectors then segments, anything that cuts out your images, and the created kit that helps to create a background for your product. 56. ClipDrop Introduction: Hello? Hello. In this module we will talk about Clip Drop. Clip Drop is a fun platform. It's an ecosystem of apps, plug ins, and resources for creators powered by artificial intelligence. Basically, it's an AI image generator, as well as a place where you can edit those images or any image or photo. It's developed by stability. We've talked about stability and we've talked another platform that they've created, which is Dream Studio compared to Dream Studio. In my opinion, Clip Drop has a way better interface. Clip Drop has a bunch of tools, for example, clean up, remove background, and so on. Let me show you the first tool we have here is the stable diffusion model. And that's the basic text to image generator. If we click on this here, you write your prout and you will generate images. But what's fun about the clip drop is that here you can try out the new models that stability AI develops that may not be released to the public yet. Similar to Dream Studio where you have the access to the newest models, here you can try it out. Okay, the next tool is the crop and that's the basic out painting tool. Then we have the reimagine that allows you to do variations of your image. Then we have the clean up that allows you to remove objects from your image. We can also remove the background, We can change the lighting of the image. It's especially nice, four photographs that we have the image upscale. We can upscale 2x4x in seconds. Then we can also replace the background. In the previous module we talked about create a T e commerce website where you can use your product and then change the background here. You can do exactly the same thing. You can write a prompt to generate background for your product or any other image. Then we have the text remover. If you want to remove text from your image, you can also do that. 57. ClipDrop Tools Overview - Stable Diffusion, Uncrop and Reimagine XL: Let's get started and let's explore those tools in a little bit more details. The first one is the Staple Diffusion. Here, I'm not going to spend too much time because we've already talked a lot about staple diffusion and we've tried the dream studio, and it's basically the same thing here. Maybe we can generate a few prompts and then move on. For example, let's put a fairy inside a purple galaxy bottle, Magical background here. We cannot put the negative prompt, but I believe we can choose the style here. If you click to a No style here, we can choose the style that we want our images in. Again, you can choose origami line art and so on. But for now let's just keep the default and let's click Generate. You can actually subscribe to have faster generations or you can skip. Okay. Not bad. We've got pretty good images. We've got this fairy in a bottle and she's holding another bottle that's pretty cool. With the free medium version, you have this water mark with the clip drop again. Well, this one is nice. You have some butterflies as well. If you want to subscribe to clip drop, the prices are quite affordable. If we go to pricing here they have the free version and the pro version, if you have the monthly subscription. Here you have the unlimited tools that you can use for stable diffusion. You have up to 1,500 images per day, which is a lot. Here again, we've got these beautiful images based on the simple product we've provided. I'm actually surprised because the quality is pretty good as well as the proportions. Again, this is, it should be a human like figure, not bad. Okay, let's move on to other tools. The next tool is crop, which is basically the out painting here. You can try it with their examples, you can check the original one was just this image when you extended the image. Now you have this, you can see with other things, this is the texture landscape, pretty cool. Let's try it out with our images here. I have a fun image, we can use memes here. I think it's going to be fun for out painting. I have this meme here now. I can choose which way I want to extend the image. I can extend it as a landscape. This is the custom one. I can write my own dimensions here. Or you can choose the landscape or portrait, or square. Let's choose the portrait. And we can move these as well. For some reason I cannot move the upper one or the bottom one. Okay? But let's try with this next. Now you can see that it generated pretty fast and it extended our image. Now we have four different versions and we can see which one is the better one. This one is pretty nice. She's in the dress, although what does she have here? The upper one is not quite bright. Okay, here again, we've got the space here. Okay, let's generate again and see if there's anything better. Okay, we're still getting something strange on the top here. The bottom half looks nice, but the top is just horrendous. Okay, I think the best image here is the second one, the dress. It's pretty good how it extended to the legs. Maybe this leg is too skinny, but overall, I think it did a pretty good job. Okay, let's try with something else. Maybe let's use another image. For example, let's use baby here. Let's again do the portrait. It's definitely has some weird things going on on top of the image. I'm not sure why the other images we have, this one is pretty good. It extended the C and now we have this nice Sky. The boy is wearing a cool shirt here, also changed the shirt and so on. Okay, so these are the things that you can do with the out painting in clip drop or it's called crop. Okay, Now let's move on to Reimagine here. You can again try with examples below. For example, let's try with the bed bedroom. Here we've got some beautiful interior design. This is the original image. Now we got some variations here. All of them. I like the colors here. This is interesting. The lamp. Okay, this was with the example image. Let's try out our own image here. I've generated an with mid journey. It's an image of the office. Let's see what variations we can make here. Now we've got our images here. This is the original image. These are the variations. This one is overall quite good. This one has a few artifacts on the floor. I think something wrong with the lamp here and the computers here. The original image that I've appl image generated by mid journey. I think this image is superior over the variations. The variations were done with stable diffusion. It's a journey. Still is a little bit better because here you have less artifacts compared to other images here. But overall the style and design is pretty spot on. No complaints here, especially that you can try this for free. Okay, let's go back. This is the re, imagine you can create multiple variations from a single image. 58. ClipDrop Tools Overview - Cleanup & Remove Background: Now let's move on to our next tool, the clean up. I think this one is pretty good. Here we can again try with some examples. Let's use this image here. Let's say we want to remove a pen here, we can just highlight the object that we want to remove. Then we can just put the clean. Now, it's really cool for brush size, we can make it smaller or larger depending on your image and what you want to do. Yeah, let's now upload your own image here. I want to use the image that we've generated with mid journey. As you can see, it has a lot of elements here. I'm just curious how it's going to remove things. Although the quality is not the best, I'm not sure if it's going to do anything. If you make a mistake, you can always undo it. If you want to move the image, then you can press the smooth button and move your image. I don't want to see the sculpture here or I don't want to see this person here, which will be even more intriguing. Remove that person from here. Okay, let's clean it. Okay, that's interesting. It just left this white space here. Let's try with a bit more realistic image. For example, let's use our me here. Let's say we want to remove the so here, let's do that. Let's increase our brush size a little bit here. Now I can remove the lady here. Okay, here, it's not perfect. That's probably because our image was too big. It covered a lot of the background, didn't quite capture the background here. Let's try it again. Let's undo it. Try to remove all this area, maybe that will help. Let's clean again. Here, we just have smudged area and that's probably because this image was too large. Okay, let's try a different image, So maybe we have a smaller things that we remove. Maybe that will work better. Okay, let's go back. Let's choose a different image. Here I have a photo of the beach with people. So here we have a few people here. And see if we can remove some of the people here. Let's maybe this girl here first, kill it clean. Okay, actually, not bad. The only problem here, we've got the reflection now. Let's also try to remove the reflection here. Okay, this is pretty good. Like nothing was there in the first place. Let's try a few others and see if we can remove all of the people here. Very good. Let's try a few more. Let's see if we can remove two at once. Okay, that worked. Let's remove everyone else from here. And boy, oh wow, we removed all the people here. And it's undistinguishable that there was anyone in the first place here. That's pretty cool. Now, let's remove those as well. I'll move my image then again, select also this boat. Let's remove that one as well. Let's clean. And maybe not sure what this is, let's clean this. Now here with this clean up tool, we've made a beach full of people to a secluded beach with no trace of a person. This was the original image here, we've made it from here to this. That's what the cleanup tool will do. If you have anything that you want to remove from images, then you can use this clean up tool. As long as the item that you want to remove is quite small and doesn't take a lot of the image, then it will work well. However, if it's too big then you might run into this smudge area thing. Okay, let's move on to our next tool, it's the remove background tool. In the previous module, we've talked about the segment, Anything by Meta that allows you to cut out a specific element. Here, it does pretty much very similar thing. It just cuts out your object from the background. Here you can read some more information. They claim that they have the most accurate background removal solution available and you can see an example. So this is the image here. You have the person's hair. As you can see, that hair was kept when the background was removed. However, competitors, usually some of the small hairs would be deleted. Let's see other objects. We have complex objects, we have this thread, Competitors have the left behind. Then some sharp edges. Clip drop removes image backgrounds and keeps the edges of objects extremely sharp here. If I zoom in a little bit here, you can see this sharp curve here. For competitors, we see some of the background left behind. The last one is focused only on the main object. Here, you can see that it can detect the object very well here. For example, here it knows where the stool is versus the shade. However, for competitors, the shades may be interpreted as part of the image. Let's try with some example image. Here we have the photo of a woman with lots of hair. So see how it's going to be removed. Okay, here we can actually move our background left and right here. If we move it completely to the left, we have this image without the background. One concern here, I think it left a little bit of the background here. Maybe it tried to save the small hairs, but now we've got lots of this yellowish colors here. It would need extra editing here. We can download this image. Actually let's download and see. As you can see, there's a lot of this yellow hue here. Let's try with a different image. For example, let's use the image that we've generated with mid journey. We have this glass with galaxy inside of it. Let's see, Overall, I think it removes the background pretty well. You can see that it's sharp and no, it doesn't leave any artifacts. And it's also very fast. We can download that here we have nice sharp edges as they promised. Good. Okay, let's do one more. And here I want to use the sneakers that I've tried with a segment, Anything by. And see how will the clip the same job. Okay, let's download this now. Let's compare to our, the image on the right. We got with segment anything by the image on the left. It's by, as you can see, the results are comparable. If I zoom in pretty much the same results, actually maybe the segment anything here is a little bit better because it didn't include this blue line here. But my conclusion that overall they're quite comparable tools you can use whichever you like the most. 59. ClipDrop Tools Overview - Relight: The next tool we can try is Re light here. We can choose image from example. Let's use this photo for example. Here by default you're getting two different lights. You can add more lights if you want with the new light button for the background. You can keep the original background, you can the background have more lighting received, light that is going to be affected by these lights. Or you can make it transparent. If we just click transparent, then we only work with the subject, with this person here. Then we have ambient. Ambient is basically exposure of the cut out of this person here. We can increase exposure or decrease it. Let's keep it, it wouldn't affect the background. You keep the background. Let's remove the transparent and keep the original here. It only affects this person here. See, we're changing the ambient light without any effect on the background. Okay, let's keep the transparent ambient somewhere in the middle. Then for lights, we can move out the lights depending on what you want to create. Then you can also increase exposure of the light and the distance, how much it can reach. Here's small reach versus large reach. Then we also have the radius. If you decrease the radius, that will also quite similar to distance, would decrease the light exposure. Here you also have the second light. You can try out playing with the second light, the first one, depending on what you try to create. Let's try with a different image. Let's use maybe a portrait. Here we have the portrait of girl with pearl earring, as you can see. Here we have the pre selected color of the light. Here you can actually choose a different color if you want. You can select maybe gray green. You can try different colors. Or if you have a specific color, you can input that color here. For example, I think maybe brownish color would work well here. Maybe like this on the color feels more natural here. Then the second light is blue. And then we can manipulate the slight to make a visual effect on this image. For example, here, be nice. Okay? Or you can delete this light if you want. Then you were just left with light, one source of light. Okay, let's keep the second one. I just want to make it blue to match maybe her scar. Now we can make the exposure a little bit less. Yeah. And if you want, you can add multiple lights again. We have different light here, make exposure smaller up to your image nation. When you're done, you can download the image. Let's download this image. Here we have it. Another thing I want to try is I want to try it with a product image. Here we have the image of the product, I think with the product photo, this is actually a really good tool because you can spend so much money for a professional product photo shoot Here you can make it in one click for the background. I've actually uploaded the PNG, but you actually a background. You can add a background if you want. You can add a specific color, for example, white. You can also make it to receive light. Now you can create this fun effect for the background. Maybe we can make the ambient a little bit lighter. Here we have the light. Let's make the green exposure higher. Let's make it green, Let's make it blue. Well, this is worse. But basically here, you just need to play around with these different settings until you find the perfect combination. So I'll try to do that here after playing around with different lights. Now I've added one more light. Light Number four, the background, I used a lighter color. It wasn't too black because black wouldn't receive any light for ambient. I made it a zero. I don't want too much exposure for the lights. We have some distance here, not too much then the distance is similar for all these lights. It really depends on lighting and where you want it to be, just changing settings based on your product. Okay, after you're satisfied with the image, you can download it. I'm happy with this one. Let's download it and see the final result. This is the original image and here's the image with the lights. I really like this one. I like how it highlighted the night sky on the shoes. Pretty cool. That's the real light tool that you can use in clip drop A. 60. ClipDrop Tools Overview - - Image Upscaler: Let's move on to our next to image scalar. Here you can upload your image or photo and upscale it to 2x4x8 x and 16 x. However, if you want to upscale more than two X, you will need to pay for subscription. As you can see, you can subscribe to an annual or monthly plan. Okay, let's try with some examples here. This is, I assume the two X. This is the original image, as you can see, it's grainy image. Some things are blurry and pixelated When we move to upscaled version here, let's see, on the right hand side is the upscaled version. Here you can see the skin texture very clearly overall, it's very high quality. On the right hand side, the upscaled version, okay, this is the photo realistic image. Let's see some other things, maybe with the objects. Okay, so we've got this blurry, pixelated image with upscale. We got the C sharp edges. Very good. Very good. Okay, let's try with our own images and see how that compares to the tools that we've tried before. Okay, I will upload the same image as I upload it with Ppg.com Okay, let's scale. Let's check it out. This is the original image here. Now the app scaled version. Let's move in a little bit. It's definitely cleared some areas, made some smoother lines and outlines. But I would say that for this example, I liked the result with Big GPG better. Let me show you what we've got with Big GPG that, you know, this is the image we've got with Big.com and that was four X. Even though this is two x, of course the quality will be less than the four x. However, in my opinion, the overall I really like the way the big improved this image here. Okay, let's move on to a different image. Okay, here I will choose the photo that we've used. I've used image here. Let's upscale it. Okay, let's check it out. Here is the original image when Drug. Now it's a two X upscaled version. As you can see, this is better. The upscale worked quite well here. Here on the left hand side, we have this gray image. The upscale smooth the photo out very nicely. Let's see, even with two X, it did a pretty good job here. Okay, let's check what's the difference with the big here. Here on the right hand side, we have the image up scaled by clip drop with two X. On the left hand side we have image upscaled biggs. As you can see, I think the clip job did a better job here in terms of it removed the patchy area. If we look at the hair for example, we still have those patchy areas here. It's way more smoother. We can see that the hairs are more clear and distinct. If we look at the sleeve here, look at how sharp is the edge here? Here? It's a little bit more blurry. Overall, I think with this photo clip drop did a better job. Okay, I wanted to try the 16 X and detailed version, so I bought the subscription. Let's try our photo with the 16 X and see if there are any differences. I'll choose detailed and 16, once you have a pro version, you'll have all your images that you've generated here in the history. You have up to 14 days to save them, okay? Let's check out this image, okay? Here, the detailed up scalar actually made it more blurry. So this is the original, we've got these blur clusters, okay? So that was the detailed, the detailed setting. And 16 x, I've actually generated the 16 x with the smooth, let's check that out as well. This was the smooth up scalar. As you can see, this is way better than the original. It removed the noise and enhanced the image. I wouldn't say the 16 X is too different to the two X because here we still have some areas for improvement in terms of the colors and the skin texture. But it's probably because the original image was just so poor quality that there's not much to work with. Okay, in my opinion, the two X version was very similar to 16 X. Here in this particular photo clip drop may be a very useful tool if you want to upscale and nose you images. Here you have a smooth or detailed version. If you have a subscription, you can try to do four x or even 16 x, depending on what you want to achieve with your image. One thing I want to point out here is that the image up scalar is a bit different to photo restoration if you are looking to restore your old photo, for example, old photo like something from the 20th century, black and white photos and so on with scratches and so on. Then image up scalar may not be the tool that you're looking for. You may need to Google some old photo restoration AI tools right now. There's plenty of them. They will restore your photo. For example, I just found a random one vans AI. As you can see, it allows to restore this old photo and completely removing those scratches and those folds to make a colorful image. Here, those are the tools that restore the photo. This is the tool that scales even though it can remove the noise and enhance your image. It's not a photo restoration tool, it's the upscale tool. 61. ClipDrop Tools Overview - Replace Background: Let's move on to our next tool. It's the replace background tool. That only works if you have the subscription. You need to buy the subscription to be able to use the tool. Here you can try some examples. Here you can see the photographers. Here is the original image. Now we've replaced the background. You can see the cut out of this person. What's noticeable is that the hair, you can see the cut out because the a lot of hair, it's not as smooth. You would need some photo editing to improve the quality. Then we have in the park, we removed the background here in all of these three images. For me, it's apparent that this was a cut out just because of the lighting and the background doesn't match. These things would need improvement. Let's check other examples. Creative agency, here's the car, then we've changed the background here. I think for the object, it's a little bit better. For example, for number three, it's more realistic here. Here at least it added the shades to make the object fit in the background. Let's try some other examples. This is the sneaker to remove the background. I think with the shoe here, it actually did the best job. Selfie, I think would be the same as with the person that we've seen. That yeah, the cut out is quite apparent here. Okay, the conclusion here is that for objects it works way better than for people. For now, let's try with some objects here. In the last model we've talked about creator kit that allowed us to replace background, we're giving an image of our product and then we were choosing a style or we were writing a prompt to generate the background here we can see how the clip drop compares To create a kit. For that reason, I'll be using the same sneakers. He our PNG image. Let me upload it. Okay, here are our sneakers. Let's use some prompt. Maybe modern colors, modern colors, splash ground. Let's click Generate. Okay, this is actually pretty fine. We've got some sparkles here. Here are the four images that we've got with this prompt. I think number four and number two are the best ones here. Let's try a different prompt. Let's use the sneakers again. Here we've got a random background prompt. We can use this button to randomize it and see if we like it. In this case, I want to compare this to Create. I will use the same thing that we've used there. Let's use our sneakers on top of the mountain. Here, I'll put sneakers placed on the Snowy mountain. Let's generate that. Okay. Here we see the mountain background and we can actually see that sneakers are sitting on the Brock or Snow. Let's see other images here. Here we have a good integration of sneakers with the background. Because here, the shades. Yeah, I think this one is very nice. Here again, we've got these shades that make it more natural. I think the best one here is the second one. Let's download it. Okay. On the left hand side we have the background generated with clip drop. On the right hand side we have the background generated with created T. As you can see, both platforms integrated the product, the Seekers pretty well. Into the setting. It added the shades here as well as here. Both platforms did a good job on that. However, the big difference here is the resolution of the image. Here we've got a much higher resolution compared to the created kit. This is the image that we've got. I didn't minimize it or anything as you can see. If we zoom in, let's zoom into this image. Image of the product is very poor quality area here, it's all pixelated. If we zoom here, it's still a pretty good quality if we compare this to the original image that we've supplied. So that was the original image zoom. Okay, Maybe here it's started to get more pixilated. The clip drop has shrinked the product image a little bit. If we zoom in here, it's worse quality. But overall compared to create a kid is way better. So now we have this nice big image that you can use on any platform. For example, if you want to post that on Amazon, that would be a good enough image to post that compared to this small image. In this case, I would use the clip drop because it makes a way better quality images and that's what matters. Okay, let's try with the same tool. Let's try some different format. Here I have the image that we've generated with Mid Journey. Let me show you. I have this girl here. You can see this is a water color image. I want to see if this replace background tool will adjust the background to the style of the image that I'm providing. Okay, let's try that here. We can make it larger or we can rotate it. Let's keep it like that. Okay, the image I've provided, it's not a PG. It has this white background. What it did here, use the remove background tool to remove the background first. If you don't like the way it removed the background, then you can provide the PNG image here. It did it automatically and as you can see, it removed it removed the toys eyes because it was the same color as the background. Let's just try this here. I can use this randomizer. Let's just put here a beautiful landscape, and let's click Generate. Okay, Here you can see that the style was adjusted to the image. We didn't get any photorealistic background. Now it's more watercolor, blurry and so on here. I think the number two is the best here. All of that looks watercolors. Let's see, the first one here, we've got different style. I think this looks more pastoral style. Doesn't match too well with this image, but the number two worked great. Even though in the prompt, we didn't put anything, we didn't add the water color, we didn't add any style. But it just used the style of the image to make a background that matches well with this image. I think that's a very smart tool. Okay, we can download this. As you can see, this replaced background tool will allow you to place your characters into different settings and background. This is the replaced background tool that allows you to generate a new background for your product image or for a character or a specific object. 62. ClipDrop Tools Overview - Text Remover: Let's move on to the next tool. The next tool is our last tool for clip drop is the text remover. Okay, let's see the use cases. Here we have the creative agencies. Let's see. This is the, the original image and you can see that there are those letters. Now we've removed them. Okay, let's try the molk. Here is the product. We don't want to see any text. We remove the text and you can see that there is the light logo. It didn't remove the slight logo, it just removed the text image editing. Again, here we have the slogan, we don't want the slogan. We remove it and we can put any other message here. Okay, great, here on the T shirt. That also removes the text. As you can see in all those four scenarios, it was able to detect the text and remove it. Okay, let's try with our images. Here with me Journey. We've generated some logo images because the text was legible, but we would want to replace it with something else. That's why this tool can be very useful here. As you can see, we had the text here and now it was removed. Now we can download now. We can put our own name off the bakery if we want to. This was the logo with the text. Now it's removed and we can add anything here. Let's try a different one. Let's use a different logo here. We have this text here, which is relatively easy to remove. Let's try another one. Here we have more text. Okay, this is interesting. Let's go to the original image here. This is what we've generated with mid journey. And you can see that this bit wasn't removed, possibly it wasn't recognized as the text. We can try again and see. Maybe if we try it again, then it would be removed. Let's see. Okay. Yeah, it didn't remove it. It didn't recognize that as part of the text. Another interesting thing is that it changed this cupcake base. It made it worse. I don't know if it recognized the text and wanted to remove it, but clearly there were no text here. It would be easier to remove the text ourselves here than rely on this AI because now we have this messed up cupcake base here. I'm not sure why this happened, but as you can see on other images, it worked really well. But this one, for some reason, it didn't give us good results. You can try it with your own image or any other image where you want to remove your text. Another thing that I want to try here is uploading a photo of a product with a lot of text and see how well it would remove the text from it, the same as with this more cup. Let's see here. I found two images on the Internet. Two photos. I wonder how well the text would be removed from this product here. Okay, let's see. Okay, not too good of a job. Let me show you the original image here. This is the original image. Tried to remove the text, however, it created some smudge area. I don't even know what this is. Maybe try to replace it with bubbles or something. But overall here, because it's a bit more difficult, because this label is transparent and we have this gradient in the background, it's more difficult to remove the text from here here, It didn't do a clean job, actually. Maybe the clean up tool would work better here. Let's actually quickly go and check it out. Okay, this is the cleanup tool. Let's upload our image. Okay. And here, let's try to remove this text text here. Okay, let's clean this up. As you can see here, we've got a way better result than the text removing tool. Sometimes the clean up tool would be more appropriate than the text removing one. Maybe this part doesn't do it quite well. Actually, I'm curious what's going to happen with the high quality. Let's go back. We have those text selected. Let's choose the high quality. The high quality mode takes a little more time to reprocess your image for better results. Okay, let's use that. That's only available for people with subscription. I think that was quite similar. I don't see too many difference with the previous result. Okay, so as you can see, the clean up tool actually here in this case was better than the text removing one. Okay, let's go back to our text removing tool here. I want to use the other image. It's also the image that I found on Internet and as you can see it has some text here. Okay, again, we didn't get quite a good job here. Unlike what we've been promised in the use cases with a mocap where it removes very cleanly the text here, when we applaud our own images, you can see that there are some problems still. Then for this case, you can try to use the clean up tool. Okay, now we've discussed all the tools that Clip Draw has. These tools can be very useful and helpful when you do photo editing or image editing. Here they even describe some use cases for these tools. For example, team portraits to automatically harmonize your team photos to create a beautiful and consistent team pages. Car resellers, you can place car in a nice background or real estate. For real estate, you can improve the quality of the image. You can upscale, you can change the lighting, for example, and also e commerce. Because there are a lot of tools that allow to make your product photo better, Remove background clean up, and all those tools will help to make your image better for me. My favorite tools in Clip Drop are light and the clean up. I think the is quite unique to Clip Drop because I couldn't find anything similar to this in any other platform. And what's also really nice with Clip Drop that there is a premium version that allows you to try it out and if you like then you can subscribe and pay membership. Okay, so that's it for the clip drop. In the next module we'll move on to Adobe Firefly. 63. Adobe Firefly Introduction: Hello everyone. In this module, we will go on exploring Adobe Firefly. Adobe Firefly is AI image generating and editing app. It was developed by Adobe and it launched quite recently in March 2023 and became available to everyone only in June 2023. Firefly has a text to image generator and it uses its own model. It's different to the stable diffusion. It's not based on stable diffusion, it's their own model and it was trained on a dataset of Adobe stock. We can read more information from their own website. Here they have the article, how Adobe Firefly is different to stable diffusion. So if we go here, how Adobe Firefly differs from stable diffusion. And here it says that Adobe Firefly is a family of creative generative AI models currently in beta plan to appear in Adobe Creative Cloud products including Adobe Express, Photoshop and Illustrator. It was trained on a database of expired copy right, and openly licensed images. Firefly takes text descriptions and translates them into AI creations. From amazing images to unique text effects. With more models planned. Adobe Firefly basically uses their own proprietary model, which is different to stable diffusion. Here they don't go into details for the differences between stable diffusion and Adobe Firefly, but we have this general understanding of their models and how they're trained. If we go a little bit down here, what tent does Adobe Firefly train on? And here it says Adobe Firefly trains on Adobe stock imagery in accordance with the stock contributor license agreement. Openly license content in public domain content with an expired copyright. This means anything you create in Firefly will not infringe on the copyright of artists. It also opens the possibility that everything Firefly generates will eventually be suitable for commercial use. So I think this can be very important for some people and why someone may choose using Firefly over other models, for example. Okay, what tools do they have? Well, first of all, they have their own text to image generator with their own model. Let's go and check that out. This is the text to image tool. Then we have generative fill, which is basically in painting. If you go here, generative fill, you can use a brush to remove objects or paint in new ones. From text descriptions, remove objects. In the previous module, we covered the clip drop. They had the tool called clean up. That's basically the same thing. You can remove objects here. Also, apart from just cleaning up the image or certain elements, you can paint in new ones. From text descriptions, that's basically in painting. You remove certain part, you write your prompt and it will generate something else on that spot. Then we have the text effects. This is a unique tool that Adobe Firefly provides. You can create custom cool text effects with that like feathers, bread balloon texture. We have the scales and so on. Then we have generative recolor. It will generate color variations of your vector artwork from a detailed text description. Here you need to provide a vector artwork like SVG. It will help you create color variations from your image. It will recolor, make a different color palette. Here you have this yellow and now it repainted orange. Then we have tools that are not publicly available yet. They are in exploration, as it says here. You cannot use it yet. I will not be covering these tools, But let's just check them out and see what does Firefly plan for the future. This is the three D two image. It creates a three D scene and uses a text prompt to generate an image. Here you provide a three D model. From that three D model, you write a prompt and can generate images with that, which is pretty cool. Then we have extend image. That's basically out painting. You provide an image and then the tool allows you to extend the boundaries. Then if we scroll down here, there's a bunch of other tools. We have the personalized results. It generates images based on your own object or style. I think this is very useful, especially if you're working with specific style and you want to create in that same style. Right now, it's a bit challenging to do that. And I believe that Adobe Firefly plans to make that available and allow you to do that quickly. Then we have this text to vector. It generates editable vectors from a detailed text description. I think this is very innovative because what right now we have is text to image, we get the Pec image. If we want to make a vector out of that, we have to convert that Pec image and make SVG or other vector format using other tools like we tried using Vectorize to convert our image to vector. If we're making logo, how wonderful it would be to just use this tool, write our prompt for a logo, and create a vector right away that will save a lot of time and especially that you can edit it right away. That also will be very helpful. Then we have this text to pattern that allows to generate seamlessly tiling patterns for a detailed text description. Then we have text to brush. Here we have new models that allow us to create something else from the prompt, not only image the Pac image, but here we can create vector. We can create patent or even brush that can be used in Photoshop to add to our own designs or artwork. I think that's pretty cool. Then we have the sketch to image, you provide your sketch, and I will make an actual image out of that. There are actually already platforms that allow to do that. This is nothing new, but it would be fun to see it on Adobe Firefly. And then we have text to template. It generates editable template from a detailed text description. Okay, That will be very helpful when you do a website design. You can write your prompt and immediately be able to change the text and manipulate this template the way you want, which is amazing. Okay, so here are some future tools that hopefully will be available to us. Okay, but right now we have these four tools that are also very interesting. We have the text to image, generative fail text effects and generative pre color. That will go on and try these tools. 64. Adobe Firefly Text to Image - Portrait & Logo: Let's begin by actually checking out what kind of images we can generate with firefly. And for that we can go to gallery and see some of the successful images that were generated by the community. Okay, so here we have some text effects. Some three D renders, water color paintings. We have some faces here. This one looks very realistic here. Okay, I, overall, I can see that the images are pretty good. Another thing that I'm noticing is that the prompt is very short in order, for example a key origami, rose, flower holographic, close up digital in order to get a good image. It doesn't require you to have a long prompt. Unlike stable diffusion, let's say let's try to generate some images ourselves here. If we go back and click on this text image generator, let's generate images here. Basically, there are no settings here. We can paste our prompt here and just click Generate. Let's do that. I'll be using the same prompt as usually. Okay, we have this professional portrait photograph of young British woman and all those stylizI'm. Sure if we need those styles here, but let's keep them. Let's see, Click Generate. As you can see, it automatically placed it in the art category, the Quantum type. It placed it in art. Let's check those images. Okay, the images are not grad. Actually, if we go to the fireflies gallery, I didn't find a lot of portraits probably. That's why maybe portrait is a limitation, as we can see here, compared to mid journey or other stable diffusion models. That's definitely a worse quality here. Okay, can we make it better? Let's try it out in terms of quantum type instead of art. I'll choose photo here. You can choose different styles here. There's quite a few synth wave science fiction. I've seen that the pop art image was pretty good, but it's not going to give us the photo realistic style that we're looking for. Okay, I'll just remove some of those, stylist, keep it very simple. Blurry, rainy city background. Let's generate. I'm not going to choose science fiction here. You can choose the style. You can also choose color and tone, lighting and composition. For color and tone, we have those different colors. We have black and white, warm tone, cool tone, pastel color, and so on. For lighting, we have some options from back lighting, dramatic lighting, low lighting, studio lighting here, composition, blurry background, close up, white angle, and so on. Here we can choose blurry background. We can choose close up. So here you can only choose one category, because I already say blurry background. I don't see why should we add that again. Let's choose close up. Let's try this. Let's, let's see. This image is not too bad. Well, actually it has very natural expression here and facial features. I don't see any artifacts. However, the other ones here, we definitely, especially like hairs here, there is a problem here. And the other one, I think here, her face is a bit distorted just we have this cut here overall, the only good image was this one. I can download it when I open it, this is my image. It has this Adobe firefly, just so you know. Let's try with other prompts here. Let's go back this time. I want to use Logo here. From the images I saw from the gallery. I think logo would be very successful here. We have really great images of illustrations and geometric, geometric images here. I think logo would be really good here. Let's try it out. I have the line logo of Cup Ce with a tear and top clean line, simple shape, minimalist vector. Let's generate again, it shows the art style we want the graphic, I'll change it to graphic now. Here I want to change the style. Here we have some effects like iridescent, dark isometric materials, clay, origami. We even have line, you can treat the art cartoon vector. Look, here we have a line drawing. I'll choose the line drawing here for lighting and composition. I'm not going to change that, but I will change the color in tone. Here, I want pastel colors. Okay. Now let's click Generate. Okay. Here, if you noticed that the images, these images are very similar to our previous images. If I go back and forward, there may be some new elements, but overall they are almost the same images, y here, if we go back, I've changed here the style, I've changed the color and tone. And that's it. If you change the style, color and tone, lighting or composition, or even add something to your prompt. Let's add something like color. See how the button changed from Refresh to January. If you go back, it was refresh. If I modify, my prompt will add style, color, lighting, or composition. If I add color, then see how it changed the button when the button is generated, then we will only make variation of these images. It's going to be very similar image, but some elements of it can change color, light, whatever you want to change will change on that image, but the overall decomposition will stay the same. That's when it's generate. However, go back here. If you click refresh, then it's going to give you a totally new badge of images. Let's click on that. Now we've got a completely new batch of images. Now if we change the style, let's put our line drawing, then our color Pastor color. Let's click Generate. We'll have similar images, just the variation of them. This was the previous badge, these are the variations of that. If you don't like these images, then you need to click refresh, and try new images. However, let's say you like a specific image, but you don't like lighting or you don't like the color choice, then you just need to change the color to whatever you want, like warm tone. Now it's going to give you the variation of that image, which is quite similar to using seed in stable diffusion. Let's actually refresh and I'll change that to pastel colors. Okay, This one is not pad actually change maybe the lighting a little bit. Golden hour, let's generate, it's too yellowish. Okay. Then let's remove the golden hour, keep the pastel color. Let's generate again. Okay, this is a little bit better. Let's try our other logo. We have a tree inside of a water droplet. I'll remove the line drawing and I'll paste my prompt here. And I will remove some details because it might confuse I, contemporary style. I'll just keep it simple tree inside a water drop blood. Let's keep it with spectrographic, one color white background. Okay, let's generate. Okay, this is nice but it has too many details. I want it to be very simple. Let's maybe add simplistic here. In styles, there's also a style called minimalism, and also we can try geometric because that might make the shapes more simple. Okay, let's generate that. Okay, It is now getting a bit more simplistic. I'll play around with a little bit more and then show you the final results after trying a lot of different styles. With this prompt, I found that the simple wire frame, as well as combining that with the golden hour lighting, gave me a really interesting results. I like the color scheme of these images. I've got like this gold color that the golden hour adds to the image and the wire frame gives this intricate details of the image. If you're making illustration, then maybe wireframe would be nice to use. However, the best style for this particular prompt, in my opinion, was not the wire frame but the geometric. When I choose geometric, then I get more simple logos. Let's check this out. Once I put the geometric, now we're getting those very simple shapes that I was looking for. Okay, again, we have this golden hour. We don't have to have it here, but let's just create a few more images for you to see. I think number one is very good and with a little editing, we can make a nice logo out of this. Let's save this. The other images are also quite simple. I like that all of them have simple background. It just takes some patience to try all those different styles until you get the image that you want. In my opinion, the firefly did a way better job with this prompt than even compared to mid journey. Because here, the model that firefly uses, it was trained on more illustrative images like these ones. Like logos, illustrations, geometric images. That's why creating specific images like logos may be easier with this firefly model than compared to other models. Because here you have all the resources to choose what kind of illustrations you want, what kind of logos you want, and you have all those different themes that help you get the art that you want. Although I think the limitation here is that yet you cannot use a specific image for reference in the future. As we've talked about all those new tools that Firefight may implement, then you can train your own style images and generate in the style that you want. 65. Adobe Firefly Text to Image - Illustration, Anime, Landscape and Concept Art: Now let's move on to our next prompt. Three D render, we have our three render of raccoon. Let's see. What I'm noticing is that these images have a lot of texture. Look at the ***** fur and then we have the armchair folds the carpet of those small details. However, I think there's some artifacts with the Pow overall. It's not bad, but it still needs some improvement. Okay, that was the art, let's check out some popular styles. Digital art might work here. And then for color and tone, let's choose warm tone lighting. Let's low lighting. I think it was going to go very well with this night lamp and composition. For now, I will not choose composition here, low lighting and let's generate. I would say that firefly is still not at the level of creating complicated three D renders of animals. Well, some of the images that we saw in the gallery were great, but that's not happening here with the raccoon. Okay, in my opinion, mid journey was way better. Okay, let's move on to the next prompt. It's the illustration. Hopefully here it's going to perform very well. We have the children's book illustration here. I will change the style, make it a graphic style, and let's remove the low lighting and warm tone. Okay, I don't think it captured our artist here because this seemed to be a very different style to the one I've indicated here. Let's actually remove them and just see with our simple product, I see some errors here. Some problems with faces. Okay, let's use graphic and now let's try, maybe not digital art, but something else. Let's do a cartoon. Okay, for color, in tone. Maybe choose vibrant colors. Okay, let's generate. I like the color choice here. The brand colors work really well here. Again, we have problems with facial features. Let's actually, instead of cartoon, let's try the three D art. I'm curious what this will bring. Okay, let's refresh because we're again working with the same images. Okay, I think this is a bit better. Again, something wrong with the nose here when it's definitely has poor rendering of facial features. Another thing I want to try, let's delete the three D or here. There's some materials. There's layered paper, clay, and origami. The images I saw from the gallery, the origami was really nice and layered paper. Let's try layered paper again. We have the vibrant colors and I'll have to click it twice, so we have a different set of images. Let's generate and then click again. Let's click again. This layered paper creates a very interesting style. I actually like this style, but I would have to improve the facial features here. Now let's move on to Anime and see Firefly can produce good images with that style. Let's try that. And here we have graphic for as creation. We can change maybe to portrait. Okay, let's see, we have layered paper chosen. The lad paper reflects the style. Now, instead of layered paper, these images are pretty good actually. Let's change it to cyber punk. Let's remove the laid paper. You can actually combine the styles if you want. For example, if you want to choose more than one style, you can. You can choose laid paper, for example, yarn, metal, and so on. It's going to give you a combination of those different styles, which is pretty cool. Okay. I don't want that laid paper. Let's keep the cyber punk and vibrant colors. Okay, so actually I'll save this image. I think this one is pretty good. Okay, let's generate overall the images I'm getting for this prompt are okay, nothing too spectacular. I would still suggest, if you want to create anime images to use mid journey because it has that specific style. Or you can use the fine tubed stable diffusion model for anime. That would work very well too. But here it's definitely needs some work. Okay, let's move on landscape. Here I have digital art of magnificent medieval castle style. Let's make it fantasy. I'll remove the cyber punk, okay. Maybe not vibrant colors. Let's make the composition, let's make it wide angle for colors, just keep the default color. Okay, let's generate that. Okay, for aspect ratio, let's do wide screen, okay for landscape. We've actually got some stunning images here. Here is a very nice illustration of the castle. The other one here, we have cropped image the same as here, but these two look really good. I like the lighting and just this illustration style that you can use for a fantasy book for landscapes firefly did actually a really good job here. Now let's move on to our final prompt, that concept art. We'll challenge Fire Flight to make some artwork. Here I have the meaning of life, breathtaking, standing, high resolution, highly detailed, inspirational. I'll just keep it as is the only thing. I will remove all those styles for now. I'll remove Fantasy white angle. I've cleared all the styles, now here I want to try using psychedelic style. And here I want the default color, default lighting in default composition for the aspect ratio. Let's choose the default on E the square. Okay? Okay, very interesting. I like the style and the color. So here we have blue and pink and Rg between them, like Yin and Yen. Okay, the interesting fact about these images that all of them have some a circle or a sphere, maybe like a circle of light here, tree the moon and so on. Some of the details are not sharp, it's hard to know what this is. It's probably an artifact, but just the overall composition is pretty cool. Yeah, especially the last one. I don't know what this says, but maybe that's like a tree that makes up a human figure, I'm not sure, but looks cool. Definitely the psychedelic style is interesting. Let's actually try a few more or we can even combine psychedelic with science fiction and let's see what will happen. I like these images way more than the first batch here. Especially like here, it looks like maybe a mirror with the road somewhere. Then the moons. Some surreal art. Yeah, definitely surreal. Maybe some ally here with the eyes and this figure. Here we go. We've tried different prompts. We've tested out the firefly model and we found out that firefly is not best with portraits or facial features. When we did the, the photograph, the realistic photograph, the features were distorted. And then when we tried with the girl riding a bike, that also had some problems if you're using some characters firefly right now as is may not be the best tool, but for things like logos and illustrations, it might be the tool for you see for yourself and try it out. 66. Generative Fill - Logo Editing: Let's move on to our next tool, generative Fill. Here, it can remove objects and paint in new ones from tech description. Let's try it out. Okay, we can try some examples where we can upload our image. Let's upload our image first. Let's actually start with the logo. Here on the left panel, we have Insert and we have removed. Depending if you want to clear the area and insert something different. With your prompt, then you would use the insert. If you just want to remove a specific element or text, then let's choose Remove. Okay, here, I want to remove this text here. Now I can clear this up and then just click remove. Okay. Here on the first image it kept created a different text, I'm not sure why, but on the other ones, it nicely removed the text. On the second and third, the same thing here. On the fourth one, it added some text. Maybe it thought that it should have the text here. Let's keep this one. Another thing is if you want to insert a different text, then we can try to add it here. Let's say now again in this area, I want some text. What I'll do is, again, choose the area where I want the text to be. Here you can describe the image that you want to create. You can only use English for N L. Here I want the text. I'll put text. Okay, let's generate. As you can see it tried to put some of the letters but not quite. In all of these, we see that there are certain problems. The thing here, let's cancel this. We have this settings, we can adjust settings to help us with this prompt. The first one is mat shape, and that's basically matches the image to your selections shape. Basically, this is my selection shape. If I want the image to be exactly as my selection shape, then I would conform. If I'm more flexible where the image would be, then I can put the free form. In most cases, the free form would be more useful because with this tool, it's harder to make very clear borders. The free form would work better here. The next setting here is preserve content. That allows you to choose how much of original content will be kept in the generated image. If you want to keep some of the original image in the selected area, then you can drag this slide bar to the left. If you don't want to see any of the original image, then you should have this slide bar to the right. It will have only the new content. I think an important one here as well is guidance strength that determines how closely the generated content keeps to the prompt. If you want the generated image to be more closely to the original image, then you can keep this slide bar to the left. However, if whatever you write in your prompt, you want the image to reflect more of your prompt, then you should move the slide bar to the right. Usually it's nice. I mean, it really depends on what you're doing, but I like to have it in the middle or more to the pro to the right here. Okay. In this case we don't want any of the original image. We want it to be as close to the prompt as possible. I will move that to the right here. That's the maximum here. Now let's generate and see if there will be any differences. The images we've got here. I think these are a bit better. It's more closely to our text bakery. However, if we look at a few others, still it makes mistakes. I would say that this model is still not advanced in determining and generating text images. Okay, for now, let's not use it, let's cancel it and just clear. Okay. Now let's try something different. We have our logo that we've generated with Firefly here. We can improve it a little bit here. I don't like those lines in the tree. I want to make it more simple. I will erase it. I'll raise the part I've raised, the area that I don't like here. I will write logo geometric, simple shape tree. Okay, again, let's change some settings here. We don't want to have an original content, so I'll have a new content. And then the guidance strength, I want to have it aligned with the prompt. Let's make it maybe somewhere in the middle and see if it will work. If it doesn't, then we will make it even more towards the prompt. Okay, let's generate that. Okay, added some colors here. Let's try a few more images. Well, this one is interesting. Let's see other ones. This looks also pretty good, but it has a few lines right now, I will actually make it a little bit more clean because we have a few things left. I will remove these things to give it more space for creativity. I'll remove these ones. I'll just keep the branches and remove all of the top of the tree. Okay, here I will also add simple color or two color. We have a basic now let's move the guidance towards the probed here. Let's generate. Okay, let's see others. This one is pretty simple. This looks geometric. Actually, This one is not that bad. I think we have the winner here. I like this one the most. Maybe I'm not the fan of the colors, but the shapes look really good here. Let's check the other ones. This is not bad as well. Okay. But my favorite one is this one. Let's keep it, then I will show you how we can change the color scheme of this logo. Okay. A cool thing here is that you can also replace the background. It's pretty simple. You just click on this background button and it's going to remove the background automatically. It will cut out your object here. Now we can create a different background. For example, back white plan, one color background here. We want it more closely to the prompt. We want it to be aligned with the prompt. Let's move it to the maximum here, let's generate. Okay, we did get plain one color background. It's white, but that's okay. As long as it's plain, it would be easy to remove. Now we have some yellowish, grayish and dark blue. I think this one looks the best here. So I will keep this one. Okay. Now you can download your image and as you can see here, it's going to have this water mark as well. Okay, now what we can do is actually I have another logo that I want to edit and that was from Mid Journey. I think you remember the logo. We have some problems here. I want to see if we can improve them with Firefly. First of all, I don't like the tree trunk tree roots here. I will remove them. I think he remove choice is better. So I'll just select those things. I want them removed here. Let's click Remove. And we've got four different selections how we want it to be removed. I think the best one is number two, but let's see if there are any other options. Yeah, I think this is the best one. Now, let's work on, let's keep this, let's work on this area here. Here, I think I will insert. I will delete all of these things here. I made a mistake here. If you make a mistake, you can always click on the subtract and choose the area that you do not want to remove. I do want to remove this area now with this tool, I corrected my mistake here. Let's go back to the add and select more of this area. Now let's write our prompt. We'll just keep it simple tree logo. Let's see the settings. We don't want any content, we want the guidance strength to be yeah, somewhat close to the original image though it repeats the pattern of the original image. Let's try that. Okay, we've got some extension of the tree. I don't think it matches quite well but it's really try to stick to the same style. I think it adds a lot of details. I'll just use the remove button. I'll cancel, cancel, and I will move to remove here. I will remove, as you can see it now, whatever it proposes, it gives me better results than the Insert button. Okay, I think this one is quite simple here. Yeah, I like this 11 problem is the things, but it's easy to remove. I'll keep it. I will remove this area. Let's remove it here. It nicely cleanly removed it. I'll keep this image now. As you can see, it's getting better here. I think I want to change the shape. Maybe here I will try to insert tree logo. We've got this bird here that actually looks quite fun. Let's actually keep the bird. I'll keep it here, but I'll this part here. I'll choose remove, quick remove. Let's keep this one. Okay, now we've got our dream. Let's work a little bit on this reflection here. Now we have nice outline here. Okay, the last thing to do here is let's keep this. I want the background to be different. I'll again click on the background. If you remove the background, it's just give you some random background ideas. Let's just see what it comes up with. As you can see, it generated quite random backgrounds, but I don't want that. Instead of remove, I'll go to Insert here. I want to specify the background that I want. I want one color plain background. Let's generate that. Okay, didn't quite understood me. Let's go back and one color plain background here in the settings. Let's move it to the prompt. One color orange, plain background. Okay? It actually extended our logo a little bit here. I think this is the best one. Okay, let's keep that in the same firefly. I'll show you how we can change the color scheme. 67. Generative Fill - Portrait & Product Photo: Okay, now let's go back and work on some other images. Here are some sample images, and let's try to work with this lady here again. You can remove the background. Another thing you can do is you can invert, you can delete the subject or the image and keep the background. Let's say if you want a different person, you can put maybe a guy in a blue jacket. And then let's generate that. We've changed the person in this image. I don't think it did a good job here because we have all these artifacts that it tried to incorporate the outline from the previous image. It added some strange wires and the background effect. Not sure what this is. Okay. I don't want to keep that. Let's let's go back here. Not only you can change the subject, the background, but you can also modify some elements. For example, I don't like this orange jacket. I can go ahead and change that. Instead of the orange jacket, I want a stylish yellow jacket for the settings. Let's put a little bit towards the prompt. Let's generate. Okay, this is quite nice. That looks quite natural here. Let's see other images. Yeah, not bad at all. Has even some logo and hot cuts. I like the first one more. Okay, let's keep that. Now what I want to do is I want to change the background instead of this cafe background. Let's remove it and put her in a completely different setting on a mountain top background. I want it with sunset view. You can see that the background, it doesn't quite match with the person here. The big issue is the lighting. The lighting does feel wrong here. If you were to post this image, I would say this is clearly a Photoshopped image. The lighting has a big effect. However, if we cancel and try a different background, maybe let's, let's put rainy city, street background. I want it to be blurry. Let's put blurry, blurry. Again, with these backgrounds, it doesn't feel natural. Let's try a different image now instead of the person. Let's try our product. Let's go back and let's plod the image of our sneakers and see how here we can replace the background. I've uploaded the PNG image, it doesn't have the background, but now we can just insert the background. Let's click the background, and let's put sneakers placed on the top of the snowy mountain. And let's generate. As you can see here, it was nicely placed on the grass or the rock with snow. I think number three is the best here. Let's try a few others. Let's cancel and instead of the Snowy Mountain, let's put some fun background, maybe color splash, studio background. Okay, actually here we've got nice shades. It integrated the product with the background or this one is cool. I like the splash. Let's try a few more and see if it comes up with more interesting results. This one is very contemporary splash here. I like this one. Let's keep this one. I will say that here again, we see this water mark, but the quality is pretty good. It has a high resolution compared to even the creator kit. As you can see, this generative fill is a very useful tool. You can use it for logos, for portraits, remove, replace certain object elements that you'd like or you want to improve. Or you can replace backgrounds for products that would work here very well as well. There's so many things that you can do with general. Clearly, it's free. Why not test it out? 68. Text Effects: Now let's move to our next to text effects. This is the tool unique to Firefly. I didn't see anything like this on other platforms. It generates really cool typography in the gallery. Here are some works generated with this tool. Here we have fur wires, popcorn that we have, moss, gold dripping paves can make those cool effects. Okay, let's try it out here. You can enter the text, what you want to generate. Let's say if you want a letter H, then you can put H. Or if you want to generate, I don't know, high, then you can put high here. Then describe the effect you want to generate, for example, butterflies and flowers. Let's generate. We bought these two letters and I. Here are four different examples. You can choose between different variants here and select the one that you like the most. I think this one is very nice here. Then in the right window, you can choose the font that you like. Right now it uses a Man Pro, but there are other fonts that you can choose from. I don't see anywhere to applaud your own font, but here you can choose between these fonts. And if you're using Chinese characters, here are here. Okay. Now you can also choose the background color. Right now there is no background color. If we download this, let's see, it's going to be a PNG image, it wouldn't have the background, but let's say you want the white background, then you can put the white background. The text color. In my experience, it doesn't do quite much. If you choose, let's say the green color, it would change it a little bit, but here I don't see much of the color. Anyway, it doesn't influence too much because the texture and colors comes mostly from your prot we can keep it default just for curiosity, choose maybe a Chinese character. Now let's use a Chinese character. As you can see, it knows the Chinese characters and it filled with our prompt here, which is really nice, Like the butterflies here. Okay, let's try something different now. Let's try a few more text effects. Let's try, maybe underwater. Let's click January. This is quite fun. So we can see some fishes here. Corals that looks interesting. For example, if you are making illustration for a book and you want the first letter to be a drop cap, let me show you what I mean. Sometimes the books would have this illuminated letter that is a drop cap that has illustrations and ornaments. Then you can use Adobe Firefly to make these letters. As you can see, it makes really cool effects here. Okay, let's try a few more. If you don't write your own text, by default it's going to be firefly now instead of underwater. Another cool one is the jungle. Here you can choose from sample prompt. And the sample prompt actually has that one. I really like that one, the jungle ne. Here we have some jungle Ne inverts. Okay. Another one I want to try with more magical one. If you're maybe creating a fantasy book, then maybe we can use that one. Let's try fairies in a magical forest. Okay, we've got very interesting illustrations here. We've got some mushrooms, a fantasy forest. I don't quite see any, but the design looks very interesting here for the last prompt. Let's try. Here we have the box. I would say that more abstract textures like maybe flowers, leaves, what else is here? Lava wires would work better with the text, because right now with the box, you actually can see a lot of artifacts that don't look too good. That is the tool that allows you to do text effects. 69. Generative Recolor: The next tool is generative recolor. Here you would need to have an SVG image. You cannot upload P or PNG, it has to be SVG. But for that you can always use the vectorizivelready covered that. I'm not going to spend more time here, but what I did, I've changed some of the logos that we've created with Firefly and in Journey, I use the vectorizi to convert them to SVG vectors. Right now I can upload the SVG files here I, let's say this one was the first one I've converted to SVG. Now I can describe the color palette that I want. This is the logo of a tree in a water droplet. Let's change it to blue. I want it to be in blue and green color palette. Let's generate here. It gave me some ideas here. On the right hand side, we've also have sample prompts. Let's see some of ideas here. Dark blue, mid color palette, yellow submarine. Then we have Terracotta desert. Then we can choose the color scheme in harmony. There is default complementary colors, analogous colors, triad split, complementary, and square. For mine, I want colors to be very similar to each other. I will choose analogous here. I can actually choose what colors, blue, purple, and light purple. Okay, Now the image starts to look more on what I want here. I think the number two looks really as. Actually I'm going to say that one. Let's just see what other Sambal products that we can use. We're using the tract desert. Let's see, faded Emerald City, lavender, storm, summer by the sea. Well, right now they all look quite similar. And the driving factor here is the analogous color scheme. Along with the colors that we chose. Even though we choosing those different effects here, the color scheme does look quite similar. Okay, let's now move into a different vector image. Let's try a different logo that we've used. Now let's upload our logo that we did with Mid Journey, but then we've changed and edited with Firefly. This is our SVG file here. I wanted to be in blue turquoise palette. Okay, let's generate this. Okay, now you see that we've got this turquoise color prevalent in the image. Okay, The only thing I don't like here is that I want this drop to be white. I can change that by choosing the color. Let's choose the white color here. And now we're getting what I want. We have this water drop filled with white. Now, this tree looks really nice on this white background here. This was the blue turquoise, let's say if you've changed your mind and you want to experiment with other prompts, then we can check out and see maybe some other color scheme would be nice here. Let's try salmon sushi, for example. Now we can see that these colors are also pretty cool here together. Let's try the Turquia Desert. These are more pastel colors. Again, blue and greenish color scheme. Lavender store, we have this nice purplish color scheme here. Then we have the summer bite, the sea. All of the colors I think, look really nice together here. And let's say if you are struggling to find the color scheme, then you can use the stool to help you choose the colors that will look good together. Now let's go back here. We have other sample images. For example, you have an artwork, not a logo, then you can also recolor the whole. Here, for example, we have this image here. We can try it out here. It used the soft pastels. Prom's. See if we experiment and change the color scheme. Right now it's default. Let's choose complementary colors. Now we have the complementary colors, red and green, orange and blue, and so on. Let's change it to analogous. For analogous, we can see that the colors are more similar to each other. Let's move to triad. Now here we've got the triadic color combinations. Let's see split complementary. Then we have a square color scheme. Here we can try to refresh and every time it will be different colors depending on what color scheme you want. You can choose that particular color scheme. You can write your prompt and choose the exact colors that you want to be part of the image. That's what generative recolor will do for you. Instead of you going and manually changing each color, It helps you to recolor everything in a totally different color scheme. And let you experiment with different colors, combining different colors together, and so on. So this is it for Adobe Firefly. In this model we've covered a lot. We've talked about text to image generated with fireflies model and we've used different prompts and we've generated interesting images here. Then we've talked about generative fill and we found out that not only you can do in painting, but you can also remove certain objects or elements from the image and also replace the background. It would also work for product photos where you want to change the background. Then we also tried to generate cool text effects. This is a fun tool that allows you to create the illuminated letters. Then we went on to talk about generative recolor, that when you upload your SVG vector image, you can make it in any color scheme that you want and try out different color palettes with that image. For now, these are the only tools that are available on Adobe Firefly. And we can see that there are more tools that are still in development. This is it for this module. In the next module, we will cover another very interesting platform called Runway ML. See you soon. 70. RunwayML Introduction: Hello, in this module we will be covering another very exciting platform, Runway. Runway is an AA video and image editing app. It was founded in 2018, and initially it was launched as Models directory that allowed users to deploy and run machine learning models. Currently, it's a platform that offers multiple AA tools like AA image generated in painting and other image editing tools, but its main focus is on video editing models. Runway was also involved in the development of open source stable diffusion. Along with stability AI, and a few other universities and companies. I actually wanted to find what model do they use right now for AI image generation on their own platform? I couldn't find this information anywhere, but it is likely to be a proprietary model based on stable diffusion. Right now, Runway has also Runway research that partners with universities to research new AI models and publish papers. Then it integrates findings into new products. So to sum up, Runway plays an important role in AA research, especially in areas such as filmmaking and video editing. In this course, we're not covering video editing, but we're going to check out what Runway has to offer us for image editing. Here are the tools that Runway offers for image generation and image editing. The first one is the text to image generator. Let's go to Runway to check it out. Once you sign up, we'll go again to Runway. Under images, you will see image generating tools and image editing tools. Right now we're in this generate images section and the first one is the basic text to image generator that uses their own proprietary model. Then we have image to image generation. The third tool, we haven't seen that in any of our platforms that we've covered so far. This tool allows us to train our own models. It allows to create a custom model, portraits, animals, styles, and more. We will talk about the tool more in future videos. Then we have infinite image. That's basically out painting. So you write your prompt how you want to extend your image. For image editing, we have expand, expand image Is quite similar to this infinite image. What's the difference is that with the expanded image, you actually need to provide the prompt. But with expand image, it automatically expands the image. Then we have frame interpolation. Frame interpolation is a tool that turns a sequence of images into an animated video. If you have a bunch of images, you can upload them and make a video on it. I will also show you some cool ideas, how you can use this tool. Then we have a race and replace. That's basically the same as in painting. Then we have the backdrop mix. This is the same as background replace. We upload the image, it removes the background, and then we can choose a different background, we can generate a new background for red. Then we have image variation. We upload our image and then it creates variations of that image. The next tool is actually very cool, it's add color. If you have black and white photos or images, you can upload them here and it will actually colorize the images here. I'll show you some examples how to use this tool as well. Then the last one here is the upscale image, basically that upscale the image. Okay, then under more there's actually three D. And I wanted to show you this tool. So it's just one tool and it creates a three D texture. From prompt, we'll try that as well in the next video. Rightway does have quite a few tools. We'll check them all out, test them, and also compare some tools to the tools that we've already covered from other platforms. 71. Text to Image Generator: To use runway, the first thing you'll need to do is to go to the website Runway Ml.com and sign up. Once you sign up, you can log in to the platform. Here you'll see a lot of different tools. Here is your profile. Here are AA tools. Here we can see popular AA magic tools. This includes the video editing and image editing. Here, you can also check out some tutorials that were made by Runway on their tools. I also want to mention that here I have the paid plan. Some tools are limited if you don't have a paid plan. If I go to my account here, manage your plan here, you can upgrade your plan. When you sign up, you'll get some free credits, but they run out pretty quickly. So if you want to upgrade, you can buy here and you can update your plan. Here are some plans. You have the free plan, standard and the pro plan. Right now I have the standard plan. Okay? Now let's move on to image generating tools. Here if we go to images and then here we see generate images. The first one is the image generator, let's try that one. Based on my experience, I found that runways image generator is inferior to other platforms that we've already tried. I'm not going to spend too much time here, especially that I think it's quite pricey to generate images with the plan compared to other platforms where it's way, way cheaper per generation. It's quite pricey here. Especially here, I paid $15 and I only got 200 generations, not that much. We even covered the clip drop that gives you 1,500 generations, Way more than this. So keep that in mind. Okay, here, let's just try our first prompt and you'll see what I mean, that it's not that great. Okay, so we have our professional portrait, and then here we can choose the ratio square, white screen, landscape portrait, then resolution. If you're in the free plan, you can only choose the 512. If you have the paid plan, you can choose a higher resolution. Let's, let's choose the regular thousand 80. Then you can choose how many images you want. Let's choose four. Then here we can go to Advanced Settings and choose Style, Futuristic, and so on. So that's the style. Then we can choose the medium here. Canvas, airbrush, graffiti, drawing here, or oil painting here. I will choose photography, then moved, let's put Beautiful. Then we have the prompt weight to remind you. Prompt weight affects how much prompt do you want to be reflected in the image? It determines how much to take the prompt into account in the generation. Higher values may result in more precise results. Lower values will generate more creative outputs. The standard prompt weight value is around seven. Here we have 7.5 that's standard one. Let's actually move it maybe eight, a little bit more here. Okay, it's more aligned with our prompt here. Then we have our set right now. We're not changing the set here. Okay, and that's it. Let's click generator. As you can see here, we've got some poorly branded images here, we've got a lot of artifacts, especially with eyes here again, the portions are distorted. Even though we have this nice and long prompt, we're still getting not great results. This is for portraits, and you'll find that using faces with the current model right now is just not going to give you great results. Let's try something else really quick. Let's try maybe a logo and see if that's going to work. Here we have line logo. Let's change the style. Right now, I will choose digital or minimalism. Let's choose Minimalism Sum, let's choose illustration for mood. Let's keep it to non default one prompt, let's make it higher because this is logo, we don't want it to be too creative. We want to be exactly as our prompt. Let's generate. Okay, we're getting some simple images here. This one is just a circle with a cherry cupcake. Another cupcake and just the cherry. Yet, not my favorite model to work with. But again, let's just do landscape. I think landscape should be fine here. Landscape usually works with any model because it doesn't need precision. It can tolerate lots of mistakes. Let's use digital art of our medieval castle. Then here again, we have four images as output. Let's pump our resolution to two K. Then for the style, let's fantasy, for medium, let's make it into oil painting. Here's our oil painting and mood. Does Epic. Oh yeah, it does have Epic. Let's use Epic. Okay, prompt weight. We can now make it smaller, maybe six. It can make some creative images here. Okay, let's generate it here. Let's check out those images. It's okay, it's cropped here, that's not good. But other ones, we're getting shapes here that doesn't look like castle here. Okay. So it's a little bit messed up still. Here we have something flying in the air. This is definitely not advanced model compared to other platforms. Okay. But it does have cool tools. Let's check out other tools. 72. Train your Own Generator: So let's actually cover the train your own generator first because this is a pretty fun tool to use. Okay, here you can train your own model. So you'll basically need to upload similar images of the same person, animal, object or style and that basically teaches AI about that person, animal, or object. So the next time you generate images, you can actually generate images with your own face or with your pet, or with some with a certain object that you want. That's how it's used here. You can choose from portrait generator, animal generator or custom generator. Let's say if you want to generate images of yourself, then you can use train a portrait generator here. Just click here here, it's a paid feature. But if you have a paid plan, you have one free training. We can say train portrait generator. Okay, here you will need to upload images of yourself and it should be 15 to 30 images of a face with different backgrounds. You can't redo this later, so choose with care. This is very important to make sure that the images you choose are high quality. That will help AI learn your face better and the images you'll get will be better. Here you can check out some image examples. Here are some selfies and as you can see, different backgrounds. I would not recommend using cropped images because that will affect the output images. This is the input. If we check the output, you'll see that some images will be cropped and that possibly because the input has cropped images. These are the images that you give AI for training. So make sure that they're good. Okay. You would also want to make sure that images are cropped in square one to one for best results. Of course, avoid inappropriate images. Okay, let's upload some images, for example this one. This one. For this image, it's not a square, I would have to make it into a square. And then upload it. Right now these two are squares and then we'll need to make that square as well. Okay, here with Mac, I can use Preview for example, but you can use any other tool here. I can choose a square, let's make sure that it matches. Now we have a square. Can move it a little bit, now Can crop okay, and safe. Now it's a square and I can upload it on, you'll find 15 images. You'll upload them here. And then you'll click Generate after half an hour. And so you'll have a new model. Let's get into the model that I was able to generate with the images. Okay. After waiting some time, I was trained on my images and here are the images that it generated. We've got different styles, for example, black and white pastel colors, and so on. Illustration fantasy, my only concern is that it didn't capture my eye color or even hair color. There's a lot of black hair here, but it captured my nose here. Overall, I think that it did not a bad job in training and generating images that look somewhat like me. But honestly, there are other tools that, in my opinion, train models way better and cheaper. But we'll also talk about that in the next few modules. So make sure you check those modules out too. Okay, now we have these images. We can actually make more images. Let's say you like this style and you want to generate more images in this style, you can. Okay, for that you need to go back and you need to go to generate images. Here, you can choose the text image generator. Here. You just need to write your prompt, what you want to see. Let's say pop art and then I'll describe myself. So young woman with curly blond hair and I want to get colorful. So I'll just put colorful here. You can choose default or you can choose something else here. If you have trained a model, you'll see the name of the model. For some reason I've decided to name my model. I'll just choose K, K and see how it added here. Now it would know that it needs to generate images with my face on them. Okay, let's click Generate. Here I have low resolution, let's change that. I'll change the resolution to, let's put two K, the number of outputs, Four then here. Okay, you see it used, in a way, my facial features to create this image. Okay, let's then change the advanced settings for style. Let's choose Pop Art here, and then Mood. Let's make it beautiful. Okay, and the prompt weight. Let's keep the default one. And let's click Generate. Again, as you can see it, try to integrate my facial features here even though it's not quite successful. But again, in the future module, I will show you a platform where you can also train and generate images of yourself, an animal, or of objects or styles. It is my opinion that platform is more successful then what we have here, but let's try one more time and let's move on to other tools instead of port pop art. Let's actually here we have comic art here in the prompt. I'll just add also comic book. I'll leave everything here and I keep my model chosen here. Then medium for mood maybe. Let's choose colorful. Let's generate here. We've got some comic books here. We have a lot of images on this one. Let's expand again. The facial features are totally wrong. Let's see other ones. This one looks like a human, but still the proportions are bad. This one is, I think, the best out of all of the images that we've generated. But again, still lot of artifacts here. Let's say we'll save it in downloads. This is the image and that's supposed to be a two K image. Okay, let's go back here now. We've tried using text to image generated with our own model. The results weren't great, but now you know the concept and I'm going to show you other platforms where you can also train your model and generate images based on your model. 73. Image to Image and Infinite Image: Now let's move on to our next tool, which is image to image generator. And let's try it out. Here we have our ballerina. Okay, here we have the image. Let's put our prompt. Let's ballerina dancing in a magical forest. And then I want trees and flowers in the background. Let's put it beautiful. Okay, so here again, I want four outputs. Resolution. You have the H, D, or D, so let's try it out. Okay, here it got the faces terribly wrong and we have an extra limp here. There's actually no way you can write the negative prompt. Okay, then we have number three here. The third one is not that bad, even though the face is really bad. But the proportions, and the proportion is pretty good here. Here again, we have the same bid posture as the original image, but the face is horrendous. Let's actually try it with a different image. Now that we know that it's really bad with faces, let's not use any more faces here. I have a wolf here. I'll just put iridescent wolf magical colors. Again, the resolution is, let's try that. Okay, here we actually got the iridescent effect on the wolf. The wolves proportions are pretty good here compared to human faces. Let's see other ones. Yeah, it captured the fur eyes nose. Then we have the jaw here. This one is the best out of all the images here. This one is pretty good. Okay, another thing we can do with image to image generator, we can actually use our own model. Let's try Apple for example. Here I have a art. I wonder if I can put my face on it. Here we choose this image. Okay, now I'll just put myself a young woman. Let's add the pop art art. A young woman with curly hair. Now let's choose our model. Let's generate. Okay, as you can see, it used the original image and it also captured facial features like a nose, maybe my eyebrows here. Not too bad. Not too bad. This is something that you can do with image to image generator. If you have trained your own model, you can try out those things. Now let's move to our next tool, Infinite Image. Here you can generate a new image. Basically just write your prompt and click Generate, or you can upload your own image. To do that, you would need to click Add Image Pattern, and click Upload from Computer. Here, I prepared some artworks. Let's try this one. This is the great wave of Kanagawa. Okay, here we have our frame. We can move it anywhere where we want to extend the image. For example, I want it to be here. Okay? Now all I need to do is make sure that it overlaps with the original image. Then let's put a wave in the style of wood block print. Okay, this is the wood block print style. Let's try that here We've got different variations. This is the first, second, third, and then I think the best one would be the second. I think the second one is best here. Because now we have this. Can Sky, let's accept it in a similar way, we can move it here. And just change our prompt. A wave here. Let's just put a sky in the style of wood block pred. Here you can see that it added some characters which probably don't make any sense at all. Let's actually the reason is because in the previous generation that we had, we had, let's cancel this. We had some characters. It took the idea from here, but we don't want this because I generates some nonsense. Let's use as we can now erase this part. Okay, let's make the race a little bit bigger. Okay, I think this is a good size, so just remove it now. Hopefully it's not going to generate more characters. Okay, let's try this. Okay, now we have something different here. Maybe I'll go with this one. Okay, here we've got some characters. Again, let's use this one, Okay? Now you can also add more images if you want similar as Ali, you can add multiple images. However, the difference is you cannot move the original image around. In order to upload the image in the correct position, you have to move this frame in the place where you want your image to start. If you want to start the image here, then you need to move this frame here. If you want the image to start here, then you need to move this frame here. Okay, let's upload our other image. Let's put it maybe here. Okay? And then again, add image from computer. Here I have the artwork by Salvador Dali. Okay, now we have this image here. Again, if you change the mind and you don't want the image to be here, unfortunately you cannot move it around. You would have to go back to coma or control Z and then upload it. Move the frame again and upload the image again. Okay, so that's a bit of inconvenience now, let's try to merge those two images together. Let's erase some parts of it, for example. Let's erase those sharp edges here. Okay, now maybe here as well. Here. Okay, now let's move our frame somewhere where it overlaps with the two images here. Let's try this. Let's put a way that becomes a scarf, and then here I'll put surrealism. Okay, let's change the setting, maybe for prompt weight. Let's keep that somewhere in this default one. Okay, let's generate. Okay, let's see here. I think it merged pretty well. Here we are coming from this woodblock print to surrealism, and here are some variations. And it also added the C in the background here. Okay, maybe the first one was pretty good. Yeah, I think the first one is the best here. Let's accept that and I'll quickly do some more. Okay, here is the final result of merging those two images together. I added a few frames here, and in the bottom here, I think overall, it's pretty good here. If you want to save the image, then you can head down to this button and click download. Okay, we're done again, if you want to undo or redo, you can use these buttons or you can use the keyword control or command team. 74. Image Expansion: Okay, let's check out some other tools. Let's go back to Generate images here. We've tried all the tools here. Now let's move on to Edit Images. And the first one is expand images. That's very similar to the infinite images that we've already done, but with a slight difference. Okay, let's check out the difference here. We just need to upload the image. Let's use the same image. Let's use the wave. Okay, here we have our image on the right. We have settings here, we can choose scale. It's basically how much we want to zoom out. Right now it's one x, that's the original image. If we want to zoom out a little bit, we can choose 0.75 x, and even further is 0.5 x. Let's keep it maybe. Let's first choose 0.75 x for aspect ratio. We can also change the Ascra ratio, but let's keep the original one. Okay, You can also write the prompt if you want here. It actually automatically generated the prompt based on our image and right now it's correct. So it's a Japanese painting, so maybe it's sort of painting but woodblock print showing the great wave of Kanagawa, Japan. As you can automatically does the prompt, so you don't need to write it yourself. You can if you want. Okay, let's that. Okay, here are our zoomed out image. As you can see it in all of these images, it added some characters which we do not want. If you don't see certain things like characters, then we should have prepared this image beforehand and removed the characters. Okay, And there is actually a way to do that here in Runway. So if we go back here, it images, there is a race and replace. We can just add our image here. Now we can use this eraser to remove the part that we don't want. Then I'll just put a sky wood block print and then maybe Japan, Japanese style. Okay, let's generate that. Okay, here it added some more characters. Okay, I don't want to see any characters actually now because it takes the information from the original image. I'll move my prompt weight to maybe 26 or 20. Let's move it too high, maybe 20. It's more aligned with our prompt. So it's like maybe I'll put simple Simple Sky. And I'll use more of this area around. Okay, Simple yellow sky woodblock print Japanese style. Let's remove Japanese style so it doesn't add Japanese or Chinese characters. Okay, let's try that. Okay, let's see. Okay, the second one is a bit better, still not something that I really want. For this case, I might go and use the clip drop to remove because here there is no removing clean up tool that just nicely removes the thing like it must add something to it. Actually, I'll just go to clip drop here. In the clean up, I will upload my image here. I don't want to see these things. Let's clean. Okay. Beautiful. That's all I need. Okay, let's download it, save it and go back. This one, this is the runaway. This is okay but see it added some yellowish pinks that don't work with the style. The clip drop version was way better and faster. Let's use that for the expand image. Now I can choose the clean image. Okay, here I don't have any characters. Hopefully it when we zoom out, it's not going to add those characters here. Okay, let's use the scale 0.5 x prompt painting, that's showing waves in the ocean here instead of painting. Output block print. Okay, let's generate. Okay. As you can see right now, it didn't add any more characters, so we don't have any more mass. Well, added some text here, but for other ones we don't have it. Which is way nicer. This is something that you need to take into account. For example, let's expand this image. And this one is pretty good. We can download this. Okay, let's also try this expand tool with our, the image. So we have our am just let's try it out here. I get a 10.5 X for prompt. It automatically generated a man standing in front of a woman with an angry look. That's not an angry look, but let's remove that. A man standing with a woman beside him, looking back, had another woman. Okay, that's a better description here. Let's generate. Okay. Runway doesn't like my content. It triggered some moderation guidelines. Okay, I don't know, what did I say, but let's remove that and just not use it. Let's see. After a few attempts, it still didn't work. However, a day before when I tried it out myself, it worked perfectly fine. So this is the image that I was able to generate with runway and I chose at that time the best image. And here is the image that we've got with the crop, So you can see some differences here. In my opinion, clip drop did a better job in terms of extending the image. Here we can see a lot of artifacts and especially if it generates more figures, then there is a big problem with faces. That was for runway. Let's just try a different one. Since we cannot do other meme here, let's use the baby. Let's see those extended images. Okay. This one is not bad. The child is in the sand playing. The problem here is these things. So it should be some kind of toys here, just artifacts here. It's just setting down then not sure what this is and then some huge head here. Okay. The best one is the first one, but still I think it needs improvement. Okay. That was the expanded image tool. 75. Frame Interpolation, Erase and Replace: Now let's move to frame interpolation. This is a fun tool that allows you to make your images into videos. For that, I think we should prepare some images. And the best way to pay images for the frame interpolation, if you want to use the generated images, is the rays and replace tool. Let's first use the rays and replace tool. Okay, here we can use a landscape. Okay, so here I have an image of mountains and the sky. I want to make that the sky move. Let's do that here in the ray and replace. I will erase all of the sky and generate new images with the sky. Okay, let's raise all of this. Can now maybe make it smaller here. I'll put photo realistic sky with clouds. Okay, and let's generate. Okay, so we have some new images here. Okay, well maybe this one matches it a little bit, but I think clouds are too heavy. This style just doesn't match the image. Okay, maybe this one, but still it doesn't feel photos. Okay, let's actually improve this. I'm going to erase those little details. Maybe that will help. Okay, so let's move those little details. Okay, so now for the prompt. So I'm going to have footers steak and then I'll put blue sky with clouds. Okay, let's check our setting. Okay, so our prompt weight is ten, let's move it back to 77.5 Okay, And then maybe some clouds. Okay, Okay, this is way better. Let's see. Other ones, we have some sky here. These clouds are better than the previous batch of images. Okay, these ones are all good. Honestly, I'm going to save all of them. I'll download all the images here. Okay. Then I'll click cancel and I'm going to generate one more batch of images. Okay, I think this one is also very good. Okay. Okay, so let's save them all. Okay, now I'll go back and I'll go to images and I'll go to frame interpolation Here. I will upload all the images that we've just downloaded, these ones as well as my original image. Now I will change the sequenced based on how many clouds they have. The fewer ones will go first, maybe, and then more clouds towards the end. I think this is about it. Okay, so maybe I'll move this image here. The sky is more clear. Okay, so now it goes towards the more cloudy sky. Okay, And then for settings, you can choose the clip duration. The default one is usually 10 seconds. We will try different using different images. Sometimes it's nice to have it less time. Then we have also in the Advanced. How much of transition time do you want right now? Let's leave the setting as and let's generate. Okay, Let's see what we've got here. Okay. As you've seen, we've got the skies moving right now. It feels a little bit unnatural because the transition takes very long time. What we can do is we can change the clip duration. Instead of 10 seconds, let's make it into around 5 seconds. Maybe 5 seconds here. Let's generate. Okay, It feels a bit more natural, but still we see those big traditions. If you have more similar images, then that would work better. Let me show you what I also try doing here. I also used only one image and then I and replace Tool to change the background. Let me show you. I used this image with the rays and replace Tool, I replaced those background mountains. Okay. And I've generated a few images like that one here. We've got some, let me re arrange it again. Okay, Maybe something like this. Okay. Now again, clip duration. I usually like it shorter. Four smaller number of images. Let's make it around five. Yeah, five, then let's generate. Okay, let's see here. You can see those mountains moving. And that is a really cool effect that if you want to do that with your photos. Another way you can use frame interpolation is when you make lots of photos. Let me show you here. I have a bunch of photos. When I add all of them together, now I'll change the clip duration to maybe four, also for advanced. Okay, let's use the transition time, 100% first, and then I'll show you how we can change that. Let's generate, let's see here. Here you can see that transition is pretty long. Let's reduce transition time to maybe 20% okay, 22% I'll also move the clip duration to 1 second. 1 second. Here we go. Let's re generate. Okay, let's see. Now this is a little bit better, and that's how you can combine your images into a video. Okay, another cool thing you can do with frame interpolation is changing the subject of the image. Let me show you. Okay, now I need to go to race and replace. Here I have this image of origami on the plain background here. Again, I'll use the race tool to erase this part here, I'll put origami flour. Okay, Let's generate, okay? And here are some examples. For example, this one is knives, okay? Now, instead of origami flour, let's put origami. I think this one is pretty cute. Let's use this one. You can create different objects using this erase and replace tool. And then if we go back to our frame interpolation here, we can upload all those images. Here we have those two. I also have a few more that I did earlier, so for example, one, now you can change the settings. Clip duration. Let's make it maybe 3 seconds. 3.5 for Advanced. Yeah, let's keep the transition time or okay, let's generate. Let's see, here we have a cool effect where one object becomes the other object with this nice transition. These are some things that you can do with frame interplationase, replace tool. 76. Backdrop Remix, Image Variation and Add: Let's move on to our next tool, Backdrop Remix. That's basically a replace background tool. Let's use it here. We can use our sneakers that we've tried with other platforms. Here are our sneakers. Okay, here for settings, you can choose the scale, so you can zoom out a little bit, or you can zoom in. Then you can choose the style, like apartment bakery, or you can choose a custom one. Then there are for the backdrop for the studio, flowers, beach, and so on. I think for this one, let's try to find some mountains. Maybe outdoors, okay? Mountains here, we can put sneakers placed on top of a mountain rock. Okay, let's generate that. Here we've got some images. I think they're very similar to one another. Let's see. Okay, I think this one is pretty good here. It tried to add something to our shoes, which I don't like. Again, it added some layers. Again, what I found is that when you use runway for background replacement, it may add something to your subject or object. What I tried replacing the background myself with the sneakers Here is the result that I've got. That was for Snowy Mountain. As you can see, it added this extra platform to our shoes that was like almost to every single image. This is compared to Adobe Firefly or other tools like Clip Drop that only add the shadows and integrate the product with its surroundings. And that's it. For some reason Runway adds extra details. Let's try it with something else. For example, a chair. Here, it's a PNG image, it doesn't have background, okay. Here I want it with a zoom out, so 0.5 x. And I want it to be in the apartment. And let's put modern style apartment. And here we have cheer inside a modern style apartment. And then standing beside plans. Okay, let's try that. This is the original image. Look at those legs here. When we go and check the images that were generated with Runaway, you can see that on all the images here, it added extra stuff to our product, which is not good here. It added really long legs and so on. Let's expand it here. Definitely see that plants have artifacts and just the whole setting. The objects are not clear. Here is a little bit better, but again, we have those legs, lots of problems here. I wouldn't use backdrop mix to replace background of a product photo because the images that you will get will not be good quality images. I think that backdrop remix tool needs a lot of improvement before it can be useful. Okay, let's go back. Our next tool is image variation here. Let's try to use the same image at the image that was generated with mid journey and the one we've tried with the clip drop here. For settings, we can only choose number of outputs, 123.4 images. Let's choose the highest, it's going to take the most credits, but let's try it out. Let's generate. As you can see on these images, it's basically the same studio, the same design as the image we've uploaded. But the only differences is just small details. For example, the floor tiling that we have, just the colors of the desk, the chair, and then different monitor. And so on. Slight modification of the image, but overall it's very similar composition. Okay, if we compare that to clip drop, here is our original image with Runway, we were able to generate images that have almost identical composition with slight changes of the design texture and so on. However, with Clip Drop, it actually it kept the same color palette, but it rearranged the object here. Here it has the same style, but now the desk is in front of the window. So these are the differences when you use the clip drop and runway for image variation. That's the image variation with runway. Now let's go to the next to add color. Let's try it out. For this, I've prepared some black and white images, let's check them out. The first one is let's use the landscape here. Okay? Basically we just need to click Color. Here we go. Now we have colorized image. We can see mountains white, and then we have foggy hills, and then some dark green colors. That was the landscape. Now let's try to do the old photo of Marilyn Monroe. Let's use that here. Again, colorize it here. It did a pretty good job. Here we have quite natural skin color. Then we have this blond hair color and red lipstick, and then we even have some silver pearls and this old style sofa. However, here we are getting slight green tone. Maybe that's something that should be different here. Okay, but overall, I think it did a great job here. Let's save it now, let's try a little experiment Here I have a color image of a family in the park. In Preview, I've changed it into black and white image. Here is the black and white image. What I want to do is try the color tool here and see how different it will be to the original color image. Let's see. Okay, we've got some colors, but as you can see, the colors are a bit more pale. We do have the green grass, but some colors are quite different to the original image. And let's check out which colors. Okay, here is the original image. On the original image, we have lots of brightness and saturation of colors on the colorized image. Colors are not saturated here. For example, for genes here it's dark blue. Genes here it's almost violet. As well as with his shirt, almost violet color. And look at the girl's T shirt on the original image, it's bright yellow. Here we've got this pale bluish, maybe a little bit purplish and greenish color. As you can see some of the colors it would get spot on grass, maybe the skin color. But some of the colors like this yellow here, it would get wrong just because there's just a variety of colors that would be the same on the black and white image. Also, saturation may also no match. Here are some limitations with using this tool, but otherwise I think this is a useful tool to use, especially if you have some old photos that you would like to colorize. 77. Upscale Image and 3D Texture: Our last tool is upscale image here. I want to try the same images as we've tried with other platforms. Here, I'm going to have my photo in low resolution. Here is the photo here. I can upscale up to four K, However, you'll have to be on the paid plan to be able to upscale two K or four K. Let's upscale to four K here and let's process it. Let's expand the image and let's um, in okay, it has smoothened out some patchy areas and reduced noise. Overall, it enhances the image. However, I think it smoothened out quite a lot on the face. Now it doesn't look photo realistic. That's why in my opinion, clip drop had the best job done in terms of photo up scaling compared to runway or big GPG runway. In my opinion, it over smoothed the area making the photo less potalistic. However, in clip drop, this wasn't a problem. But I encourage you to try out all of them and see which one you like the most. Okay, then let's try our artwork here. Okay, let's now try, this is the meaning of life artwork that we've made with Mid Journey. Okay, and again, let's use the four K and let's process it. Okay, let's expand and let's zoom in. Okay, Again, a pretty good job in terms of improving the resolution of the image. And now if we look at the moon and clouds the human figure, it doesn't look pixilated. Okay. However, if we compare this to other platforms. So this is the original artwork and then here are our three up scalers, Big GPG, clip, drop and Runway. My favorite one was the Big GPG specifically for artwork upscaling. And that's because here, zoom in here. It added those very nice smooth lines that I think fit very well in this artwork. But again, you may prefer a different platform for your artworks. Now let's move on to our next section, three D texture. Here, under more we can find three D. Here we have the tool that allows us to create three D textures from a text prompt. Let's try it out. Okay, here let's write, for example, mossy texture and let's generate. Okay, we've got this three D texture that can be used for games or other visual elements. On the right hand side, we can change a few settings. The first one is we can increase the resolution to 2048 by 2048. Then for the form, we can change cube to sphere. Also we can have just the image. That's just going to give us the two D image. Let's return it to the cube for tiling it actually at the repeated pattern, we feel one surface. For example, here we just have one image. This is one image, let's say. Then when we increase it, let's say to two. Now this image is repeated four times on one surface. Then when we increase it even further, then it's repeated multiple times. Then when we get to 20, that's the maximum here. Okay, let's make it back to one then. We can also adjust the ambient light if we want to make it lighter surfaces. And we can by increasing this ambient light exposure, the default one is around 40. If you don't want to see that some sides are more darker, then you can change the directional light. We can make it zero. Then all of the sides will be with the same lighting. So if I increase it, so there will be no difference, it would not look like it has any shades. Okay, here you can all the acts which include texture displacement and roughness maps. You can download that after downloading, we'll have those four different PNG images. The displacement, then color and roughness. That's basically all that is to the three D texture. This is it for runway. We've covered all the image generators, text to image, image to image. We've learned how to train your own model. And that we also looked into out painting with infinite image. Then we did some image editing with cool tools that Runway has like expand image frame, interpolation, erase and replace backdrop remix that replaces the background image variation, add color, and upscale image. 78. Leonardo.ai Introduction: Hello, hello. In this module we're going to go back into stable diffusion and actually explore more platforms and more advanced features. In this module, we're going to go and talk about Leonardo I, which is an AA image generator based on stable diffusion. Okay, what is Leonardo? It's an image generator. It was developed to as game artists in creating game assets such as characters, environment items, conceptual artwork, and so on. It was founded in 2022 and the company is based in Australia. Okay, so let's talk about some advantages and limitations of the platform for prose. Leonardo actually gives a good amount of free credits, and what's even better is that those credits are updated daily. So you can try the platform out and see if you like it and really understand how it works. Leonardo has also an image gallery. Let's check it out. This is, once you look again to Leonardo, this is what you'll see here. You'll see a bunch of images created by the community. You can even go to this community feed on the left hand side and you'll see all the images here. Let's say you like any of the images, you can mark them with the heart and they will be placed to your personal feed. To the feed, here are all the images that I liked. So this is quite handy because if you want to reuse some of the prompts, for example, you like this image and you want to use this prompt for example. It actually gives you all the tools to do that. For example, to reuse this prompt or do image to image generation very handy. Also, the images you generate with Leonardo are high quality and you can upscale the images and there's different up scalers which we'll talk about. Leonardo also has many stable diffusion models to choose from, which are fine tuned for a specific style or character. If we go to Leonardo here you'll see featured models and you can see some models here are great for three D animation style for illustration cute animal characters, you can find the model that will be best for your image. It also allows to train your own models. If we go back here in the training and datasets, you can upload your images and train your model. Then it also has an AA canvas where you can do out painting. And in painting, for example, if you generate an image, you can quickly go and edit it in the AA canvas. Also, you can generate images privately with any paid plan, which is great because their started plan is very affordable. Now let's talk about some limitations and disadvantages. Perhaps the first one is the wait list. Right now you have to join the wait list first. After a few days, you'll get an e mail saying that you have the access to the platform. If you don't have Leonardo AI account yet, I urge you to sign up right now. So by the time you want to try it out, you have the access. Another thing is that Leonardo has a lot of features which can be overwhelming for new users. However, throughout the course, we've talked a lot about stable diffusion. We've talked about prompt engineering. We've talked about different parameters. It shouldn't be a problem for you then what I found is that it's hard to produce photorealistic images. With Leonardo, it lacks photorealistic models. Even though on Leonardo we have this model called absolute reality. It says that it's a photostic style model, but when I've tried using it, the images have this game character aesthetic rather than the photography style. All the models that I've tried here, I couldn't quite get that photography style images. The last thing, the models that Leonardo has are stable. Diffusion based detailed prompt is important for better results. That's a brief introduction to Leonardo. And in the next two videos, we'll go and explore this platform. 79. Leonardo.ai Overview: To start with Leonardo. You'll need to go to Leonardo. And if you don't have an account yet, then you'll need to click this, get instant access, put your name and email, and you'll get an E mail. Once you get access to the platform, if you already got an e mail that you're white listed, you can click this. Yes, I'm white listed. And then again, once again, this is something you'll see in the front page. So this is home. These are featured models and the gallery on the left hand panel here we have the community feed. This is the images generated by community. If you like any of the images, you can put a heart here and then it's going to get in your personal feed. Okay, then we have personal feed. These are your generations. Then if you follow certain accounts, you will have the images that were generated by that person. Then the liked feed will be all the images that you liked. Then we have the training in dataset. This is where you can train your own model. Then we have this fine tuned models. These are models that you can use to generate your own images. As I was recording the course, the DXL 0.9 model by Stability AI, it became public, now you can use that one as well. There's a bunch of different models here, for example, magic potion, spirit creatures, Christmas stickers, and so on. These are platform models, but they are also community models. And the community models were trained by Leonardo users for specific use. They made it public, so you can also try them as well. If you like any model, you can actually bookmark that model. Just click this. It's going to be in your favorite models. These are the models that I used and liked. Then there is your models here, you'll see all the models that you've trained in Nado. Okay, then we get to user tools and the first one is the AA image generation. This is where you will generate your images or prompt. Let's go back here. Then we have this AA canvas. This is where you can do in painting out painting, but you can also generate images. There is a canvas mode, you can choose the text to image to generate an image. Then we have this texture generation where you can upload a three D model and that will generate a texture from your text prompt and turn it into a three D mesh. Okay, and then for settings here, you can specify your interests, for example, art, architecture, advertising, whatever you want to see in the news feed. And then we have questions and answers, and also some guides to help you with Leonardo tools. Okay, so if we go to questions and answers. So here I want to point out one thing here. It says, can I use the images generated by the platform for commercial purposes? And it says, yes, you can use the images generated by the platform for commercial purposes. This applies to images created by free users two, which is great to know. Okay, now let's check out some pricing. Here I've got 9,500 credits. Here are all the plans offered by Leonardo. Right now, I have this apprentice plan and it gives me a lot of credit. It gives the 8,500 tokens per month. Then there is Artisan Plan which gives you 25,000 tokens per month and then Yest with 60,000 tokens per month. With free generation, currently you get 150 fast generations per day, which is quite a lot If you want just to try out and generate a few images, then with a free plan, you get 30 up scales. And with a pay plan you get more 1,000,705,000.12 thousand. Then background removal, you also get a bunch of that here. The number of jobs you can do in parallel here is only one, with apprentice 5,010 and Maestra 20. Then we have private generations. With a free plan, you cannot make any private generations, but with any other paid plan you then there is priority infrastructure. And I think that refers to the new features that Leonardo offers. Some new features would be only available to paid plan users. Then we have the relaxed generation que, which is not available in a free or apprentice plans, but it's available in artisan or master plans. Here you can check out the plan details and decide for yourself which plan you want to go with. Okay, now let's go back to the platform and let's create some art. 80. Image Generation - Text to Image: Let's generate some images. You can go directly to a image generation or you can first choose a model with which you want to generate images. And there is a bunch of featured models here. By default, when you generate images, you will be generating with this Leonardo Diffusion model, which is a proprietary model developed by Leonardo. But let's say you want to create a cartoon character, then you can use this three D animation style. Or if you're creating a game asset, then you can use that specific model. Let's try it out. Here are the featured models. If you want to see the full list, then go to the fine tuned models in this platform models. Here are all the good models here. Here, for example, we have magic potions. Now in order to create with this model, you basically just click Generate. With this model, it's also good to take note of the resolution because this was, the trading resolution was 512 by 512, it's best to use the same dimensions. Let's try it out here in image dimensions, it's already preset to 512 to 512. I don't need to adjust anything. Okay, let's maybe put something magical. I put a beautiful magic potion containing a galaxy, intricately detailed game acid illustrated. Okay, here we have the magic potion, fine tuned model. Let's out now we've got these magic images. Okay, so on the left hand side, you'll find the settings here. The first one is the number of credits that you have. Right now. I have 9,533 tokens. Then we have the number of images that are being generated every time you have your prompt. Right now I have two. Let's just increase it up to four. Then we have custom Leonardo features which are prompt magic and alchemy, and we'll talk about that a bit later. Then there is public images. If you are on the paid plan, you can generate images privately. Right now, these images are not public, it's turned off. But if you're on a free plan that it would be turned on, all your images will be public and potentially could be seen in a community feed. Okay, we can make it public. Let's say that we have the image dimensions here. You can choose the fixed or you can use the slider to change the dimension. Then we have guidance scale and step count, which are basic parameters in staple diffusion. We step counts, you may not see that as a setting depending on what schedule you choose. By default the schedule is Nado. There'll be no steps for you to manage or change. Of course, you can include a seed number. Again, basic stable diffusion parameter. We will talk about all these settings a little bit later, but right now what I want to do is to just try it out and also tell you a little bit more about prompt generation. Okay, first again, let's choose our model with which we want to generate our images. Select custom model here you can choose between your favorite models, platform models and community models. If you don't have any favorite models yet, you can go to the platform models and choose the one that you like. For example here, here we have the Dream shaper. This model is fine tuned for a portrait illustration style. That's somewhere between fort realistic and computer graphic. Let's, I actually use that one again. Let's generate with this model. Okay, for my prompt, I want something. I want a close up of a cowboy. Because this is a stable diffusion, we need a much longer prompt to generate good images For that, there is actually prompt generation tool that helps you out with your ideas for prompt generation instead of writing in this field here. You need to write in this field. Okay, so let's just copy that up short of cowboy and let's generate some ideas here. We've got some ideas. So the first one is a weathered cowboy. His face illuminated by the setting sun. His hat casting a long shadow across his rugged features. I like that one. Okay, then we have close up hands. No close up boots. His eyes squinting against the bright sun. A hint of smile playing on his lips, very poetic, maybe. Let's try the first one. You can copy that. Just copy or you just click Generate. Let's say I want to add more details here. Let's copy pasted to our held here. Now I will add some Stylize, 64 K and real engine. Okay, we already have this model chosen here. We can use the Leonato style aesthetic or no. If we want, we can also add negative prompt right now. Let's just use our prompt here. Let's click Generate. One more thing, when you use prom generation, make sure when you actually do image generation, you come back to image generation. Because when you click generate here, you would see that nothing happens. But in fact, if we go back to image generation, it already generated images of prompt. Here we've got our cowboy. Let's check it out. He looks very evil. Here I will actually regenerate and see if I like the new batch of images better. Maybe I will put the Cowboys full face eliminated by the setting sun. Okay, let's try that and maybe let's put photo realistic. Okay, let's try that. Okay, this is a bit better. I like the first one and maybe the fourth one. Okay, let's actually work with the fourth one. In the bottom here, you have different features. Here you have different up scalers. Let's try different up scalars and see how they compare. This is, the first one is Creative Upscale, and it says that it can improve images during the upscale process. It will cost us five tokens. Let's do this one simultaneously. I'll upscale with different up scalars. The next up scalar is the alternative. It says use this. If you find the creative upscale is resulting in loss detail, I guess it has more details. Let's use that one then. We have this up scalar. It works well with a focus subject but can end up smoothing out fine details. Maybe if it smooths out the background, it'd be great here, but we'll see. Then the last one is the HD up scalar. And it's a great balance up scalar which retains a good amount of detail and crispness to the image. Let's try that. Okay? Now I believe we have most of the up scalars. The first, let's go back to original image. This is the original, this is the creative upscaled image. You see how it changed certain details. Then we have the alternative upscaled image that kept most of details. And I think that's right now that's more similar to the original image. That's the alternative up scalar. Then we have the HD smooth upscaled image. Supposedly it smoothened out a few things there, and here the last one is the crisp upscaled image. I think that didn't change it, or if we zoom in on the face, maybe that has more of the skin texture. But overall, I think all the scalars are quite similar. Some added more details, maybe some smoothen out, but overall very small changes to the original image. Let's go back to the original image. Now here you have also an option to remove background, Let's do that. Here we can find the snow background image, and here we have it. The only problem, it didn't do a good job with his other shoulder, and it cleaned it up as well. That's not a desired outcome for us, but let's go back to original image then. The next feature is the zoom. That's basically out paint extends this image. Okay, let's try that. Let's see, zoomed image here, it extended the image. I think it's pretty good. The tools that we have for this zoomed image, you can copy to clipboard. You can download it, or you can delete it. Let's close it. These are basically the features that you can use. These are the features that you can use with the generated image. One thing I want to say is that with original image, if we go back, for example, to this one, let's say you have all these options here. But let's say if you upscale it and then upscale version will have only the limited settings. It will have the delete download, copy and also remove a background on the original image, will have all the features. Let's say if you don't like this upscaled image, then you need to go back to original image and then use a different upscale. That's how you do it. 81. Image Generation - Leonardo Parameters: Now let's actually experiment with the Leonardo settings here. So we have this fine tuned model dream shaper. Let's now try using the Leonardo style and see how that compares here. Okay, now for some images, we got something wrong with the hat here again, but these two are pretty similar to the default no style mode. Now let's try to generate the same prompt with Leonardo's alchemy feature. Let's turn it on. Alchemy is Leonardo's custom feature. It's designed to generate high quality two D images. It's available for paid users and you can select from a range of alchemy specific precepts. Let me show you where this is. Once you turn this on here you will see lots of different presets like anime creative dynamic photography. You can choose none. The default one will be dynamic. Also, when you turn alchemy on off On, there is a bunch of other settings that will show up such as high resolution, this boost the output resolution of lead alchemy. High resolution outputs will be somewhat different to the normal resolution outputs due to the diffusion process. It's not going to be the same as up scalers. Okay, you can turn it on or off, Let's keep it off for now. Then we have expanded domain. It increases the creativity range of generated images. When it's off, images are more likely to be aesthetically pleasing but may not be as prompt adherent. I would say that expanded domain is somewhat similar to guidance scale for alchemy feature. Let's keep it on. When we keep it on, then there'll be a greater prompt adherence, but there is the risk of visual artifacts and anomalies. Okay, we'll see if there are more artifacts, then we should turn this off. Then we have contrast boost. This will adjust the dynamic range of your image. The default is one that you may find, reducing, it is helpful depending on your subject matter. Then we have resonance that dictates how much detail is in the image and how prompt adherent it is. Around 13 to 15 is a good balance. Higher numbers will create images that are extremely busy. Right now we're on 15. Let's try out with the default ones and just see how it goes and maybe change it later. These images look way more detailed than the previous batch. Look at all the setting here and the clothing, the belts, jeans and so on. However, I think for the face, let's, I think there is something wrong with the face here. I'm not sure where are those black lines come from? For some reason they are on the other images here, it like maybe shades from the hair, but it doesn't look natural. Now, I would like to change a few settings, but before I do that, I think I'll take the seed. This generation, we have consistent composition and see what would be the differences. Okay, here to take the seed, you click on this three dots here and you can copy the seed. And then if you scroll down on the left panel and click Show Advanced Settings, You can paste the seed here and then use Fixed Seed. Turn on, Okay, now we should the same composition. Now I will change a few things. Contrast bust, it says that the default one is one. Let's try with one. Maybe it's going to give a more dynamic image. Maybe I don't want to edge detail here, I'll turn that off. Let's try again. Maybe we can also change the preset. Right now it's dynamic. Maybe we can choose a three D Render, or Illustration or Creative. Let's use dynamic since we turned on the dynamic one. We got our images and again, we have these lines on the face. I like the second one, but the previous batch was better. I really like this image here. This is not as good, especially here. We have some artifacts and again, the lines on his face that was alchemy feature, Sometimes it works really good by adding lots of details in this image, but from my experience it just adds more artifacts. It really depends on what kind of image you're trying to generate. And of course, you need to play around with all these pre settings here because for some images maybe contrast boost should be like 0.5 or zero different resonance. Or you choose to have expanded domain. That will very much depend on your image. Okay, now let's move to prompt magic. I'll turn alchemy off. You can actually keep them both on, but for simplicity, let's actually turn the alchemy off and then just keep the prompt magic on. What is prompt magic? It's Leonardo's custom under pipeline that has far greater prompt adherence, higher image fidelity, and can improve the output with any chosen model. It increases token cost due to higher GPU overheads. Once it's on, then you'll also have more settings here to change the prompt magic strength. That's how strongly prompt magic influences the output. A higher number means greater influence. Right now it's at 0.4 Let's maybe choose like the highest one here, which is 0.8 Then we also have this high contrast. And high contrast mode will give moody images with more shadows. Turn this total off if you find that outputs with it are too dark for your chosen prop, let's keep that on and just see what's going to happen. Again, I've turned the highest prompt magic strength. We see the influence of prompt magic end. Actually understand what it does here again, I'm using the same scene. Let's generate. Here we got the images in a completely different style. Here again we have cowboy, but now we have a side view. It's way more dark. His face definitely is illuminated only by the sunset. Okay, that's what we wrote in our prompt, we wrote the cowboy, where his face is illuminated by the setting sun. And I think that here and here is great compared to other ones where, for example, with alchemy doesn't seem to be like a sunset. Or maybe this one here we got the sunset. But I think these images have a more dramatic lock to it. Let's now maybe change the settings here instead of high contrast, let's turn it off and see how this will affect the image here. There's definitely less contrast in the image. Again, we have the Cowboys side view and sunset, but now the image is way more light. So it really depends on what you try to achieve. I really like these images with high contrast here. I think it really sets this Wild West atmosphere here. 82. Image Generation - SD Parameters, Schedule & Sampler: Now let's try some different prompt for this. I want to change the model. I'm going to go back and I'm going to go to fine tuned models. Right now I want to use the three D animation style model. When you click on the model, you can see the images that were generated with this model. For example, I can click more. I'll have all the images, for example I like the first one here. I can check out what prompt was used to generate this image, as well as the settings that were used. For example of the resolution guidance scale, sampler presets, prompt magic strength. We can click Remix. It will automatically use some of the settings. It will use the prompt as well as the model and maybe some of the presets. But what I found is that it may not use all of these settings, especially with prompt magic. Let's try it out. Let's use this remix button here. It changed our prompt. It added the fine tuned model three D animation style, leonato style. Actually, let's check out that image settings. Here we have resolution 640 by 832. Here we have 640 by 832. That's correct. Then we have the leonato style. Leonato style. Here they've used the pro version two, they've used high contrast, and the pro magic strength was 0.4 Let's check this out for prompt magic. It's on, but as you can see, it wasn't adjusted to that image. It was left from my cowboy. Prompt Here, I will manually change it, so I'll make it 0.4 I will turn on the high contrast. Okay, I think we've covered all the settings here. The guidance scale is 77. That's correct. Okay, let's try it out. Beautiful. We've got very similar images to the image that I found in the community feed. Okay, now I want to adjust my prompt a little bit. I want to change the setting instead of Bohemian city. I want to make it a Greek city. Here instead of Bohemian. I will change that to Tunic. In tunic and accessories. Let's try this out. The images look very nice. Okay, now I want to explain some more settings for that. I'm going to turn off the prompt magic and let's also make it private generation. Why not? Then let's repeat. What is guidance scale? Well, guidance scale is a parameter in stable diffusion. And basically it determines how much image generation process follows the text prompt. Lower values will give you more creative results, but some things from your prompt may be missing in the image. With higher values, you will have images that are better aligned with your prompt, but it may increase the risk of having artifacts. The default value is around seven. If you use somewhere 6-10 that will also give you good results depending on your image. Then we have the set number you'll find in advanced settings, and then you can use a specific seeded is just an initial input that guides the creation of the image. The same set prompt in parameters will produce the same image with minor variations. Then we have tiling. Tiling creates similar patterns. If you want to create a similar pattern, then you can turn this styling on. For example, here in the prompt, maybe let's put water color lemons here. I will disable this fixed seed. I want a random one. Let's generate, maybe not use this three D animation style. Let's just use the Leonardo Diffusion. Here we have some fun patterns that you can check in the similar pattern checker. That's the tiling parameter. Now let's turn it off, and let's move on to more advanced parameters. Another parameter in stable diffusion is called sampling method in Leonardo, it's called scheduler. Basically, it's an algorithm that guides image generation. But to be more specific, it's actually responsible for carrying out the denoising steps. If you remember how stable diffusion works, it starts with a random noise through a number of denoising steps. It creates this clean, clear image. Well, sampling method is responsible for that denoising step. There are a bunch of different sampling methods and they will produce different image outcomes. Let's look at some of them. Let's take a look. Someone has compiled this chart and shared it. On read it, it gives a really good picture of different samples. Here on the left, we have different samples. On the top here, we have a number of steps. The fewer the steps, then the noisier is the image. The more steps, the more clear and detailed does the image get. The first one is oil. I would say around step 16 we get a nice image here that we have. We get a good image at step eight, then we have LMS. As you can see, it needs more steps to get to our desired outcome that we have LMS D. Then we have Hums Sampler 02:00 P.M. two. Of course there are more different sampling methods, but these ones are the more common ones. As you can see, most of the samplers converge to the same image, except the PM two and the oiler A, which gives a completely different image. Also, as you can see, different number of steps is needed for different samplers to get to the desired image. It can be as low as eight steps. For some it needs to be 32 steps. For some it has to be more than 32 steps. It really depends on the sampling method. The fastest one is oiler. It's usually used as a default sampler for stable diffusion, but the oiler is pretty good and the DDI is also pretty fast. Now let's get back to our Leonardo AI here in the advanced settings. If we open that, we can change the scheduler or sampling method, or sampler. Here at the default on E is Leyendo. Then we have oil, or the full name is Oiler Ancestral. Oiler Discrete DIM, DPM solver, and so on. Let's try using different scheduler. For example, let's use Oiler Ancestral and keep other settings the same. I'm going to use this prompt here by clicking this button, it's going to reload you're prompt in this field, which is quite handy. Also, I want to use the three D animation model here, this one, in order to see the differences between the schedulers. I will keep the fixed seed. Let's try with oiler A. Let's also try with Leonardo. Let's with DDM. Now we've got all our images here. To see what we've generated, you can click on these three dots and view generation info. Here we have the DDM. This one is the D DIM. This one is Leonardo and this one is Oiler. You can see that there are small changes, some images here. We can see that the arm is a bit different here, but everything else very similar. The color of the dress may vary. Using different samplers will result in subtle changes in the images. 83. Image to Image Generation & ControlNet: Now let's move on to explore more settings here we can actually use image to image generation and image propped. We'll try both and we'll see the differences for image to image generation. Let's upload our image here. So let's say this one down here. I can change what's the influence of my image on the generated image. Initial strength and high initial strength will preserve the original image more. Okay, here I have the three D animation style. Let's change it to maybe Dream Shaper, because my input image is more photo realistic. I think something more realistic would work better here. Okay, Dream Shaper version seven. Let's try with initial strength of zero point. Let's use the maximum. Maybe here 0.9 And let's keep the same prompt. Okay, now let's generate here. You can see that it's almost the same as our input image here. But maybe some things were a little bit different, but not much. Maybe the highlight on her cheeks, and that's it. Let's make the initial strength smaller. Let's put 0.5 Let's generate again. Now here we have more differences. The clothing and hair accessories. Also the face looks more as the dream shape or train character. Now let's move it to maybe 0.20 0.24 at 0.24 Now the background is also different. See how the strength of 0.5 the background was kept as on the input image, but with the 0.24 Now we use more of the prompt here. It says Greek City Landmark and we get those columns. Okay, that was the image to image. Let's see how image prompt is different. First of all, image prompt here, you can upload more than one image. For example, here we have this lady here, but I also want to give a background here. I have a Greek town, Santorini. Here let me show you, this is the input image of the lady and this is a picture somewhere in Santorini. I'm putting all those two images here. I have different settings. Here I have image weight and higher values will make the output look more like the reference images. Low values adhere more to the text. I think the 0.7 the default one is pretty good. Then we have this magic string which is bad custom function. Here it says how strongly pro magic influences the output. A higher number means greater influence. Let's keep it in default 0.4 we can make it the minimum is 0.1 and the maximum is 0.8 Yeah, let's keep the default 10.4 Let's see here, we definitely see the influence of the input images On the background, we have the Santorini City, this blue roof in a dome shape. Here in the foreground, we have this lady and her posture is quite similar to the input image here. The difference again, between the image prompt and the image to image is that with image prompt, you can upload many images as well. Use the prompted magic with image to image, you cannot use prompt magic. Now let's move on to our next setting parameter, which is control net. I think there is. Good transition between using the image to image and image prompt. Let me explain. When you upload image here, there is no way you can tell AI. What do you want from this image? Let's say I only want to use the posture of this lady here. I don't care about the background or anything, just the posture or maybe a facial expression. But with image to image, this is very difficult to do because all you can do is affect the image strength, which is the image weight. And you don't know what part of the image will be used in the image generation. For that reason, there is a very useful tool called control net. Control net is extension or add on to stable diffusion that allows to input a reference image to influence a specific attribute of the generated image. It can include a pause, composition, edges, depth, or facial expression. Let me show you, these are different control net models and this is the image input. Different models extract different things from our image input. The first model is called in Leonardo, is called edge to image. What can does, let me show you the candy method extracts the hard edges of the sample image. It's useful for many different types of images, specifically where you want to preserve small details and the general look of an image. The image input was this meme here. Here, what it extracted here is the image that was generated with. Then we have our depth model. The depth method extracts the three D elements of the sample image. It is best suited for complex environments and general composition. Let's check out, see how we have this white in the foreground and a little bit more gray in middle ground and black in the background. Here is the image that was with the depth method. Then we have open pose, and the open post method extracts the human poses of the sample image. It helps tremendously to get the desired shot and composition of your generated characters. Let's check out here, this is what it extracted from the input image. And see how here and now we have totally different background, totally different clothing, but the poses are the same. There are more models for control net, such as Scribble and so on. But for Leonardo, right now we have those three models. If we go to Leonardo, let's switch on control net. Here we can choose post to image, edge to image, and depth to image. Let's try all of them. Here I have the post to image. Make sure you have the image here. As you can see, now there is no initial strength here. We can only change the control net weight. The higher the weight, the more control net will influence the generation output. I think one would work best here because I want to have this exact pose. Let's try it out. Again, I'm using the same prompt. Let's generate, see how we got exactly the same pose, but now the background is different. The woman here is very different. Clothing items are diverse compare to image generation, where we have almost the same lighting to the original image. If the background is different, then not too much from the original image, but here completely different lighting. Only the pose is what has been taken from the original image. The control net gives us so much more control over image to image generation. Now here, let's check out those images first to make sure the proportions are wrong. This one is way better here. This one is good too. Here again, proportions are a bit screwed up. Let's try, instead of post to image, let's use edge to image. It's going to use all the edges from the image here. See how the face is more similar to the original image then compared to the post to image here face is completely different. Then we still have the same hose, but the background is different. And that's because here in the edge to image, we have very detailed information about the girl here, about her facial features. Including her facial features, but for the background, because this image has blurry background. Maybe some small things here with edge to image. Let's say if this background wasn't blurry and we had some nice and sharp buildings in the background, then it would also catch that and will generate something that would have the same shape. Now let's drive our last model, depth to image. Let's try it out to remind you the depth to image model extracts the three D elements from the input image. Now here we got this girl and she is in the same pose as our input image. The face is quite different as well as the background is also completely. For the depth to image model, I think it'll be more fun to have more characters on the image. Let's maybe change the image here. I will app different image, let me show you here is the image that I've applauded. And here we have a girl in a foreground reading a book. On the middle ground we have a different girl. Let's try that one. I will here change the prompt a little bit as well. Beautiful woman in tunic. Let's put sitting on a bench. Let's keep everything else the same here. I didn't pay attention. When I re appload a different image, the control net automatically turns off. This is just the simple image to image generation, let's just check that out. Here are the images. Now let's turn on the control net and use the depth to image. And we'll see how that compares. See how in these images, the position and the relative position between these two characters is the same as the input image compared to the image and image generation where the correct position wasn't captured. That was the depth to image. You can see that there is a huge difference between using control net and just the regular image to image generation. As you can see, Leonato has so many settings that you can adjust to get a desired outcome image. Here we've talked about leads, custom functions, prompt magic and alchemy. We've talked about switching between public and private generation, where you can change image dimensions, you can change the guidance scale. Then if you have a reference image, then you can use the control net with three different models, Post to image, edge to image, and depth to image. Then we have tiling, Then you can use the regular image to image generation. You can use Image Prompt with many images. Then you can use a fixed seed. Then you can change the scheduler, which is the same name as sampling method or sampler. 84. AI Canvas - Outpainting: Now I would like to move on to A, A canvas. This is where you can do in painting and out painting. First of all, here in the canvas mode, you can choose the mode. The first one is text to image. Let's say you want to generate image first, then you can use this one here. Here, you'll have most of the settings that we've already seen in the image generator settings. However, let's say you have generated an image. Let's actually go back here and let's go to image generation Here. I want to find a nice image. Maybe this one can actually send it to the canvas by clicking the square icon. Now it's in our canvas and we can work with it here. Okay, for now, let's change our canvas mode. Since we're not planning to generate any new images, let's use in paint and out paint. On the left hand panel here we have a pan tool that basically allows us to move the canvas around. Then we have this Select tool. This allows us to select this generation frame and move that around. Then we have Draw Mask and Erase Tools as tool. Let's say you want to change something in this image, you can go ahead and erase that. For draw mask, it's very similar. If you want to change something about this image, you just paint over that specific element. What's the difference between rays and draw mask tools? Is that the mask retains some information of the image under the mask. For example, if you want to draw sunglasses and you want to retain maybe the eyes here, then you can use the mask instead of erase tool. Then we have the sketch tool that allows you to draw something. You can draw it on the canvas here or you can draw it on the image on the top here. You can change the colors. You can change the brush size as well. Lastly, we have this upload image. You can upload images from your computer. You can download the artwork. Okay, now let's say I want to make out painting. I want to extend the image. Let's extend it maybe to the bottom here. I will use the Pan tool to move the image bit up. Then I will use the Select tool to move my generation frame. Down here on the right. I have a bunch of other settings here if you want to out paint, make sure this out paint is turned on. If you want to do in painting, then you can turn that off and then you can choose the paint string. But we'll get to that too for now. Let's use the out paint then How many images that you want to generate for the out painting? Let's use four. Then we can set image dimension it's nice to generate with highest size. So I'll change that and see how our generation frame immediately became large. Then we have this render density. Render density decreases the size and increases the pixel density of the generation context. It will decrease the size of this generation frame, but the quality of the image that it will produce will be better. Actually, right now it's one X, maybe. Let's make it 1.5 or two X. See now the size is smaller but the image generate will be higher quality. Then again, we can change the guidance scale. And let's see some more advanced settings here. We can use the fixed seed and we can choose the scheduler for now. Let's keep that as default here, I will use the same prompt as I used to generate this image. Also make sure that the generation frame overlaps with your image here. May move bit up here. For negative prompt, you can also include that by clicking this button here. Let's say forward. And nude. Let's click Generate. Let's check out the images. This one is missing a hand here. We got completely wrong extensions for that reason. Let's actually cancel it to improve the out painting results. There are a couple of things that we can change here. If you've noticed to do the first out painting, we've used stable diffusion 1.5 model. But when I was generating this image, I actually used a fine tuned model called Dream Shap. Let's switch and see if that will improve the out painting results. Dream Shaper seven, that's the one I've used here. I will keep the same prompt. Let's generate, let's check out the results here. Here, again, we're missing the hand here, but I think it's a little bit better matching. Let's see the other ones. Okay, still there is a problem with the arm, but I think the details match way better here As we've changed the model, let's cancel it and help AI a little bit more. I will erase the sharp edge here. I will remove some of the back here and the arm, hopefully that will give it more room to improve. Let's generated again here. Now we have, I think, a better extension of this image. The arm is covered with this fabric, which is not too bad. Let's see the other ones. Yes, same here. This has a bit more problem. Okay, I think the first one was the best. But we can spend a little bit more time here and just try changing a few things in the prompt and in the settings to make it look even better. Let's do that after playing around with the settings. Here is the result that I've got pretty like this image here. This is where I stopped. I've changed the size of the generation frame. I reduced the render density to make this generation frame larger so it covers more of the area. Also, I changed the prompt here a little bit. I added with bracelet on her hand, This is not really reflected here. But then I also increase the guidance scale to eight, which is also a minor change. Then I just try to generate a number of times to get the result that I want. Here are some other images that I've got, but they weren't as good as this one. As you can see, there is still problem with the hand here. But I like how it's covered here. Right now here, it just looks natural like she is covered by this fabric. I will click except here. Now we can also add maybe some out painting to the sides. I will keep this a Greek city landmark background and generate a couple of frames here. Again, I will move my generation frame to the right here. I will use the array tool to remove these edges. Let's this looks good, so I'll just accept it here. So this is a final result after all, the out painting. 85. AI Canvas - Inpainting: Now let's try in painting to switch on to paint. Just turn off this out pin tool here. Now we can change the pain strength. The painting algorithm looks at what already exists in your image to better edit and place new quantum. The higher the strength, the more it will diverge from the original image. The default one is one that's the highest. Let's say if you want to keep some of the content, then you can reduce the in pain strap. But let's strive with the default one here. Now I'm going to zoom in. I want to add a tattoo for that, I'm going to use all the tools here. I'll use the mask, I'll use a race, and I'll use the sketch. So you can see the differences here. I will start with, let's use the rays one first. On the top, I can change the size of my brush, make it smaller. Let's put 20. Okay, I want something here and here in the prompt, I will pour a butterfly tattoo. Okay. So now my pain strength is one. Image dimensions is 1020, 4,024 Rented density is 1.5 x. We can actually make it higher because the area is very small. Let's make it maybe 2.5 x or even higher, 33.5 I think that's good. Okay, again, let's zoom in and use the pen tool to move the canvas in. Let's generate. We have this butterfly tattoo. I'll lick the blue color here. I think this one is the best, because it matches better with this whole image. Let's accept it. Now, let me show you how you can use the sketch tool to generate something that you paint. For example, here I will draw cherries. Here, I will use the red color. By the way, this is a color picker. For example, you want to use this color here. And you just can click on this button and click on the color you want. Now see a match to the color of this place here. Let's change it to red. Think this is a good size. Now I will draw my cherry. I have one, maybe a second one, and I want a stamp. Let's make it green. And let's decrease the brush size. 14, I think that's good. Or even smaller. Put eight, that's good. Here I'll put and a leaf something like that at the bottom. I'll put a cherry tattoo. Now instead of using this in Pain out Pain tool, I will go to the Canvas mode. I will go to Sketch. To Image, I will choose the mode. Once the mode is selected, I will click January. Here we got the images that look like my sketch. Let's see other variations. Okay, again we have those two cherries and leaves. I like the first one here. Let's accept it. Okay, now let's use this mask tool here. I will actually cover this butterfly here. Just to show you what the mask does. Maybe we will cover a little bit more area here. I'll put butterflies tattoo again. I'll switch the mode from sketch to image to paint and out paint here. Let's say we want to preserve some of this butterfly or maybe the shape, then we can make this paint strength smaller, make it maybe 0.8 Let's use that. Let's generate here in place of that old butterfly tattoo. We got these new butterflies. As you can see, the shape of the butterfly and the colors are quite similar to the previous tattoo. Let's see the other ones, again, very similar shape like this one the most. This is how you can use the arrays tool and then the mask tool to change things around a bit. I'll also like how the inside of this butterfly match with her skin tone. It really looks pretty. Let's accept it. Okay, I think this is way better here. Now let's continue and do some more in painting. I will zoom out a little bit to show you the final result. I think the butterfly looks really good here. But for the char is, maybe we should change. But that's for later. Okay, now I'll use the Pan tool to move the canvas. And I'll use the Select tool to move my generation frame. Again, I'll zoom in here. I want to add sunglasses. I'm focusing my frame around her eyes. I can make it even smaller, so maybe four X. I think that's good. Here and here I can choose either to draw a mask or erase, or even sketch. Let's use the mask and I will change the brush here in the prompt output. Fashionable sunglasses. Let's generate the glasses are poorly integrated. Here we have big lines. Let's see the others. This is a bit better. I like how we try to preserve the eyes. Let's generate one more time and see. I think these glasses turned out pretty good, but again, we have this problem here. But we can probably fix it with image to image generation. And let me show you how to do that. Here I will click, except now I will change the canvas mode to image to image. Here, prompt, I will put my old prompt, Master Professional Photography and then beautiful woman in tunic. Here I will delete the tunic and put it in fashionable sunglasses. Let's try it out, by the way, for input strength here, we can also modify this. That's how closely the generation will follow the underlying image. Let's say if you want just small adjustments to the image, then you should make it higher. In our case, it's 0.3 Let's move it higher because I don't want to see a big difference. Maybe let's move it to 0.5 If we have strange results, we'll move it even higher here. As you can see, image to image changed the whole area in this generation frame. Now we see a completely different phase. Let's see the other ones. This one is pretty good. I like the earrings here. The sunglasses now look way better. Maybe the second one was pretty good. Now let's actually cancel that. Let's make the input strength higher and see what will happen. Let's cancel this. Let's make input strength to the highest is 0.8 Let's make it 0.65 or 0.7 let's make it 0.7 Here we see just slight modifications but, but it makes the whole image way better, especially the sun glasses here. Let's see other ones. I think the first one was pretty good, but the only problem is that we don't have this frame here. Actually, let's cancel it, and I'll make the input string for a little bit less 0.6 Let's generate again, let's check the other ones. This one tally has artifacts. Well, I like this one. I'll keep this one with image to image. We turned this to this, which I think is a pretty good result. Okay, now the last step here is to download our artwork. Let's check it out. Okay, so this is our final art work. Okay, with image to image, I can definitely see that we've got this square a little bit out of place. You definitely can see those different pixels here on the canvas. We can actually blend that together using the array tool. I will the edges here. I will make my generation frame higher. I'll reduce the ranger density. That should work. Before I click Generate, make sure to change the mode to paint out paint. Otherwise we'll just get the image to image generation, which I don't want here. Let's use the paint out paint now. As I'm looking at it here, it nicely blended it together. Let's see the other ones, there's like slight changes in the shoulder. I like the last one here I'll accept. And again, see the difference here. Now it's way better. Let's download it again. This is our file Result with Out painting and in painting using Leonato's AI canvas. 86. Leonardo - Training a Model: In the introduction to Leonardo, I've told you that you can actually trade your own model and you can do it here in the training and dataset if you have a free plan. I believe you can train up to one model. But if you are on a paid plan, there are more models that you can train here. You can train your character here. I trained with myself. I trained with a character, I also trained with a style sticker style. Okay, how can you train the dataset? Basically, just click new dataset here. Give a name to your dataset, for example, fun stickers. And then you can give a description for your collection. For example, animal stickers. Then let's create a dataset here. All you need is to drag and drop the images. For example, here I have a few stickers. Let's put them all here. Once you upload all the images that you want, you can click this train model. But before we click this train model button, let's go and check out the requirements and tips for the best model training. This you can find in if you go to the questions and answers help and AI Model Fine tuning. Introduction to model fine tuning. Here is some steps. Here we have what makes a good training number of images. So suggested ideal eight to 15 images, minimum five images, maximum 30 images. Here is more suggestions on variation and consistency. Consistency, It's important that there is a common theme or pattern between your images for the model to learn from. For variation, things that vary across your images will be more loosely learned. That is what allows your model to put your trained object in new kinds of style and context. Let's check some examples. These are bad datasets. As you can see, these are very different styles here. And then we have also repeated images. This is a good data set. We have animals that are quite similar style, different backgrounds, different animals. But that's a healthy variation that helps the model to learn. Let's go back to our training and datasets here again, let's applaud more images. I will applaud all of them. These images I generated with Leonato beforehand. Here we have 13 images. I think they're quite diverse and yet they are in a similar sticker style. Now all we need is to click this train model here. Model name. Let's put stickers then, training resolution. One more thing, when you upload images, make sure that they are squares, otherwise you may get very strange results. Let's go back to our train model, a fun stickers for training resolution. You can choose 512 or 768 by 768 per category. Make sure that it aligns with the characters so it can be a character environment building, fashion illustration photography, if you're doing maybe photos of yourself, for example, that we have product design textures, elements in this one is set somewhere between illustration and graphical elements. Let's use illustration. Then we choose the model stable diffusion, 1.5 or 2.1 I would make sure that the training resolution is consistent between the model. Here you can see 512, 512 here. If you choose 768, then you should probably use the version 2.1 However, it also says here that stable diffusion 1.5 is recommended. This model performs well in general, better with realistic images. Okay, model description animals. I guess one of the most important things here is the instance prompt. This is what you will need to put in your prompt to refer to your model or your character. For example, if you applaud selfies of yourself, you can perhaps put, I don't know your name. Then in your prompt you'll put and then a description such as a girl with curly hair in walking in a park, something like that. Here for example, you can use three letter combinations such as SKS. It doesn't quite matter. Just try not to put something vague here. I'll just put SKS. I'll just start training right now. Training is in progress and it says that they will e mail me when it's complete. Depending on the size of your training model, it may take anywhere between 30 minutes to a few hours. Then you can check the status. But I've trained my stickers. Let's just try it out and see how it works. Again, I will go to AI Image generation here for example, I can put sticker, cartoon cute baby husky, white background, 12 K, high quality HD octane render. Then it's important to choose your model here. You can select custom model in your models. After training, you'll see all the models here. For example, let's choose stickers to just click View and click Generate with this model. Okay, now we have our model selected. And very important, make sure that in your prompt, it doesn't matter where, but you put the instance prompt that you've created. In my case it's SS sticker. And I have to add that so that the model will be generating based on my training images. S sticker, cartoon cute baby sky white background 12, high quality H C octane render. Now here I don't want to see Lana style and I will disable the magic. Let's also make it private because it's based on my model. Then for image dimensions, I've trained it on 768 by 768, this is good. Then the guidance scale, then let's check out the scheduler. Here we have Leonardo. I will actually change it to Oiler Ancestral Steps. Let's use the 30 steps. Okay, let's check this out. I think the results are successful, except for the eyes. I don't like them here, but everything else is very similar to my training data set. We have this sticker, white highlight here and grayish background. If you notice all my images here, they also have grayish background. Okay, that was for the stickers. We can try something else. Maybe not husky, but maybe bulldog, and let's try that. Okay, this is so cute. In a similar way, if you have a specific style that you like working with, you can train a model on that style. And then I use this model to something else, something new. Let's now also try with myself. Here I've created a model called T, which was trained on myself. Here I will click on, I'll click Generate with this model. For the prompt I will, my instance prompt here is a girl. I have to include that here. A girl with curly hair in a part background. And then I'll put soft lighting, high quality, and let's put 64 K for negative prompt. I will also add that. Bad framing and so on, Okay. Now. Here. It was trained with 512 by 512, so the image dimensions is correct. Okay, so let's increase the step count a little bit here. Movie 30, 40. I think that's good. Let's generate. Okay, so that me or the AI generated version of me overall, it's not bad. It captured my hair style, it captured my facial features. My eyebrows, nose, eyes, mouth. However, the quality is not great. We can try to upscale. Let's use maybe creative up scalar after upscaling with all the up scalars. Here are the results. This is the creative up scalar. This is the alternative. This one is the HD smooth. This is the H D crisp up scalar. The best result here was the original image without any up scalars. Because the upscalithermooved out too much, they changed the facial features and now it just looks ugly. The original image was the best here, but again, unfortunately the quality is the best here and up scalers do not help. Of course, we can change and play around with different prompts here and change the settings to play around with guidance scales, step counts, schedulers, and so on. Another thing I want to note is that if we go back to our training and datasets here, we have quite limited training settings. The advanced training settings are not available yet, probably that will definitely change in the future. But for now it's very simple, and for people that want more control over the training, this is very limited. Another limitation here is that when the training is finalized, you cannot download the model and use it with your own stable diffusion. But on the other hand, Leonardo allows to train one model on a free plan, which is a big advantage, and you can try it out and have fun with generating images based on your model. In the next module, we will talk more about model training. And we will actually cover another platform where you can train your model. But right now, let's quickly sum up what we've covered in Leonardo. Well, first we started with home page. Here you'll find the featured models. And we've talked about the gallery. We've also talked about the community feed and the personal feed. Then we went on exploring fine tuned models and using some of that in AA image generation. Then we went on to generate images and we generated a bunch of different characters, images, and other art. Here we've covered all the settings. On the left, we've talked about image generation and prompt generation. We've talked about prompt and negative. Prompt, fine tuned model and different styles. Then we went on exploring the AA canvas here. We've talked about all the tools here. On the left, we've changed the models as well as we looked at different canvas modes, such as in Pines, out panes, image to image, and then sketch to image. We've also tried different settings here as well. Then we went on to talk about training your own model and generating images with that model on this node. I want to finish with Granada module and let you try it yourself. 87. Astria.ai Introduction: Hello, hello. In this module, we will talk about fine tuning stable diffusion models. And we will cover a platform called Asta. Let's check it out. Asi was founded in December 2022. It's a website where you can do high quality fine tuning and image generation with stable diffusion. Here you can train custom models, download them, and generate images using your model or some other stable diffusion models as well. It has very simple interface and it's easy to use. Before we go on and check out Astra, do I, I want to talk a little bit more about fine tuning and what is it. Fine tuned model is a model that's been trained on a specific data set such as a particular art style objects, animals, people. How does it work? Well, a model that's been trained on a wide data set, such as table diffusion version 1.5 which is a general or standard model, is now further trained on a narrower dataset. Why do we need it? Why do we care about fine tuned model? Well, general models are good, but they cannot be good at everything. For example, if you want to generate images in a particular style, such as a specific anime style, it can be very challenging to do with the prompt. Fine tuning allows us to train the model in that art style. When we generate images, we will get them consistently in that style. The fine tuned model will be biased towards generating images similar to your dataset while keeping versatility of the general model. There are a number of fine tuned techniques, one of them is Dream Booth and another one is Laura Dream Boof Technique was developed by Google and Boston University. And it involves injecting a custom subject into the general model. There are many platforms that help you train models using this technique. And this includes Astra. You could also run Dream Booth yourself on a computer or using a cloud service like Google Coap. But what's the advantage of using platforms like Asta is that they already have specific presets that allow you to train your model in just a few clicks. And the results can be way higher quality than personal dream booth training, especially if you're a beginner. That's why using services such as Astra.ai, is a good starting point in fine tuning your models. Let's get started and let me show you Astrid. Ai. Here is Astra's website, Atria. Let's check out the pricing first. I'll click here. Asta doesn't have any free credits or free trial pay as you go in order to train your fine tune your model. It's going to be 1.5 dollar per model. If you want to generate images with your model, then it's $0.10 per prompt. So you'll generate eight images. Let's say if you want to make video with Asta, that's going to be $0.40 per 100 Phrase here, below are some more details. The minimum credit card processing amount is $5 so you cannot put anywhere less than five. Also, models are only saved for 30 days since the moment they become available. However, you can extend model storage and it's going to be $0.50 per model per month. Then also new is not allowed and so on. Then at the top here, we have community gallery. We have lots of community feed. Let's check out the gallery. Maybe here first. Okay. Apparently, before viewing the gallery, you have to sign up. Or again, if you don't have Atrias account, then sign up. It's very easy to do. Just provide email password here. I will again. Okay. Now I'm logged in with my e mail here and let's go to check out the gallery again. In the gallery, you'll find prompts that were made by the astray. I quite like it because as you can see, the prompts are pretty lengthy and they contain also the weights and the negative prompt, which makes the result really good. You can use these prompts when you generate images with your own model and take that as an inspiration. Then there models, these models were fine tuned and made available by the community, so you can also check them out and use them. Then there is video API and examples. Let's check examples. Here you will find what things can be trained with the model. For example, a car here is the original image it was trained and after. Here are the images that were generated pretty good, then there is a hat. Here are the generated images, concepts, different style genres here. And then we have a man also very different or techniques. Then we go to tunes. This is where you'll find all your models. This is my model that I've generated beforehand with the character. Then we'll go to, here is the AA image generator. Here where you write your prompt. Then you choose your model. You can use this basic interface and concrete image. Or you can use the advanced interface where you can add the negative prop change parameters and so on. So this is the overview of Astra's website. In the next video, we will go on fine tuning our model and also trying out to generate images here. 88. Astria - Training a Model: Now let's fine tune our model. Go to tunes. Once you find your account, you can fine tune a new model. Just click this new fine tune here. Give a name to your model such as a. This doesn't matter too much. Then very important, write a class name for your model. It has to align with the subject of your training dataset. Because I'm training this model on myself, I'm going to put a woman, but you can also put like a couple man book, dog, car hat style and so on. Let's use woman here then I'll choose my images. Here are myself here, it says upload 20 images of the subject or anywhere 4-30 images should include subject in different variations, expressions, poses, and backgrounds. Go to your photos. Up and select photos from different types. No nude here. If we scroll down here, there is also a little bit more information here. The images preferably should be cropped to one to one aspect ratio, two square. Then they recommend uploading three photos or full body or entire object. Five medium shot photos from chest up and ten close ups. Then variation is key. Change, body post every picture, use pictures from different days. Backgrounds and lighting show a variety of expressions and emotions. Make sure you capture the subject's eyes looking in different directions for different images. Take one with closed eyes. Every picture of your subject should introduce new info about your subject. Here we've got more details on the generation process here. The fine tune should take around 20 minutes. It can take longer depending on the cue. The models will be saved for 30 days. But there is an extend option. Okay, let's try it out. Here is the basic interface. Can click this Creed. If you want a little bit more control on how you train your model, you can click this advanced button. And now we have more advanced features here. For example, we can select which model we want to use here. We can choose between stable diffusion and 1.52 0.1 open journey two and so on. So here we have a general model, but you can also select a base fine tuned model. It may sound strange, but you can fine tune a fine tuned model Here for example, there is a model that's been trained on a specific style, like let's say photography style. It's still broad enough to train your own subjects with this model, but it's going to give you a specific aesthetic. For example, here we have CB I and I can't believe it's not photography here. If you just Google this model, you can click on this AI. Here is a little bit more information about the model and you can see that the images that are generated with this model are very photo realistic. Let's say you want to train your portraits with this model, you can just make sure, let's close this here. Make sure that the base model here, we have stable division 1.5 is aligned with this model here. So here we've chosen stable division 1.5 and from the information here, the base model is also stable division 1.5 It matches, otherwise you'll get strange results. Okay, now here you can write your own token. This is the same as Instance prompt. This is what you'll put in the prompt to refer to your subject. It can be SKS, it can be whatever. Here, let's use S here. You can also choose the model type. Here is the Dream Booth technique or Laura. Laura right now is in the beta mode right now. Let's keep it in Dream Booth, Fine tuning. Okay, now we have all the information here, The class name, we've uploaded our images. Let's create it. It's now going to upload all my images. The fine tuning has begun. It says here the ETA is around 30 minutes. After waiting about half an hour, our fine tuning is complete now, and this is what you will see. You'll have generated images with this new model using sample prompts. The first one is the '80s portrait of SKS, woman, blond hair, blue eyes. Then we have realistic digital painting here. I think the facial features are, don't quite match, but maybe this one is not bad. I like the number three here. We have magazine cover photo realistic glamor shot of beautiful SKS woman here. So far I like it the most in terms of a facial feature similarity. I think this is a spot on. I really like this one. Then we have number four, close up or face of SKS, woman fashion model. Portrait of young SGS woman, cinematic flower patterns and so on. And the last one is a portrait of Princess S woman. If you like any of the images, you can download all of them here. If you want to download all of the images here, you can click Download Generations. And this will download all of the images here. Let's do that. Okay, here, let's put cat generated images here. You'll have a Si folder with all the images here. Okay. You can also write your own prompt here. Just write your prompt then. Let's say you want to also add a negative prompt. You can click on this Advanced button. Not only you can add your negative prompt, but you can also configure other settings like Steps, See then we have with it. But right now, let's keep it more simple and just use the prompt and negative prompt. For inspiration, I will go to the gallery and choose some of my favorite prompts there. Here I can filter out that we just see women images woman. And now here let's choose the best prompt. This is like retro style. I think the first one is pretty fun. I'll use that. I'll copy all of it here. And also I'll copy the negative prompt. Here is my prompt. Okay, and then for negative prompt, let's also copy it. Okay, here, maybe let's reduce the number of images instead of 88 is default. Let's use four here and let's create image. One more thing that I forgot to mention is that when you copy and paste the prompt here and you generate with your own model, make sure that the prompt includes the instance prompt here we were that they've used the same instance prompt as, um, I created with my model. Here I'm using SKS woman as my instance prompt and they also used it here. One more thing, when we were training our model for the token, we just put the SKS. But here it's also important to add the class name. For the class name, we put a woman. It's best to include both of the token and the classname SKS woman. Make sure that you added in your prompt, otherwise the images that you'll generate will not look anything like the subject. Okay, so let's reload and check it out. Here are the results. I like the results way more than the leads model training. I guess that's because Astra is specialized in model training. They have better chosen settings for Dream Booth, Fine tuning. If you have more than one model with them, you can check all of your models in the tunes. Here I have my models, the new one with Cat and the older one with AI avatar. If you notice here it says deleted after 30 days the model will get deleted and you cannot use it for image generation. Actually, let me show you if I click on this model here you can see that there is no option for prompt writing because the model was deleted and I cannot do anything. Let's go back for T. If I click here, you can see that I can write my prompt here and negative prompt and so on. I can generate images with this model. Also, if I go to Gen here, I can write my prom automatically. My model will be pre selected. If I have one however, I can change it. I can choose from any of these base or fine tuned models here, but if you have more than one model, you can easily go to generate and choose between your models. But you can see that my other model, the AA avatar, is not here. And that's again because it's been deleted. What if you don't want the model to be deleted after 30 days? Well, there are two options. First you can extend the model storage. For that, go to your account here in the billing here, I can turn this on automatically extend model storage and it's going to be $0.50 per month per fine tune, and it's going to be deducted from your balance. Make sure you have a sufficient balance in your account so your model doesn't get deleted. Another way you can do, let's say you don't want to extend the model storage here. If you go to tunes, click on the model here, you can actually download the model. Just click on this KPT file and this will download the whole model. It's going to take some time because usually the models are quite heavy. This one is 2 gigabytes and it's going to take around 40 minutes to download. After downloading the model, this is the file that you'll have, it's going to be with a CPT extension. Once you have your model, you can use it with stable diffusion that you run locally on your computer or on any cloud server, such as Google Colab. And in the next model, we will precisely cover that. I will show you how to run stable diffusion yourself and how you can use this model that we've just downloaded with it. See you then. 89. AUTOMATIC1111 Introduction: Hello everyone. In this module we will cover our last A, A application which is Automatic 11 11. Or the abbreviation is 11 11. What is it? It's a popular web interface for running stable diffusion for advanced users. It was developed by a user with a nickname Automatic 11 11 with contributions from a passionate community. Here you can run, train, and deploy stable diffusion models. It constantly is being improved and updated by the community on Github. So you'll get up to date features and the newest extensions. Let's check out some pros and columns for automatic 11, 11 for the pros. Here you have high image customization and control with lots of features and settings that you can adjust. Yes, as I already said, there are many parameters and settings to achieve your desired results. The quality of images depends on your expertise. If you're good with stable diffusion, you can create really beautiful images. Automatic 11 11 has a lot of features which include text to image, image to image in painting out painting. Then upscaling base recovery and lots more. It also supports many extensions and add ons for AA art and video creation. We've already talked about control Net in Leonardo's module. Now you're familiar with that. This is one of the extensions for automatic 11, 11. If you want to create videos, then you can use to form and there's lots more extensions that you can use with it. Another big advantage of Automatic 11, 11 is that you can use your custom models or choose from a wide variety of models. So you can choose from thousands of models that were made public by community and try them out with Automatic 11, 11. Also, when you run Automatic 11 11 locally on your computer, there are no restrictions in terms of what kind of images you can generate or what kind of prompts you can use. If you remember from images they were flagging specific words. Here are no filters or censorship, so you can generate any image that you want. Now, let's talk about cons. So the first one is that Automatic 11 11 has an advanced user interface with so many features that it's going to be very overwhelming for a beginner. Second of all, if you want to install automatic 11 11 on your computer, it can be quite challenging to do so because we've already got used to the softwares where you download them and you just click Run and they automatically install. However, here you would have to use Terminal on Mac or Command Prompt on PC. If you're not familiar with that, then it might be a bit challenging for you. Also, when you install automatic 11 11 on your computer, it requires a powerful GPU. Not all the computers qualify. If you use a C, then you can use M1m2. Finally, it takes experience to generate high quality images. If you are a beginner, then maybe using more beginner friendly platforms like Lexica will be easier and you will just get better results. But right now, with all the material that we've covered in the course, you should be well equipped to generate some grade images in Automatic 11, 11. Now let's take a look on how you can set up automatic 11, 11. There are different options here. You can run automatic 11 11 locally on your computer. Here are some installation guidelines in this course, I'm not going to show you the installation process because everybody has different operating systems, so it will be quite different for everyone. Another option you have is to run automatic 11 11 on a cloud server. That's definitely helpful if your computer does not meet the requirements for installation. In the next video, I will show you how to set up Automatic 11 11 on Google Colab. Here are useful links. Also, I'll be showing you how to set it up in run diffusion. I also want to mention that when you install and run Automatic 11 11 locally, then it's going to be free. But if you use Cloud Server with Google Coll Ap, you may need to pay round Diffusion is a paid service. This is also something that you need to take into the account when choosing between the different setups. This is all for Automatic. 11 11 introduction. In the next few videos, we will set up automatic as well as Get started and I will show you all the different features there. 90. AUTOMATIC1111 Google Collaboration Setup: Let's begin the set up with Google Colab. All you need is just click this link here. That's going to take you to Google Colab in the beginning. Here we have instructions, updates. If you want more information, you can check out this guide on how to use this notebook. Just click on the link. Okay, let's get started. It's not too hard. Here we have username password you can change you want, but let's keep it as default. First, here is the option to save small model images and settings in your Google Drive. If you want to do that, you can keep that as default. If you don't want to save anything, then click nothing. If you're a more advanced user and you want to save everything, then choose this one. But it's just going to take a lot of space from your Google Drive. Let's just keep the small images and settings here, then here we can choose what models we want to use with Automatic 11 11. Here we have a version 1.5 the 1.4 F222 model. This one is nice. The Dream Shaper model is also good. There are a bunch of models that you can choose from. You can also select the version two models. Version two with 768 by 768 train dataset and so on. Okay, for now let's use the default 1.5 model. Then we have extensions. So if you want to add control net, then you just click that control net here. If you want to use different models that are not listed here, then you can provide a URL with that model. And the same applies to extensions. If let's say there is a different extension that you want to use that's not listed here, then you just use the URL and paste it here. Okay, so right now we basically didn't change anything except I just added the control net. Now let's run it. Just click this Play button here. Okay. And here we need to click continue Anyway, because we want to save images and small models on Google Drive, it asks to connect to Google Drive. Let's give the access. Now. It's going to take some time to launch automatic and run all of this code here, install everything that it needs to work. Right now, we just need to wait. After around 6 minutes, I've got this local and public URL and that's how you know that it's good and running. However, if you find that it's been quite a while and still loading, if you've clicked lots of models and extensions, try to reduce that. Also, if you want to use heavy extensions like Control Net, then you may consider upgrading your Google Collapse. You can try table diffusion with the free Google Coap first. But if you find that you need a little bit more, there are the different plans here. For example, Coa gives you 100 compute units per month, that means faster GPU and more memory. Also there is coal, which gives you 500 compute units per month, Faster GPU, more memory, background execution terminal, all the good stuff here. We can just click on this link here. And it's going to open in a new tab here for username and password. Put the same username and password that you've entered here. The default one is a, let's use that. Here is our web interface to stable diffusion. Here on the top, we can choose the model if you selected more models that you can choose from all the different models here. Here I only have the version 1.5 Here is where you write your prompt, the negative prompt, and click Generate. Okay, let's try some of our prompts here. I will use the first prompt, Professional Portrait Photograph. Then I'll put the negative prompt here. You have default settings, you don't have to change here anything. Let's generate first, here is our result. It's cropped, okay. I don't think we have that in our negative from Let's put cropped also. Let's increase the badge count is how many images you want to see being generated in parallel gills. Again, try that once you've generated these images. If in this step you didn't change anything and you kept save in Google Drive, small models, images and settings, you will find the images in your Google Drive. So if I go to my Google Drive here, I can see the output. You will see a folder called AI picks. If you click on this, here will be all the information and the small models. If you want to check out all the images that you generate, click on this output and then text image images here you'll see your output, for example, this one. Let's go back to our web interface here. Here, because we added the control net. You'll see the control net here. You can expand, you can upload your image. I will show you a little bit later all those settings and how you can adjust them to great images. But before I explain all the parameters here, I want to show you how to set up automatic 11 11 with a different cloud server called run diffusion. That's because some people may find run diffusion a bit easier to use, especially when it comes to using and installing your own models. In the next video, we'll cover run diffusion and then we will talk about different settings and how to create great images. 91. AUTOMATIC1111 RunDiffusion Setup: Another cloud server that you can use to run Automatic 11. 11 is Run Diffusion. Let's click here. Here is their website. Run Diffusion.com here, you just can click this. Get started, you'll need to sign up here. I already have an account for them here, it just locked me in. Okay, here you can. Here you can choose what server to use to run Automatic 11, 11. They have different plans. Here is the small server and it costs $0.50 per hour. It's 3 seconds image generation, it has stable diffusion. 1.5 and 2.1 300 gigabytes models loaded. Then latest automatic 11, 11 and Deform, And then two minute wood or launch time. Then we have a more powerful server that costs $0.99 per hour with 2.2 seconds image generation. The highest one is the 2.5 dollar per hour with 1.6 second image generation. Here are a few more servers, but for automatic, I would recommend using any one of these. Also, when you just sign up, you get 30 minutes for free to try it out. For example, to use the 30 minutes, I would recommend using the cheapest one you get accustomed to. All the features here you can see you can use the remaining balance. And here is 30 minutes. If you choose, let's say more expensive one, then it's only going to be 6 minutes because it's based on the credit here. They give you basically $0.25 for free in order to benefit from all the features you would need to sign up for the Creators Club. Here you get 30% of the large hardware, the 2.5 dollar, 1/hour Then you have 100 gigabytes of private storage, $6 of starting balance. But I guess the most critical element here is that with Creators Club, you can upload and merge models. So basically you can upload your custom model here and use it to generate images without the Creators Club. That's not available. If you want to sign up to Creators Club, you click the sign up here. So here you enter your payment information. If you want to use my promotion code, it's Caterina 15 and it's going to give you a 15% discount on your first month. Okay, let's try it out. So I'll go back here. I've already signed up to Creators Club, and this is what you'll see. You'll still need to choose which server you want to use. You get the discount with the Creators Club. Let's select this one. Okay, I have, my balance is $5.35 here. I can choose for how long I want the session running. Let's choose 1 hour here, then here it's automatically selected that. It will play down to notify when the session is ready. Okay, let's launch it. So it's going to take some time to initialize and launch and you'll be notified by the sound the first time you launch Automatic 11, 11. Here you'll have the username and password filled up. Here I have the S D user. Let's click Log in here. And here are Automatic 11 11 fowlers. For example, here you can add your model and I will show you how. Here we see Automatic 11, 11 interface. At the top here you'll find a timer. I chose 1 hour. It counts down that 1 hour. Let's say you change your mind and you want to make the session shorter. You can always click this Stop button and it's going to shut down the session. If you want to extend the time, you can click this extend button here on the top left. We can choose what model we want to use to generate images. And Here are a bunch of models here. There are base models like stable diffusion of 1.5 stable diffusion 2.1 But there are also more fine tuned models such as Dream Shaper. And here are also different versions like Dream Shaper 76 and so on. You can choose your favorite model here, for example, let's, let's use just the basic 20022. And let's generate something with this model here. Let's generate our Landscape prompt. And here I'll just click January again. We have default settings here. If I want more than one image, then I can increase my badge count. So let's put four. It's going to be pretty quick because we've selected the highest server. Okay, great. So here are our results. All the output images you'll find in the automatic 11, 11 images, just click here. Every time you start a session, you'll have a new folder for each day. Here is the 21st, let's use this one and here are my castle images. Okay. Now let me show you how you can upload your custom model here. Remember from the previous module, we've trained and downloaded our custom model here with the KPT extension here. You can just drag it to automatic here and it's going to load. Now I will also show you a different way you can upload it a little bit faster. Because here it's going to take maybe an hour to upload. But let's say you've already uploaded your model to Google Drive. Then from Google Drive you can upload your model here in a few minutes. Let me go to my Google Drive here. I've just dragged my model to my Google Drive. Let me find where it is. Okay, here is my model, SKS woman. I've renamed it to make it more clear here, all I need to do is go to the small actions here. I can go and share. Share. Okay here, make sure that the axis is anyone with the link, then copy the link here. Let's go back to Run diffusion here. To upload your model from Google Drive, just click these three dots here and click to shell here. You can put down and then Space Space. Paste your URL link, okay, here. Then click Enter. Now it's going to upload it, which will be a way faster than just dragging and dropping the file from computer. To learn more about different commands that you can use here you can go to run diffusion and let's say documentation. Here they have some guides and questions and answers. They also have a disc account. You can ask your question there to see if you have the model. You can reduce this window here to shell. Here we have our model that we got from Google Drive and as you can see, it was way faster here. Our model from computer is just uploading. It's not even halfway there. Now very important we need to move our model to this models folder. Let's drag it and drop here. Okay, now it's going to be in models here. You will need to move your model into the correct folder. Here we have version 1.5 it was stable diffusion 1.5 It should be in the version one. If you have models that were trained with stable diffusion 2.1 then they will be going into this version two folder. Let's place my model here in the version one. Okay, here I got my model here. Now all we need to do is just reload this website. Reload. And also let's refresh here. Okay, now we should see our model in the list here because the bunch of models, the easiest way is just to put the name of the model. Let's put cat here is the model. Model a woman. Okay, here it is, now loading. Now I can generate some prompt with my model here. Let's try again using the Astra prompt. Fashion photography portrait. I'll go back here and paste the prompt. I'll paste the negative prompt. Okay, let's generate. But here I will increase the badge count to four. Generate. Here we've got some interesting results with what looks like my face here. I like the dress from flowers here, but there are some artifacts in the face. And D as well. Okay. If you notice as when they have the prompt here, they also specify the parameters that used. For example, here you can see that what schedule they used, they used oil, the size steps, and so on. We can also try to use that here. Let's put 30 steps. Let's choose oil as selected here. Then let's they also use the face. Correct. Let's use that one here. Restore faces. Sure. And also yeah, let's try that. Let's check this out here. I think this one is pretty good. We don't have the lines on the face here. Let's see the other one lines again, but overall this is a bit better. I like this one. This is how you can use your custom model with Automatic 11, 11. And here you can generate whatever you want. Try different settings. In the next video, we will talk about those different settings and also the control net. Here, by the way, here are some useful commands that you can use with run diffusion. We've already tried the down that allows us to upload our model from Google Drive and that's way faster than uploading from a computer. Then with these commands, you can also upload model from any other link. For example, you find a cool model on some website, for example CDAI, that has bunch of cool models. When you want to use a specific model such as this one, you can just go and double click on this download. And here you can see the scopy link address. Then let's go to run diffusion here. Again, total the shell here. Let's put two space x eight. Then let's base our U R L here. Here is it. And it's going to download. The download is complete. Let's see, Let's reload. Okay, here is at the model, it's around 2 gigabytes. Now let's move it to models. Here, let's check out which is it is one or version two. It's version 1.5 Let's move it to version one here. I can also rename it if I want to. Here I can go and edit it. For example, I can put my model like this, rename. Okay, so here we have my fun model. Now let's refresh. Okay, let's check it out again, I'll use here, we have my fun model here, for example. I can use a girl here. Let's put for work. Let's, I've generated an image with this model that I just took from Vit Models like this one would be in run diffusion so you don't have to upload them. For example, if I just try to look for it. I B, as you can see it's already here and you didn't have to upload it. But I just wanted to show you how you can upload a model from a website like Bit AI here. We can also choose in painting version and so on. As you can see, there are a bunch of different models to choose from. 92. AUTOMATIC1111 Basic Parameters: In this video, I'll be showing you different settings and parameters that you can change in automatic 11, 11 to achieve better results with your image. Throughout the course, we've been talking a lot about stable diffusion. How to write prompts for stable diffusion. We've also covered different parameters here. It's going to be a great summary of the course because automatic 11 11 here, we'll find all those settings. It's going to be a good revision of all those terms. So first you want to start with a good model. It can be a basic stable diffusion model, like version 1.5 or version 2.1 Or it can be a more fine tuned model, like a dream shaper or I chose this ICBI L model, I quite like it. It's great for Porter realistic images. Then what's very important is to have a good prompt. We've covered that in Pro writing, where I explained some tips on how to write good problems. Let's quickly revise that. Here I have photograph, mid shot photograph of beautiful Brazilian woman. Here is our subject. Then we have a subject description in a bush jacket, extremely detailed ice. Then we have our background in a wild jungle through the foliage. Then I put in the style of dark brown, iconic sharp focus. Now we have lots of stylizerstendreyle of Jessica Dwon and Greg Rod Koski. These are our artists. At the end, I've also added 16 K HTR. We can move that before the style, so it's more consistent. Okay, here is our prompt. Now we need a good negative prompt. And for negative prompt we can use something like this. You'll find this prompt in the prompt presentation, cop head, bad framing and so on. If you want to highlight anything in your prompt, you can put that in parentheses. For example, Stop, that's important. And you can put 12 or more parentheses for now. I think that's good. Another thing here in Automatic 11, 11, there are styles you can actually save. For example, if you want to save this negative prompt the next time, you don't have to paste it here. All you need to do, okay, let's delete this first. Here, you just click Save button. And then you choose how do you want to save it? In my case, let's put just negative, Click okay here. Now you'll find that in the styles, you can just click here, choose the negative prompt. And if you write any prompt here, for example, our prompt here, the negative prompt will automatically be applied. Let's try it out. Let's generate. Okay, here. If we go down here, we can see our prompt and we also see the negative prompt. As you can see, even though we didn't put anything because we chose it to include in the styles, now it's automatically was applied. Similarly, you can do that for the prompt. Let's say you want to save stylizerst'st in the style of dark brown. Again, you can save that as a style. Let's save it. And let's put a photograph style. And let's click okay. Now we can choose our photograph style. We can choose negative prompt. Here, we just need to put our subject in the description of the subject. Here I'll put mid shot photograph of a beautiful Brazilian woman in a bash jacket. Extremely detailed eyes in a wild jungle. Using those styles, it's going to automatically apply our stylizers as well as the negative prompt. Let's see how it works again. Let's generate again. If we go down here, as you can see, our stylizersre automatically applied the same as the negative prompt. This is how you can save different styles for the genres that you work with. And that's going to simplify the whole process. Okay, now I'm going to delete that. Well, actually let's keep that. Let's choose the negative prompt and our photograph style. Okay, perfect. So now here, let's go into the settings. Let's begin with the simpler ones. So we have this badge count and badge size. The badge size is a number of images generated at the same time in one badge. Badge count is the number of badges generated one after the other. The differences between those two is that for badge size it requires let's say badge size of four or six. Because they're generated at the same time, it requires a higher V realm. It's more GPU heavy than the badge count. In terms of generation time, they are pretty similar. Let's actually try it out, for example, badge count. Let's put two. Let's okay, so here we've got two batches with one image in one badge. Let's change that. Let's put badge size with two. Again, here we've got two images, but this time they are in the same batch. Now let's move to width and height. For width and height, I would recommend sticking to the native width and height as was used for training the model. For example, this one is using the stable diffusion 1.5 version and was trained with the resolution of 512 by 512. This is the native resolution. If you want to make the image with higher resolution, then you can change the model. That's let's say 768 by 768. Or let's say you want to generate like twice as big as this, then you can use something like higher fix. We'll also talk about that a bit later. The problem here, if you just choose higher resolution here, let's say than something 1,000 by 1,000 then you may get strange results like double heads and so on. For that reason I would not recommend doing that. But let's say if you want to have the image in a different aspect ratio, then you can also change that. You can use the aspect ratio calculator to help us. Let's say I want the ratio with 16.9 Here are the width and the height. Let's use that. Okay, let's try that. Okay, as you can see here, we get this undesired result with the second replica, this woman here. Let's actually try the same ratio, but make it scale down by the factor of two. Let's put 640 by 360. That's going to give us the same ratio. As you can see, it gave us the same ratio, but now we do not have that undesired result. You may have to play around with height if you want something else but the square. 93. AUTOMATIC1111 Parameters - Sampling Steps, Sampler, Seed & CFG Scale: Let's move on to sampling steps. Steps are the number of noising iterations in the generation process. As you remember, stable diffusion starts from random noise. With every step, a new information is added eventually to get to the clear image based on your prompt. Let me show you some examples. Here are sampling steps. As you can see, when we have only one, it's a very blurry image. But as we increase the sampling steps, we get more and more information here. This is at five steps, it's still a bit blurry, but at least we see the eyes. The nose of husky dog and then at ten it's now a clear image. And then the more we increase the steps, the more details we get. But there is not much difference as between, for example, step 1.5 or 5.10 Here, it's very small difference between steps 20.30 or 20.50 Maybe there are slight changes but barely visible. I would say around 20.30 steps is a good choice, but it really depends on your image. If you want to make something more abstract, maybe you can choose five steps if this is the image that you try to achieve. Okay, let's get back to our run diffusion here. Let's just try with this image. Let's put, let's say ten. Let's, let's see here. We've got some details, especially on the jacket here. But overall, especially like the hair face is quite blurry. Let's pump it up. Let's say 40 steps. It's going to take longer time to generate, but let's try it out. Okay, here, let's check it out. You can see that the hairs are now way more detailed. The background as well has more details. This is how you can adjust the sampling steps, even though there are quite a lot of details. But there's definitely a problem with the face here. Here we have a button called to restore faces, and we can use that to help restore the face here. I'll copy the seat so we'll get the same image. So seed. Okay, let's generate again. Now you can see that the phase is way better. And that's what restore phase button here, what it does. Okay, Now let's move on to sampling method. Okay, here for sampling steps, let's make it 30 because I think that's a good average. I will also disable the restored phases because it also takes longer time to generate images. However, let's say if you liked a certain image in your badge, you can always restore base on that specific image. Okay, for sampling method, sampling refers to the denoising process in stable diffusion. And there are different sampling methods that can produce a bit different results. Some they will converge to the same image, but others will produce slight differences. By default, oiler works pretty good and it's also a quicker sampling method. What I mean by quicker is that it's going to give you good result with a number of sampling steps at around ten or 15 sampling steps. It's going to give you a clear image, maybe not with too much details, but it's going to be a nice image. Actually, let me show you all those different sampling methods. Okay, here on the right we have different samplers like oiler and you can see that at around step 16 we already have a good image, and another good one is D D IM then we also have A new samplers like P M plus plus two Mars. These are newer samplers and we'll also produce fast and good results. Maybe let's choose that 12. Here we go. Okay, let's try that one. Here is the result with the PM two, may I'm going to switch that back to oil. Now let's talk about the CFG scale. Cfg scale controls how much image generation process follows the text prompt On different platforms you can find CFG scale could be referred as Prompt Guidance, guidance scale, or even in some, I've seen that as prompt weight. These refer to the same setting here, basically, how much generation process follows the text Prompt. Where lower values mean that the result is going to be more creative. Higher values will be better aligned with the prompt, but may create a higher risks of artifacts. And I'm going to also show you the default is around seven and the recommended range is around four to ten, depending on your image. Let's see. Okay, here we have two different images, one with the scheme here, and this is with the CFG one. You can see that usually a lower CFG will also have less details and will also be more pale. Whereas with a higher prompt guidance, you'll find images that have more contrast. Overall, they are more saturated, so lots of colors. Also, if you take a look at this prompt guidance of 15, you would notice some artifacts starting to show up. For example, the fur starts to feel unnatural and more sharp, whereas before that, maybe at 75, it's way nicer. Let's try it with our image here. Let's maybe CG scale two, so you can see the contrast G two. Okay, let's generate. Okay? You can see here that the colors are more dull. Let's try with the higher scale here. Let's maybe put around 11. Let's use the same seed. Okay? Okay. Do you see the difference here? Colors are way more bright, 6-8 It's going to be a good balance in colors quality and you'll get image that matches your prompt. Let's put eight here. Okay, Now let's talk about the seed. The seed is initial input that guides the creation of the image. The same prompt and parameters will produce the same image with minor variations here, by default, you'll have a random seed every time image will be generated with a different seed. But let's say you liked a certain image, then you can use the seed. Let's say I like this image here. I can click this green recycle button. It's going to copy the seed number of that image. Another way to see the seed is to go below here. Here at the seed. You also can get the same number. Okay, let's say if I change some settings here, let's say I want to restore phase. I've copied the seed, now I'm expecting very similar image, but now it's going to have better quality phase. Okay, as you can see, almost the same image and minor changes in the face here. That's how you can use the seed. You can change some parameters. Photograph of beautiful prettily woman, let's put smiling. Hopefully that will only change her emotion. But we'll keep the composition the same here. I think it changed a little bit too much. We have new clothing details. It really depends how you modify the prompt and what effect it will have on the image. Here. Sometimes it will just have minor changes. But with the same set, you will get a completely different image. 94. AUTOMATIC1111 ControlNet: Now, what happens if you want to generate images in higher resolution? This one is 512 by 512, which is a pretty low resolution. What if you want to generate twice as high? Remember when we've talked about width and height, I told you that it's not a good idea to just modify the width to twice as much because it's going to produce strange results. And you'll get double head, double bodies and so on. For that there is this button called Hire Fix. If we click on it here, let's disable the restore phases. You can see that you can resize an image 512-1024 This is an upscale by a factor of two. Let's say you want it even higher, then you can change that to three. It's going to be 1,536 5,536 Important things to change is this upscale by factor, you can do 23 or four. The higher the factor, the longer it's going to take to generate the image. Another thing that we can change here is the higher steps, we now it's at zero. For higher steps, you can set it to zero for pure image upscaling. If you don't want any changes to the image you want it as, then you can set it to zero. But usually it's nice to set it maybe two as half as many steps, your sampling steps. Here we have 30 sampling steps. Let's put 15. Another parameter that you want to change is the denoising straight. The denoising strength controls how much the image will change. Near zero, no detail will be added to the image. Near one, it will completely change the image. I would recommend to use around 0.2 and 0.7 here. The lower it is, the more closely it's going to be to the image. Let's put your 0.2 for example. Another thing you can change is this up scalar. There are different up scalers you can use, but for now let's just use the default latent one here. If you have a random, it's going to generate a completely different image. First, it's going to generate image with 512 by 512. Then it's going to scale it by your factor here. In my case it's going to be three x. Okay, let me use a random seed here and let's generate. You will not see the 512512 image, you will directly see the higher resolution image. It's going to take a little bit of time. It took way longer to generate just this one image here. I even reduced my app scale factor because it was just taking so much time here. I changed it to two. Here is the result. The quality is way better here, but it just took so much time. What I would recommend is not using the higher fix and just generating a few images. Let's put like four images then choosing your favorite images from one of them. And I'll show you what we can do next. Let's see these ones. I think this one is pretty fun one. Let's use this one. Okay? The way you can improve the quality of this image is move it to image, to image generation. Let's send it to image to image. Okay, now we're in this image editor. I see how change the tab to image to image. Okay, here we have a few more settings. Okay, here you can choose to restore the face also. Let's resize it by here. If you want to set the width and the height, you can do that or you can resize by a factor here, it's easy for me to do the resize by a factor here. I will change it to two. Then again, you can change the prompt guidance scale. But the most important parameter that you'll be adjusting here is the denoising strength. Again, denoising strength affects how similar it will be to your image here. If we want to make it very similar, then we can set it to 0.20 0.2 here. If we want, let's say more details or changes, then we can increase the de, noising strength. And let me show you what will be the difference. Let's first create with 0.2 here. Next time I'll change it to 0.7 Okay, let's generate. Here is the result. Okay, Now let's change it to zero point. Let's actually put 0.85 We see the difference here. You can see that there are so many changes to the original image. Now her hair is covered by this hootie here. We also have a slightly different background. She's smiling way more here on this, her eyes are squinting. Quite a big difference. That's how you can use the Ois strength to choose how much you want the upscaled version to resemble your original image. 95. AUTOMATIC1111 Hires Fix and Image to Image: Now let me show you how you can use Control Net here. You can use it with image to image generation, but for simplicity, let's use text to image. Okay, Here if you go down here, we have this extension already installed. So we have this Control Net version 1.1 okay? And we just can open it to remind you we've covered control net in Leonato, More Agile. Control Net allows a reference image such as this one to influence specific attributes of the generated image. There are different models. The more popular models can, which takes edges of this reference image, which makes a depth map from this reference image, and then pose to image. Pose to image can be just the pose of this person here. Or it can be together with a facial expression. You can read more about control net here in this article. But let's get back to run diffusion. Okay, here we've opened control extension here. Just applaud your image, your reference image. For example, let's use the same one as we've used in Leonardo module. This one, Okay? Here, very important, Make sure that it's enabled. Because if it disabled, if there is no tick here, then it's not going to use the control net. So make sure it's enabled. Okay. Now here, let's choose the model. There are different models, and there are many of them just for let's say depth to image. There is four of them. For open pose, there are even more. There is open pose full, open open pose face, the open pose full is a pose with the face. The open pose is just the pose without the face. Yeah, let's use just the pose here. We chose our preprocessor open pose automatically. We'll choose the model for us here, which is nice. Then to see how it will process this image, you can click Allow, Preview, then click on this explosion mog. Now it's going to process the image and we'll have the control net map. This is what it will feed to our image generation. Let's say if you choose a different processor, let's use the open pose full again. Let's click this explosion here. You can see that now it also includes the phase. Okay, I have some problem with scrolling here. Scroll manually. Okay. Here, open pose. Let's use the open pose. The next important parameter here is this control weight. You can adjust control weight to increase or decrease the influence of control net on the generic grid image. Let's say, let's actually try something first. Let's use open post Okay, process. Now here let's click Generate. One more important thing, if you use ad blockers or if you use brave browser. Control net may not work with run diffusion. If you have a problem, control net is not working, try to switch to a different browser. Okay, here we've got our four images. And you can see all the images, they have similar pose to our reference image here. Now let's reduce the control weight and see what's going to happen here. Control weight is one. Let's make it, let's say 0.2 That's going to reduce the control net influence on the generated image. Okay, let's see the other images, the four images. As you can see, it still uses similar pose. But right now the poses are a little bit more diverse. That's because we've decreased the influence of control net. Here, with open pose, the control weight is not quite apparent. But if we use Y, for example, Y is the edge to image, it's going to use all the edges here. For example one. Let's generate here, you can see that mimics our reference image. The here, the color of the dress is different, but everything else looks very similar. Now let's reduce the control weight. Let's make it 0.10 0.1 Let's generate again here, even though we've used the same Can model, basically we've used the same settings except for control weight. Now the images have way less influence from the control net map then previous badge. Here you can see that some images still have let's say similar pose but this one is straight looking as you can see. Way less effect of control net on these images with the can. I think it's pretty apparent here. This is how you can use control net to influence your images, which is a very useful tool. 96. AUTOMATIC1111 Inpainting Extras: Now I want to show you how you can use in painting in Automatic 11, 11. For that, I will disable the control net here. Okay? It's not going to be affected by control net here. I will go to image, to image. Actually, before that, let's generate a full by short photograph. I can show you fully. Short photograph, that's good. Okay, even though we put here full body shot, we're still getting a meat shot here. I will just add the trousers. Okay, let's beautiful Brazilian woman. Okay, let's take, smiling away in brush jacket then and safari trousers. Poor eyes. Let's just take that out in a wild jungle. Okay, that's good. Let's try that. Okay, let's check this out. Okay, so this is more of what I was looking for, okay? I like the second one, okay? See how the phase is screwed up in all of them. And that's because there's just very small area for AI to generate good quality face here. For that, you can go and try to upscale. Another method is you can send the image to paint. Let me show you send to paint. Actually, let's first choose the image here. Let's use this one. Okay? First of all, let's increase the resolution of the image a little bit. Send image to image. Okay? Now here I will first increase, I will resize it by two here, denoising strength, Let's make it 0.4 0.5 Yeah, I think that's 0.45 I think that's good. Okay, now let's do that. Okay, so here we've got the image that's way better. We've got more details on her shirt, her pants. Okay. Now, but as you can see, face is still not the best quality for that. Let's send this image to paint. Okay, Now it's in the paint tab here. We can zoom in and use this arrays tool to erase the face. Okay, Here if you want to change the size of the brush here, you can change it. But right now I think the swing would be good. Okay, now I've erased the face here. If we scroll down, here are the settings. The most important parameters here are masked content and paint area. The first masked content as if you want to keep the original quantent underneath your erased heart. Let's say if you want to keep a very similar phase here, then you should choose original. If you want to generate a completely different phase, then you can use either latent noise or later nothing. Then we have paint area here. You can select whole picture or only masked the difference will be in the resolution of generated image. Let's say if you choose only mask here, the resolution will be way higher than the whole picture. Because here we've selected just this small area here. And it's going to be using the same resolution as this whole image to generate just this area. Which will give us way better results. Especially if you work with faces that's ideal to choose. Only masked, then you can also adjust the denoising strength, and that's going to influence how close it will be to the original content. Let's say I want only minor adjustment to the phase here. Then I can move my denoising strength lower, so make it 0.3 or two. But if I want a bit different phase, then I can move it higher and make it 0.9 If you make it the highest of one, then the phase will not resemble anything like the original content. So for noising strength, let's put 0.5 because I want to see some differences. Now let's change the prompt here. We want to generate images of the face here I will full by D shot here. I'll just remove that photograph of beautiful Brazilian woman Here I can put detailed is something that refers to her face. Okay, I have my negative prompt and photographers. Great, let's generate. Okay, now we've got quite different phase but now it's way higher quality. In my experience with, in painting, you would get way better faces than if you just click restore phase here. Restore phase, you can do that, but from my experience in painting will give you better results. Now let's try a few more things with, in painting. Now I like this image and I will move that to paint again. Okay, here I want to show you how you can remove stuff from the image. Let's say I don't like the earrings here. I just want to remove them again, Just paint over the element that you don't like here. Okay. Now let's adjust the settings here. Again, I want to keep the original even though I don't want to see the earrings, but because here I have the hair around, so we want to have the same hair. I'll keep the original down here. I will just move the noising strength to a higher value, maybe 0.9 We won't get the earrings up here. In the prompt, I will also put photograph of a beautiful Brazilian woman here, hair, and I don't put any earrings. Let's try to generate this and see if it's going to work. I also increase the batch number so we have more images to choose from. Here, I'll choose form. Here are some results. Instead of the golden earrings, I tried to add the leaves, even though we didn't say anything here in our prompt. But let's see, other ones again, some leaves. Okay, The best one is the last one. But, but what it added here, I don't think it quite matches her hair style for that, if you want to remove something from the image. There are actually better models that you can use for, in painting. If you use front diffusion, you can just go and search in painting. Here are all the different models that have been trained to do in painting, such the basic one, we have the stable diffusion version 1.5 in painting. If we run it again, the results usually from what I found, we better if you're trying to remove certain elements from your image. If you are changing a phase, then again, from my experience, I usually like using the models that are best for pass. Not necessarily in painting, but just either Photos models, the ICB, INL or Dream Shaper, which give a really good phase results. Right. Now we select the painting. Here we have the 1.5 grade. Again, we have everything deleted here. Let's check the settings again. We have original At the bottom here we have the noising strength, 0.9 which is high. We don't want to see any of the golden earrings. Let's try that. By the way, I also changed my prompt a little bit. I put coiled hair. Okay, let's see the results. Okay, On the first one, I see that it perfectly removed the area. Let's see the other ones. That's good. I think the second one is the best, the shape of the ear, especially in the neck. Let's choose the second one. Once you've got the image that you like, you can actually upscale it even more For that, just click Send to Extras. So let me show you what it's going to do. It's going to open this extra tab and we'll upload our image here. Here, basically we're not changing the image anymore, we just want to upscale it as here. All you need to adjust is this re size factor. Here I can upscale four X. Also very important. Choose your upscale. Here in run diffusion, we have a bunch of up scalers. The default one you get with automatic 11, 11 is the RS gun X plus, let's use that here again. Here we have a factor four. If you prefer writing the width and height yourself, then you can use that, but for me it's easier just to choose a factor. Okay, Now once you adjust the settings here, you can click Generate. Here is a final result which you can save. Save image as save grade. Now the image is 4,680 4,608 Let's open it here. And now you see that the quality is way better. Maybe I would work a little bit more on her hair, but overall, everything else looks really good. Another thing is that each up scalar is a bit different. You may want to also play around with different up scalers. For example, if you work with anime, there are specific up scalers for anime that work better for anime styles. I've summarized all the information that I've told you in the Powerpoint so you can use that for reference. I also listed useful links that you check out. Here are some really good guides for Automatic 11, 11, Especially because Automatic 11 11 is a pretty difficult app. So it's going to take some time to learn and master it, try it out, play with different parameters and settings, and challenge yourself to generate some cool images with it. In the next module, I'm going to be using different AI tools that we've covered in the scores to show you my workflow when I'm tasked to generate specific images. 97. Create a Comic Book Version of Yourself in Astria.ai & AUTOMATIC1111: Hello? Hello. In this module I'm going to show you my workflow on specific project. So let's get to it. Okay, so the first one is pretty easy, AI selfie, create a comic book version of yourself. So for this project I will be using Automatic 11, 11 and the model that I trained in, Astaire. Let's go to run diffusion here. Okay, here I will find my model that I've trained in, Astin, I have uploaded to run diffusion and it's model SKS woman version 1.5 staple diffusion. Here when I renamed my model, I usually like to add the instance prompt which is in the SKS woman. So I don't forget, I need to make sure that I add the SKS woman. I have already prepared a prompt for comic book images and I used Astra as the inspiration here. They have a really nice prompt, 23 with a comic portrait of cyberpunk SKS woman, and the images are really nice. I've used some of the prompt and then also added more of anime. Okay, here let me paste the prompt. Here I have a comic bulk of a cyberpunk superhero SS, Woman with big and cute eyes, curly hair, Comic book phase, fine details, night setting, very anima style anima style. Or here is repetition. Manga style hand drawing, cinematic sharp focus illustration. Big depth of field masterpiece, concept art. So lots of stylizers trending on art station. And then I specified the style and the artist, and then I gave the weight of 1.2 See how my SKS woman I used to parentheses. This has the highest weight here. Okay. Now, let's also add the negative prong and I've also prepared it here. I just have lots of words like to form blurry and so on. Okay. Before we change any settings, let's try it out. Okay. Here for badge count or batch size, I only have one image. Let's make it four images, Let's check this one out. Okay, it captured some of my features. I'm not sure what it is on the forehead here. But let's try one more time. Let's see. Okay, that doesn't have my facial features whatsoever. Let's see the other ones, he is not too great. The phase is a bit messed up. See, let's try to do restore phase with that. The restore phase makes the face less anemic comic book style. I don't think I want to use that parameter, I'm going to disable it. I'll just try generating a few more batches. Okay, let's see. This one is way better, but again, something on her forehead. Let's see, the other ones definitely better in terms of facial features. Okay, not too bad. I'm wondering if we should change anything about the prompt. Maybe in the negative prompt, I will add photo realistic. Maybe for Comic Book, I would also add more weight. Okay, Comic Book as woman would pick. Yes, I think that's a pretty good, let's try that. Okay, this time, let's check out. I like the result here. It's comic book style, yet it captures my facial features. Let's see the other ones. Yeah, not bad here. Okay, this is to enemy. And this one is also pretty nice. Okay, now that I know that the prompt works, I want to generate my images in a specific pose. For that, I found a cool image of Tony Stark, and I want to recreate that pose for my character here. Okay, and for that I'm going to be using control net. So I'm going to open it and I'm going to upload the image. Here is the Iron Man create here, make sure it's enabled. Then here I'm going to use the depth model. Let's preview it here. I'm going to click on this explosive G. This is the depth map and control weight is one. Let's try it with that. Okay, here, let's generate again four images. Use that. Okay, let's see. I like this one. This one seems too realistic but is also not bad. Okay, actually I want to use the seed for this image. You get this image again for seed. If you want to copy the seed, you can click on this as I call it, Recycle button. Here, we've copied the seed of this image, full control net. I want to change the control weight because I think the pose is forced, so I'm going to make the control weight smaller. So here we have one. I'll make it 0.7 okay? And let's generate, okay? Okay. Let's see. Okay, So the first one, okay? I actually really like this one. I'm probably going to be using it. Let's see the other ones, this one is not too bad, okay? I like the first one, but there are a few things that I want to work with. For example, I don't like this red button here, so I'm going to work a little bit more on the suit. Okay, so here I'm going to send my, the first image to paint. Okay, let's zoom in. Okay, first I'm going to work with the phase. I'm actually going to replace all of this here. Now for the settings, let's choose original for mask content here, we'll choose Only Mask to increase the resolution of the mask area. For denoising, let's make it, right now it's 0.2 Yeah, that's good. It's going to change a little bit, but hopefully that's going to increase the quality of the face here. Let's remove the night setting because it's not relevant to the face. I think that's pretty good. Also, Curly here, comic book phase. Okay, let's generate. Okay, so let's check out the results. I think here I've got too many wrinkles. Let's see the other ones, okay, I think the best one is this one. But I still would want to correct some things here. Again, let's send it to paint. Okay, Create. And now can remove small details here. And here I'll just put me and the skin. And let's check our settings. We have original only masked. Then denoising strength 0.2 Let's make it the highest because I don't want the wrinkles 0.8 That's going to change that. Let's generate. Okay, here we've got some strange results. I'm just going to go and make the denoising strength lower. As you can see, you just have to play around with the denoising strength. Okay, let's make it 0.5 Let's generate, okay, I'm still getting strange results now. I'm just going to choose the in paint model in painting. Okay, let's use the stable diffusion 1.5 in painting. Let's take a look here. The results are way better. And this is what I was looking for here. We just changed the model to stable diffusion in painting. Model that was trained specifically for in painting. And we kept all the other settings the same and see how much better it is. Okay, let me show you the settings for settings. I had mask content original, only masked in painted area. And for denoising I had 0.5 But here with this model, the results are way better. Okay, now let me choose the image. Let's see the other ones. I think the third one is the best. I will send it to paint. Now here I can also work on some details. For example, the shot, for example. I can remove this orange pattern here that I don't like at the top here. I'll just put Superhero Shot. Okay, for the noising, let's make it a little bit higher, 0.7 That should be good. Let's generate. Okay, now we try to incorporate the Super Man logo on the suit. I don't want that actually. Instead of superhero, I'm just going to put blue suit. Okay, this time it's a bit better. Let's see the other ones. This one is pretty good. I like the second one, but I think we can work a little bit more with that. I'm going to send it to paint and work a little bit more and we'll show you the final result after a few painting. Here is the result. Now I'm going to send it to extras and we'll upscale the image. I, let's use the anime here. Let's upscale for X. I think that's good. Let's generate, let's check out the upscaled image. Okay, I think it turned out pretty good. I'm pretty satisfied with this image for our first project here. 98. Create a Book Cover in Midjourney & AUTOMATIC1111: In my second project, I'm tasked to design a book cover for a new edition of Alison Wonderland adapted for children ages three to five. And here are some specifications, incorporate key elements from the book. It should be illustration with bright, vivid colors, with a resolution with at least 1920 by thousand 80, which has aspect ratio of 16 to nine. Let's do that. For this, I will start with N in my Discord account here. I'll just imagine, and I'll, because it's a famous book, or us should know about Alice in Wonderland. Then here, book cover. Let's see what it gives us. Okay, Here we've got our images. But I think for children's book, this is too much. First I'm going to go and look for some inspirational images on the internet. Some children's book covers. Okay, here I put Alice in Wonderland book cover. And I'm going to go and choose the one that I like the most here. I like this one. I think it's pretty simple. If we zoom in, it's pretty simple. And I like the cal palette here. Okay, I will save the image. Now I want to use this image for mid journey. The only problem is that it has a lot of text. I don't want I to be influenced by the text. I don't want it to generate more text. For that, I'm going to go to clip drop here. In the tools we can choose, either clean up or the text remover. Let's start with the text remover. Here is the image. Okay, the text remover was pretty bad. Actually, I'm going to cancel it. Go back to clip drop and choose the clean up tool. And then again choose the image here. And then I'll just erase all the part that I don't want to see. Okay, now let's clean this. This is way better than the remove text one here. I will download it. Great. Now I want to use it as the inspiration, but first I want to understand what style is a so I can give better description to my journey. I'm going to go and use the describe Commando Ter. Okay, here we have Alison Wonderland in the style of whimsical children's book illustrator. Light orange and so on. Fairy tale. Right now I'm looking for the style storybook like soft edge. Okay, soft edged. Then we have watercolor illustration from here, for example. I can use soft edge and watercolor illustration. Now I want to use that as a reference for my prompt. I will use the image address here. Copy image address here again. I'll click, imagine I'll paste the link here. Now I will write my prompt. Let's put book cover, Alice in Wonderland. Now I'll describe what I want to see in the scene. I want a small girl in a blue dress in sun lit with flowers following a rabbit. Let's try that. Actually, I forgot to add the style here. I'm going to do another one. The same prompt, but just add more details here. I'll put soft edged and then water color illustration. Let's check this out. Okay, I think the fourth one is pretty cute, but there's definitely problem with the rabbit. But it's okay, the rabbit is not that important. I can always paint and change the rabbit. Okay. But for now, let's scale or let's generate more versions of the number three here. Let's see the variations. Okay, I think this is pretty cute. I like the first one. I can upscale it upscale. The first one here is our upscaled version, which is pretty good. Let's see the other ones. This one was with the soft edge watercolor illustration. I think here we have too much of the watercolor aesthetic, although maybe the fourth one is not too bad. Okay, let me show you my settings here. If I go to settings, I'm generating images with the latest model, 5.2 and I'm using the stylized medium. Maybe I can make it a little bit lower so we have more simpler images. Let me try that. We can, I'll copy the prompt here. I like the images without the watercolor illustration. I think they came out a bit better. We'll continue using the more simpler prompt here. I'll put imagine, I want to change my stylized parameter, so I'll put stylize 100. Also, we were tasked to create image in 16 to nine. Aspicratioiill. Also add that. Aspicratio 16 to nine. We need to space here, great as generating a bunch of images with the same prompt. Here is the result that I liked in particular, number two here. For the prompt, I kept it the same, but I added the negative, no trees or tree. That's because I was getting just too many trees in the image. Now, even though there are still trees but way less dense, then I've upscale the image. Here is the result here. I think it's too zoomed in, it's best to zoom out the image I did, particular that I zoomed out 1.5 x here. The result which I really liked, the only problem here is that the resolution of the image is not great. It cannot be used anywhere. Also, I want to change the rabbit here because it doesn't match the Alice in Wonderland theme. Okay, the girl is great here reading a book by the tree. In order to upscale the image, there are different services we can use. For example, Big GPG. For Big GPG, we can upload the image, Let's choose four X art work and noise reduction. Let's move it to high. Okay, great, let's start here, Let's check out the result. Now the image has way higher resolution, but I think in terms of quality, it's not great. We have some problems with the hands here. I also want to remove the rabbit. Actually I'm going to use this image and put it in stable diffusion. Automatic 11, 11. Let's do that. Going to close this image. And I'm going to go to run diffusion here. I'll go to image, to image. I'm going to open this in paint and I'll upload my image here. Not the upscaled version, but the mid journey version. Here we have our image. I can move it. Okay, here, I want to change a few things. I think her face is not sharp. I'm going to change that. But before I do that, I want to change the model. I want to use a model that's great with faces, so I think Dream Shaper should be good for this kind of image. Let's try that. Dream Shaper. Dream Shaper seven, Okay, now it's loaded here. For the prompt, I'm going to use something similar to mid journey Alice in Wonderland. A small girl. Let's just keep a small girl here. I will raise her face. Okay, Now let's go to settings down here. Mask content, Original pad area A masked to increase the resolution. And then for denoising strength, I want to keep it as close to the original as possible. So let's maybe put 0.2 Okay, now let's generate. Let's check this out. Okay, not my favorite. Let's try one more time here. For the prompt, I will actually add the style by Excel Scheffler. Also for the negative prompt here I already have some negative prompts, I will use that. Okay, now let's increase the count for the batch count. We can generate more images. Batch count three, de noising strength, let's make it even less, it's more similar to the original image. One point, let's put 0.15 Okay, let's check them out. Okay, this time I think it's way better. I like the first one here. I'm going to use it to load to the paved. Okay. Now I'm going to go and remove the rabbit here. I'm going to use a bigger brush for this. I'm going to use a different model. I'm going to use the painting model, stable diffusion version 1.5 Okay, now here I just put grass field, it's a bit blurry. I'll also put blurry. Okay, here I don't need the negative prompt for the settings, I want them to be very different here. I can choose latent noise and then increase my de, noising strength. Or I can use original and literally like 0.90 87 or 98 to have very different image. Okay, let's try that. Okay, let's check out the result here. It added some flowers. This one is pretty good, okay? The first or third one did a pretty good job. Maybe I will use the first one here. I'll again set into paint. Now it would be nice to merge the edited part with the surroundings. We can easily do that. Again, I'll just my brush here. Let's change our de noising strength to zero point around 0.4 I think that's going to be good. Let's generate again, let's see. Okay, now it's a bit better here. Now as my last step, I also want to add a rabbit here behind the tree. I think that's going to create a nice composition. I'm going to use maybe the second image here and I'll set it to paint. Now in the spot I want to see a rabbit. For our settings, let's use original only masked here for denoising strength. Let's make it one because we want a completely different thing on that spot. Okay, and here I'll put a white rabbit watching from the tree. After generating a few more images, I had to change my prompt. I added the style by all, it was more in the same style. Then for the settings, I also changed a little bit here, I changed the Generis strength to 0.9 I like this one, but it Mrs. the ear. Let's see, the other ones, I think the bit is a bit small, this is just a mass. There are a few more things that we can in paint. Like our hands, the head bow here. I did it in the same way, was choosing an image, then sending it to paint, then removing the element that it didn't like, and then constantly improving the. Finally, I send the image to extras, and then I chose the four x ultra mix balance up scalar and I resize it by a factor of four. Here is the result. As you can see, I've merged the grass here. I changed the rabbit. I also improved the hands here, and I edited the head bow here. Then I've upscaled the whole image. I think we got pretty good results here that can be used for book cover. Also, remember at the Firefly module where I showed you that you can create fun letters. Let's try that Firefly, Adobe.com Here we can choose text Effects. If you want to use letters for a book cover, then you can also use this Firefly Adobe. For example. Something interesting with flowers, magical forest, for example. Here we can put Alice. Now we get interesting results that can be incorporated as part of the book. The only problem is let's download this. The problem here is that we will need to clean all of this up, which we can do with Clip Drop which allows us to clean up things. But another problem is right now you cannot use Adobe Firefly for commercial use. That's another thing. But otherwise, I think the concept is pretty fun that maybe in the future when they allow the commercial use, you can generate fine letters for book covers. Also, I want to show you from what we've started and our final result. Here is the mid journey image that we generated. And here is the final result, that's a way better quality that we were able to generate in automatic 11, 11. 99. Create a Logo in Leonardo, Midjourney, ClipDrop, Vectorizer and Firefly: For the third project, I'm tasked to design a logo for a newly opened space alone. Harmony and balance Spat, and here are specifications. Create elegant and captivating logo that embodies the space essence of tranquility and rejuvenation. The logo should have a modern dynamic and minimalistic style. Avoid excessive details focusing on creating a clean and memorable design. Okay, let's try it out here. I can use different platforms like Mid Journey Leica for Adobe Firefly. I would probably use it if it were available for commercial use because I think with the logos, we got really nice results with its AI generator. Let's actually start with the pond for this, I can go to Leonardo and go to AA Image generation from Generation. Here I just need some ideas for my logo and leonadogenerates nice ideas. Let's use it for example, Spa logo. I could also probably use GPT, but let's use Leonardo, since we've covered that in the course. Okay, A vibrant abstract logo design featuring a Trent keel pool of water surrounded by lash greenery. Oh interesting. Here we have the lotus flower that's very typical for spa salons. Modern logo design featuring a stylized spot building. I think that's too much stylized sun rising over Trunk Lake. This seems very interesting. Let's actually try that because Mid Journey is great with simple products. I'm going to use Mid Journey next. And he'll just put, imagine from over project we want the modern and minimalistic style. Let's add that minimalistic. Okay, okay, here we've got some logos, but I think it's beautiful but it has too much detail. Maybe I want to focus on a specific object. I'm going to go back to Ad and maybe generate more prompt Spa logo. Let's generate a few more ideas. In the meanwhile, I will also Google images. Many of the images here have the Lotus. Let's do something different but the Lotus or like the rocks here. Okay, actually let's go and just for mid journey, I will try very broad term. I'll just put Spa logo and I'll put simple Modern. I'll just put minimalistic and symbolic here. I will increase chaos because I want variation and chaos. Let's put ten. Okay, Let's see if Lado generated some interesting ideas, okay? Vibrant abstract logo design. Lotus flower, single lotus flower. Lotus flower. Okay. Everything is lotus flower. That's not fun. Okay? Okay, so here are some images I like the theme of the first and the third one here just looks strange like a mask, nose, mouth. This one is the lotus and it's gold. That's like a luxury spot. Not bad. But I want to make something different. Here in the images, I found that in one of the logos, they use candles. Let's try to use candles in the logo. Okay, I'm going to go back to Journey. Imagine and again, spl, let's portraying candles minimalistic, symbolic. Let's put harmony. Let's put chaos. Actually, I think let's increase the chaos a little bit. Maybe 20 in the meanwhile, I'm going to go back to Leonardo and put candles there. Maybe it's going to generate some cool problems with that. Spo candles featuring candles. More ideas. A single candles right by a ring of delicate petals. Bouquet flickering candle of vibrant flowers. That's nice. Surrounded by a, surrounded by a circle of vibrant blooms nested in a bed of vibrant flowers. Here I got the idea. Candles with leaves, flowers. We can try that starry night. No, I don't want, it just adds more details. Let's put minimalistic here. Let's see if that's going to help. Okay, so here we have minimalistic spy logo featuring a single candle. Okay, now this is more simple, modern spy logo with single candle surrounded by ring of liquin flames. Interesting. Surrounded by circle of soft light, glowing ember, circle of source. Okay, Circle of intricate pattern. That's interesting here. It's surrounded by circle of intricate designs. Basically the same thing. Actually, let's try the first one and the seventh one. We'll generate here. Generate. Okay, let's go back to mid journey. Okay, now we've got something interesting. This is basically what Leonardo was telling me. That the candle is surrounded by some flower. I like that, we can work with that here. If I like a specific image here, I can upscale it. Number three here, I will actually save it. I'm going to go and describe here. We've got some description here. We have in the style of light turquoise and light white. I like the turquoise color here, The energy filled illustrations. Indian traditional oil in a lotus flower logo. A candle with branches and leaves, a lotus flower logo with a candle burning inside it. I like a candle with branches and leaves. Here, I'm going to use that. I want something like this image here. It gives this luxury vibes. I'm going to also upscale it now. I actually want to blend those two together and see the result. I'm just going to click Blend here. I'll put first one and the second one here, dimensions we want the square, Let's use that. Even though it's still generating, I can see that we're not getting this black background, which I want. Let me describe this image as well. Describe, okay, so here we have sharp attention to detail. Okay, A logo. Okay, that's the name of it. It nicely took it from the image here. Gold in the style of serene peaceful ambience. That's very vague in the style of gold leaf. Okay, again, very vague. This is what I'm going to do. I'm going to take the description of this image here. I'm going to use the image address of this image. Okay, let's write a prompt. Imagine I will use the image address of this one, called the image address pasted here. Now I will further describe may prompt a candle. Actually, let's start with, it's a logo featuring a candle surrounded by petals, intricate patterns. This is what I took from Lead. Let's see actually what Leonato has generated. Okay, so these are like this intricate patterns, but it's too complicated and this one is too simple. It looks more like a photo than a logo. I'm going to stick with mere journey, but I like the ideas here and I will use them. Let's go here. Intricate patterns. Let's try that. Okay, this starts to look interesting. Maybe intricate patterns was too much because it adds too many details. I'll take bit of that. I'll change my pro. Okay here, I just can click control to get my prompt. I will change it here. I'll remove the intricate patterns because I don't want too much details. And also I will make stylize, let's put 50. It's more simple. Also I'm thinking if I want to make the image weight higher or lower here I got very similar images. Maybe I'll make the image weight a little bit lower. The default one is one. I'll put 0.9 I also want to add chaos. Let's put Chaos and let's put 15. I'll also generated a couple of times, I don't wait a long time. There is actually a parameter that you can add to your prompt that allows you to generate a few batches. You don't have to write it or copy and paste it yourself. Here in the parameter list, it's called repeat. You just can put this and that will create multiple jobs for a single prompt. Okay, let's see the results. Okay, this is interesting. I like the symbol here, but again we get the lotus. Let's see the other ones here. We have the candle, candle. Okay, maybe this one is pretty good. I like the candle holder. I don't know if that's it, but I assume it is. I want to generate more variations of this logo. I think it's simple and pretty. Okay. I still like this one way more. I'm just going to upscale it. Let's see if we can add more of the black aesthetic. Again, I'm going to try to blend this image with this one and see what's going to happen. You'd never know what the blend function as the images develop. I don't like any of them here. I still really like this one. But I'm not a fan of this flower here. What I'm going to do, I'm going to go to clip drop, then I'm going to use clean up tool and I'm going to upload my image here. Now I want to remove this flower. Okay, let's clean it. Removed it, but now it doesn't matter. I'll just save it. I'll go back to mid journey. I actually upload the images here. I'm going to upload the image. I'm going to use its image address. Imagine and then copy image address. And then I'll use the same prompt here. I'm thinking if I should use the other image as well. Okay, let's try to use that as well. So complete image address here. Let's paste it here. Now we have two images, okay, Spa logo featuring a candle surrounded by petals. Let's put minimalist and line design. Okay, style image weight. Let's make it higher, 1.2 I think that should be good. I'll just repeat it a few times. As I already said, you can use the repeat button here and put how many batches do you want? Let's put three more here. We are asked if we want to imagine three prompts. Yes, from all these generations. I like this one. I'm going to create more variations of the number for the first one here and the fourth one. I'm going to also generate some variations and then choose at the final logo. Okay, let's see, these are nice, like the first one here. Let's see, the other one here, I like number three. Okay, Here, I don't like any of them here. I want to choose the third one. I'm going to upscale it. Upscale number three here. Okay, I think it matches our description here. It's modern. It's capturing the tranquility and rejuvenation mood. And also we don't have too much details. The one thing I want to change is I want to get rid of this spiral thing. Here again, I'm going to go to clip drop. But first I need to save the image. Let's go back. Okay, I think that's good. Okay, beautiful. That's all I need. Let's download now. I'm going to go to Vectorize it and make a vector out of this image. Now let's use Vectorizer to convert our image to vector. Okay, great. Now we've got the vector, The quality is grade. The only problem with the background here, but it's easily removed. Let's download the SVG file grade. Now I can actually use Adobe Firefly to change the colors of my vector image. If I don't like the colors, I can try different color palettes with the generative recolor. The only problem here is that firefly is not available for commercial use yet. So I'm not sure how it plays out with the genes of free color because I'm here uploading my own image. I would be careful with that and just use that for inspiration. Let's say here, let's put black. Okay, now change to black and white. I like this here. Let's see other styles. Let's see the sandy stone. Let's use the lavender storm, the pink one here, Salmon sushi. Interesting here. Let me play around a little bit and choose the best one. I really like this one. I'm going to download it now because it has all this background. I'm going to go to online SG editor like this one. And I'm just going to upload my image now I will remove all the background here. Here is a final result after removing the background, which was very easy here. It's all really good and the lines are sharp. The only problem is this fire flame here, I, It just needs a little bit of tweaking and that's all here. The font that was generated by Mid Journey, I pretty like this font, so it can be used as the inspiration for actual company name. Here, this was our final project and you've made it to the end of the course. Congratulations, I hope you enjoyed the course. Found it useful. Please consider leaving a review for the course because it will help me to create more tailored content for you in the future. With that being said, I wish you best to flag with your AA art journey.

Generative AI Art Generation: Mastering all the AI Tools - Midjourney, BW, DALL-E, SD, Runway, etc.

Henry Learning, Instructor | AI Entrepreneur

Watch this class and thousands more

Watch this class and thousands more

Lessons in This Class

1.

Welcome to AI Art Generation

3:05

2.

AI Image Generator Apps Introduction

12:04

3.

AI Image Editing Apps Introduction

7:49

4.

DALL-E Introduction

7:24

5.

DALL-E Image Generation

10:22

6.

DALL-E Image Editing

9:47

7.

DALL-E Outpainting

8:44

8.

Prompt Examples

13:11

9.

New Update: Comparison Between DALL-E 2 and DALL-E 3

14:00

10.

New Update: DALL-E 3 with Bing vs DALL-E 3 with ChatGPT

6:47

11.

New Update: DALL-E 3 with ChatGPT

17:35

12.

New Update: Examples and Use Cases of DALL-E 3

17:55

13.

New Update: DALL-E 3 New Parameter Gen_ID

11:07

14.

Prompt Writing - Subject and Medium

10:17

15.

Prompt Writing - Composition, Action and Details

10:59

16.

Prompt Writing - Negative Prompt, Stylizers & Modifiers

11:30

17.

Prompt Writing - Artists

8:12

18.

Prompt Sample - Portrait

14:52

19.

Prompt Sample - Landscape

10:02

20.

Prompt Writing Resources

9:02

21.

Lexica Introduction

8:11

22.

Lexica Features

9:39

23.

Lexica Image Generation

7:15

24.

Prompt Guidance Parameter

10:28

25.

Lexica Image to Image Generation

9:22