Welcome to episode twenty two! This is your host, Doug Smith. This is Not An AI art podcast is a podcast about, well, AI ART โ technology, community, and techniques. With a focus on stable diffusion, but all art tools are up for grabs, from the pencil on up, and including pay-to-play tools, like Midjourney. Less philosophy โ more tire kicking. But if the philosophy gets in the way, we'll cover it.
But plenty of art theory!
Today we've got:
Available on:
Show notes are always included and include all the visuals, prompts and technique examples, the format is intended to be so that you don't have to be looking at your screen โ but the show notes have all the imagery and prompts and details on the processes we look at.
Maybe a little narrow, but, interesting idea. I'm curious how it was made honestly.
I didn't always have awesome luck with it. Somethings it works a lot, other times, it doesn't always.
the eiffel tower made of water, RAW photo, analogue style <lora:Aether_Aqua_v1_SDXL_LoRA:0.9>
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 3974890538, Size: 816x1024, Model hash: 0724518c6b, Model: juggernautXL_v7Rundiffusion, Lora hashes: "Aether_Aqua_v1_SDXL_LoRA: 87cec2fbc297", Version: v1.5.1
Didn't quite work.
Flappers didn't work well.
a sailboat made of water, RAW photo, analogue style <lora:Aether_Aqua_v1_SDXL_LoRA:0.9>
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 3476901789, Size: 816x1024, Model hash: 0724518c6b, Model: juggernautXL_v7Rundiffusion, Lora hashes: "Aether_Aqua_v1_SDXL_LoRA: 87cec2fbc297", Version: v1.5.1
A little better.
diabolical genius cat, evil, cyberpunk
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 2011832561, Size: 816x1024, Model hash: 5db7e0c07f, Model: CatGen, Version: v1.5.1
steampunk boat captain cat
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 3078443858, Size: 816x1024, Model hash: 5db7e0c07f, Model: CatGen, Version: v1.5.1
cat dressed as a 1920s flapper, in a speakeasy
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 2368208589, Size: 816x1024, Model hash: 5db7e0c07f, Model: CatGen, Version: v1.5.1
a cat in Montmartre, illustration by Henri de Toulouse-Lautrec
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 1527610832, Size: 816x1024, Model hash: 5db7e0c07f, Model: CatGen, Version: v1.5.1
I noticed they're using pretty low CFG, like 3-5 CFG.
a flapper gazes across the bar at a speakeasy in the year 1927, evocative lighting, dynamic pose, smoke fills the air, nightlife, parisian, RAW photo, analog style, film noire, color photography by __stoddard/favoritephotographers__
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 4130560422, Size: 816x1024, Model hash: d6ff242dc7, Model: icbinpXL_v10, Version: v1.5.1
she ponders the beauty of the mist in the Vermont mountains, blonde woman, style of outdoorgals, vivid sunset, RAW photo, anamorphic 35 mm lens, outdoor fashion, travel, Landscape Photography, 2022 trending photo, trending on instagram, color photography by Petra Collins <lora:outdoorgals_v1-000007:0.65>
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 1575979881, Size: 816x1024, Model hash: d6ff242dc7, Model: icbinpXL_v10, Lora hashes: "outdoorgals_v1-000007: 9a5f53db83c6", Version: v1.5.1
she is the blacksmith's daughter in the dark workshop, RAW photo, bokeh, depth of field, anamorphic 35 mm lens, 2022 trending photo, trending on instagram, color photography by Jamie Baldridge
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 2070040037, Size: 816x1024, Model hash: d6ff242dc7, Model: icbinpXL_v10, Version: v1.5.1
It's a merge focused on psychedelia, sci-fi and surrealism.
Sounds fun!
highres, depth of field
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 1815447005, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
psychedelia
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 3585306257, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
1920s flapper, lsd
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5, Seed: 2740167148, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
she is on an alien planet in the winter, jacket, sci fi, RAW photo, depth of field <lora:outdoorgals_v1-000007:0.65>
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 4, Seed: 2001459026, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Lora hashes: "outdoorgals_v1-000007: 9a5f53db83c6", Version: v1.5.1
gr0lkonterous furchbronk turler
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 339098617, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
[psychedelic woman:she is the bread loaf master of Antwerp:0.2], ((photorealistic, hyperrealistic, 8k, 4k, highres, UHD))
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 2, Seed: 3216008249, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
blue cheese as a surrealist centerpiece
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3.5, Seed: 213663120, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
Early results for Juggernaut XL v8
Playground v2
Echo AI
One Trainer
Rad integration with a Midi controller
LCM Inpainting with Crita
Sweet workflow to create a blender model and then texture it
From this /r/Stabledifussion thread
So, I didn't pick this one because I like it. I picked it because I actively dislike it.
Look at how lazily this was sup together. I don't think the generations were even iterated AT ALL.
Text is messed up. The car is too small. The hands haven't been fixed. What is she being handed, a remote detonator for TNT?
โฆThis is a great example of why you SHOULD iterate. It looks lazy. It's not good advertising. Guess what? I don't want to transport my vehicle with this company.
Also the oversaturated garbage here doesn't work. When it's stylized and it's part of the style, great. When it's just likeโฆ Looks fake? Not doing anything for me.
Criminal how few upvotes there are for this. It's a DALL-E gen.
Few things in this are looking really cool. I love the sci-fi look to it, and there's a 70s vibe that's coming through.
The composition is pretty cool, and the way that there's the two chairs facing one another look like it's achieving a lot of assymetrical balance. It works really well.
Something I probably would do if it were mine is to change the scale of the robot. While having it be big is interestingโฆ It might just look better if there's a similarly human scale.
It needs hand fixes. It needs likeโฆ worthless objects to be removed or to be made into a thing. I'd do something about the messy cabling. I like it in some ways, but, in others it needs to work with the piece and/or be cleaned up.
But it's a really good initial generation, doesn't look like it's been iterated.
Before it's a little shabbyโฆ not enough fingers plus it's too long
So I generally just quick touch up the fingers with a little painting.
Then, I need to resize, so I select and then paste this area as a new layer.
I position it to resize the hand, make it a little smaller
Then I mask for inpainting.
Inpainted "at full resolution / inpaint only masked" at 0.19 denoise.
Realized I had a mistake earlier and it wasn't at high enough res, so I made another passโฆ
Side by side before/after
And the finalโฆ