![pod logo](https://i.imgur.com/SlYH9da.png =600x408)

## Intro

Welcome to episode twenty two! This is your host, Doug Smith. This is Not An AI Art Podcast is a podcast about, well, AI ART – technology, community, and techniques. The focus is on Stable Diffusion, but all art tools are up for grabs, from the pencil on up, including pay-to-play tools like Midjourney. Less philosophy – more tire kicking. But if the philosophy gets in the way, we'll cover it. And plenty of art theory!

Today we've got:

* Model madness: Model review on 3 models and 1 LoRA
* Bloods and Crits: Art crits on two pieces
* Technique of the week: A hand fix process.

Available on:

* [Spotify](https://open.spotify.com/show/4RxBUvcx71dnOr1e1oYmvV)
* [iHeartRadio](https://www.iheart.com/podcast/269-this-is-not-an-ai-art-podc-112887791/)
* [Google Podcasts](https://podcasts.google.com/feed/aHR0cHM6Ly9hbmNob3IuZm0vcy9kZWY2YmQwOC9wb2RjYXN0L3Jzcw)

Show notes are always included and contain all the visuals, prompts and technique examples. The format is intended so that you don't have to be looking at your screen, but the show notes have all the imagery, prompts and details on the processes we look at.

# Model madness

## Aether Aqua LoRA XL

* On [Civitai](https://civitai.com/models/210754/aether-aqua-lora-for-sdxl)
* From [reddit thread](https://www.reddit.com/r/StableDiffusion/comments/182ta22/new_lora_aether_aqua_turns_stuff_into_water/?share_id=4Fv1cM5lb1rVWp_5xMDWa&utm_content=1&utm_medium=android_app&utm_name=androidcss&utm_source=share&utm_term=1)

Maybe a little narrow, but an interesting idea. I'm curious how it was made, honestly.

I didn't always have awesome luck with it. Sometimes it works really well; other times, not so much.
```
the eiffel tower made of water, RAW photo, analogue style <lora:Aether_Aqua_v1_SDXL_LoRA:0.9>
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 3974890538, Size: 816x1024, Model hash: 0724518c6b, Model: juggernautXL_v7Rundiffusion, Lora hashes: "Aether_Aqua_v1_SDXL_LoRA: 87cec2fbc297", Version: v1.5.1
```

Didn't quite work.

![aqua-tower](https://hackmd.io/_uploads/B10PVfIIT.jpg)

Flappers didn't work well.

```
a sailboat made of water, RAW photo, analogue style <lora:Aether_Aqua_v1_SDXL_LoRA:0.9>
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 3476901789, Size: 816x1024, Model hash: 0724518c6b, Model: juggernautXL_v7Rundiffusion, Lora hashes: "Aether_Aqua_v1_SDXL_LoRA: 87cec2fbc297", Version: v1.5.1
```

A little better.

![aqua-boat](https://hackmd.io/_uploads/By62NfL8T.jpg)

## Catgen XL

* [on Reddit](https://www.reddit.com/r/StableDiffusion/comments/188a5ie/i_trained_the_biggest_checkpoint_on_cats_and/?share_id=5xctoNvi69NQVlCGL-h8V&utm_content=1&utm_medium=android_app&utm_name=androidcss&utm_source=share&utm_term=1)
* [On Tensor.art](https://tensor.art/models/666115102501649482)

```
diabolical genius cat, evil, cyberpunk
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 2011832561, Size: 816x1024, Model hash: 5db7e0c07f, Model: CatGen, Version: v1.5.1
```

![catgen-genius](https://hackmd.io/_uploads/SyaL9XLU6.jpg)

```
steampunk boat captain cat
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 3078443858, Size: 816x1024, Model hash: 5db7e0c07f, Model: CatGen, Version: v1.5.1
```

![catgen-steampunk](https://hackmd.io/_uploads/BJWJjXLIp.jpg)

```
cat dressed as a 1920s flapper, in a speakeasy
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 2368208589, Size: 816x1024, Model hash: 5db7e0c07f, Model: CatGen, Version: v1.5.1
```
![catgen-flapper](https://hackmd.io/_uploads/HyLro7ULp.jpg)

```
a cat in Montmartre, illustration by Henri de Toulouse-Lautrec
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5.5, Seed: 1527610832, Size: 816x1024, Model hash: 5db7e0c07f, Model: CatGen, Version: v1.5.1
```

![cat-lautrec2](https://hackmd.io/_uploads/SJSr278Ua.jpg)

## ICBINP XL

* [On Reddit](https://www.reddit.com/r/StableDiffusion/comments/18fi104/icbinp_xl_v10_released/?share_id=Q7zrfoBdbClWTNTs7MNQ2&utm_content=1&utm_medium=android_app&utm_name=androidcss&utm_source=share&utm_term=1)
* [On Civitai](https://civitai.com/models/229002)

I noticed they're using pretty low CFG, like 3-5 CFG.

```
a flapper gazes across the bar at a speakeasy in the year 1927, evocative lighting, dynamic pose, smoke fills the air, nightlife, parisian, RAW photo, analog style, film noire, color photography by __stoddard/favoritephotographers__
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 4130560422, Size: 816x1024, Model hash: d6ff242dc7, Model: icbinpXL_v10, Version: v1.5.1
```

![icbxl-flapper1](https://hackmd.io/_uploads/ryM-8VP8p.jpg)

```
she ponders the beauty of the mist in the Vermont mountains, blonde woman, style of outdoorgals, vivid sunset, RAW photo, anamorphic 35 mm lens, outdoor fashion, travel, Landscape Photography, 2022 trending photo, trending on instagram, color photography by Petra Collins <lora:outdoorgals_v1-000007:0.65>
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 1575979881, Size: 816x1024, Model hash: d6ff242dc7, Model: icbinpXL_v10, Lora hashes: "outdoorgals_v1-000007: 9a5f53db83c6", Version: v1.5.1
```

![icbxl-outdoorgal](https://hackmd.io/_uploads/HJPPPVw8p.jpg)

```
she is the blacksmith's daughter in the dark workshop, RAW photo, bokeh, depth of field, anamorphic 35 mm lens, 2022 trending photo, trending on instagram, color photography by Jamie Baldridge
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 2070040037, Size: 816x1024, Model hash: d6ff242dc7, Model: icbinpXL_v10, Version: v1.5.1
```

![icbxl-blacksmith](https://hackmd.io/_uploads/BJYwd4DL6.jpg)

## Psyfi XL

* [reddit](https://www.reddit.com/r/StableDiffusion/comments/18g7ag5/psyfi_xl_v10/?share_id=BMByunWWF2gfy7v9ZPv7H&utm_content=1&utm_medium=android_app&utm_name=androidcss&utm_source=share&utm_term=1)
* [civitai](https://civitai.com/models/228162)

It's a merge focused on psychedelia, sci-fi and surrealism. Sounds fun!

```
highres, depth of field
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 1815447005, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
```

![psy-random](https://hackmd.io/_uploads/rknNz8vL6.jpg)

```
psychedelia
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 3585306257, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
```

![psy-psych-1mg](https://hackmd.io/_uploads/r14BLUvLp.jpg)

```
1920s flapper, lsd
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 5, Seed: 2740167148, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
```

![psy-flapper](https://hackmd.io/_uploads/B19Hu8P8T.jpg)

```
she is on an alien planet in the winter, jacket, sci fi, RAW photo, depth of field <lora:outdoorgals_v1-000007:0.65>
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 4, Seed: 2001459026, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Lora hashes: "outdoorgals_v1-000007: 9a5f53db83c6", Version: v1.5.1
```

![psy-outdoorgals](https://hackmd.io/_uploads/B1O5q8PL6.jpg)

```
gr0lkonterous furchbronk turler
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 339098617, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
```
![psy-jibberish](https://hackmd.io/_uploads/B18n28vLa.jpg)

```
[psychedelic woman:she is the bread loaf master of Antwerp:0.2], ((photorealistic, hyperrealistic, 8k, 4k, highres, UHD))
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 2, Seed: 3216008249, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
```

![psy-breadloafmaster](https://hackmd.io/_uploads/B1wGRUDLT.jpg)

```
blue cheese as a surrealist centerpiece
Negative prompt: 3d, render, cgi
Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 3.5, Seed: 213663120, Size: 816x1024, Model hash: 4f7a5a3b2d, Model: psyfiXL_v10, Version: v1.5.1
```

![psy-cheese](https://hackmd.io/_uploads/HJ6NFcP8a.jpg)

## Other resources

* Early results for Juggernaut XL v8
    * [On Reddit](https://www.reddit.com/r/StableDiffusion/s/oDNkH3KjTT)
* Playground v2
    * https://blog.playgroundai.com/playground-v2/
    * https://huggingface.co/playgroundai/playground-v2-1024px-aesthetic
    * Might be available in comfy? or fooocus?
    * Not in a1111 yet.
* Echo AI
    * From the person who trained the RPG model
    * Some kind of generative AI art game?
    * [Reddit thread and previews](https://www.reddit.com/r/StableDiffusion/comments/18fcdog/echoai_git_hub_update/)
    * [GitHub Repo](https://github.com/Anashel-RPG/echoai)
* OneTrainer
    * [reddit thread](https://www.reddit.com/r/StableDiffusion/s/WDo5LwHHHH)
    * https://github.com/Nerogar/OneTrainer
    * My pull request: https://github.com/Nerogar/OneTrainer/pull/86
    * I got it working, but haven't done a training run yet!
* Rad integration with a MIDI controller
    * [On reddit](https://www.reddit.com/r/StableDiffusion/s/YgKkiYaDD1)
* LCM Inpainting with Krita
    * [On Reddit](https://www.reddit.com/r/StableDiffusion/s/wGU619kP1J)
    * Looks like a cool way to basically... inpaint from scratch?
    * I want to try it, but haven't gotten around to installing Krita.
* Sweet workflow to create a Blender model and then texture it
    * [On Reddit](https://www.reddit.com/r/StableDiffusion/s/to8UX0LQMT)

# Bloods and crits

## The world's laziest advertising

From [this /r/StableDiffusion thread](https://www.reddit.com/r/StableDiffusion/s/bcfbf5E4hw)

So, I didn't pick this one because I like it. I picked it because I actively dislike it.

Look at how lazily this was put together. I don't think the generations were even iterated AT ALL. The text is messed up. The car is too small. The hands haven't been fixed. What is she being handed, a remote detonator for TNT?

...This is a great example of why you SHOULD iterate. It looks lazy. It's not good advertising. Guess what? I don't want to transport my vehicle with this company.

Also, the oversaturated garbage here doesn't work. When it's stylized and the saturation is part of the style, great. When it just... looks fake? Not doing anything for me.

![](https://preview.redd.it/ee3o360ret1c1.jpg?width=640&crop=smart&auto=webp&s=53d7e83391193efb6d95a1c20bf7e988d532649f)

## 1970s Dating Robot

* [On Reddit](https://www.reddit.com/r/weirddalle/comments/185yyrh/dating_robots_in_the_70s_when_you_said_lets/)

Criminal how few upvotes there are for this. It's a DALL-E gen.

A few things in this look really cool. I love the sci-fi look of it, and there's a 70s vibe coming through. The composition is pretty cool, and the way the two chairs face one another achieves a lot of asymmetrical balance. It works really well.

Something I'd probably do if it were mine is change the scale of the robot. While having it be big is interesting... it might just look better if the robot were at a similar, human scale.

It needs hand fixes. It needs, like... the meaningless objects removed or made into an actual thing. I'd do something about the messy cabling. I like it in some ways, but in others it needs to work with the piece and/or be cleaned up.
But it's a really good initial generation; it doesn't look like it's been iterated.

![](https://preview.redd.it/g46jvi8qz33c1.jpg?width=960&crop=smart&auto=webp&s=68a27ffcbbfcbbc1774235db75bd38e1d95fa238)

# Technique of the week: A hand process I used earlier

Before, it's a little shabby... not enough fingers, plus the hand is too long.

![hand-before](https://hackmd.io/_uploads/Syp_SGqSa.jpg)

So I generally just do a quick touch-up of the fingers with a little painting.

![hand-overpaint](https://hackmd.io/_uploads/HyXiSzcra.jpg)

Then I need to resize, so I select the hand and paste that area as a new layer.

![hand-select](https://hackmd.io/_uploads/BJ7CBMcBp.jpg)

I position and resize the hand, making it a little smaller.

![hand-resize](https://hackmd.io/_uploads/rkbf8McSa.jpg)

Then I mask for inpainting.

![hand-mask](https://hackmd.io/_uploads/r1mQwQ9H6.jpg)

Inpainted "at full resolution / inpaint only masked" at 0.19 denoise.

![hand-touchedup](https://hackmd.io/_uploads/rkBEOQcrp.jpg)

I realized I'd made a mistake earlier and it wasn't at a high enough res, so I made another pass...

![hand-lastinpaint](https://hackmd.io/_uploads/SkhShXcrp.jpg)

Side by side before/after:

![hand-beforeafter](https://hackmd.io/_uploads/BkDC3Q5Sp.jpg)

And the final...

![victorian-insta-c8-jpg](https://hackmd.io/_uploads/BJdXJP9BT.jpg)
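
If you'd rather script the inpaint pass than click through the UI, here's a rough Python sketch of what that request might look like. It assumes an AUTOMATIC1111-style web UI running locally with its API enabled. The field names are my understanding of its `/sdapi/v1/img2img` endpoint, so check them against your own install's `/docs` page. The helper names, the local URL, and the padding value are placeholders I made up for illustration. Only the 0.19 denoise and the "only masked" setting come from the process above.

```python
import base64
import json
import urllib.request


def build_inpaint_payload(image_bytes, mask_bytes, prompt, denoise=0.19, padding=32):
    """Build a request body for a low-denoise, 'inpaint only masked' pass.

    image_bytes: the overpainted/resized image (PNG or JPEG bytes)
    mask_bytes:  a mask image where white marks the area to inpaint
    padding:     pixels of context around the mask (assumed value)
    """
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0 and 1")
    return {
        "prompt": prompt,
        # The API expects images as base64 strings.
        "init_images": [base64.b64encode(image_bytes).decode("ascii")],
        "mask": base64.b64encode(mask_bytes).decode("ascii"),
        # Low denoise keeps the hand-painted fix and only cleans it up.
        "denoising_strength": denoise,
        # "Inpaint only masked": render the masked region at full resolution.
        "inpaint_full_res": True,
        "inpaint_full_res_padding": padding,
        # 1 = start from the original pixels under the mask.
        "inpainting_fill": 1,
    }


def send_inpaint(payload, base_url="http://127.0.0.1:7860"):
    """POST the payload to a locally running web UI (URL is an assumption)."""
    request = urllib.request.Request(
        f"{base_url}/sdapi/v1/img2img",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        # The response carries the results as a list of base64-encoded images.
        return json.loads(response.read())["images"]
```

Building the payload separately from sending it makes it easy to tweak the denoise for a second pass, like the extra higher-res pass above, without touching the network code.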