by James | Dec 24, 2022 | Articles, Technology
Stable Diffusion v2.0 and prompts
The update brings improvements to SD’s text-to-image diffusion models, includes a powerful image upscaler, updates its Inpainting model, and more.
Also it appears to have made massive updates to the prompting system comparing Stable diffusion v1.4 to v2 gives pretty vast differences. They produced a prompts guide which was online but seems to have disappeared?
But irritatingly it can’t be downloaded as a .pdf, me being a massive nerd said f**k that and used python to pull the svgs and created a pdf.
Here are a few examples of how different things can get
So here are a few examples with the relevant seeds and prompts to show you how dramatically things have changed. Also, it should be mentioned a lot of celebrities have been removed from the model, NSFW images and famous artists like Rutkowski have been purged due to complaints of their style being essentially stolen by the machine.
What has been removed from Stable Diffusion’s training data, though, is nude and pornographic images. AI image generators are already being used to generate NSFW output, including both photorealistic and anime-style pictures. However, these models can also be used to generate NSFW imagery resembling specific individuals (known as non-consensual pornography) and images of child abuse.
Of course, there are a lot of angry Incels now wailing about censorship. This is bullshit, this stuff is open-source. No doubt a horde of horny teenagers are using PornHub and other sites to train NSFW models.
Prompt: Gandalf, d & d, fantasy, intricate, elegant, highly detailed, digital painting, artstation, concept art, matte, sharp focus, illustration, hearthstone, art by artgerm and greg rutkowski and alphonse mucha
Negative prompt: cartoon, 3d, ugly face, (disfigured), (bad art), (deformed), (poorly drawn), (extra limbs), strange colours, blurry, boring, sketch, lacklustre, repetitive, cropped, hands
Steps: 60, Sampler: DDIM, CFG scale: 12, Seed: 1940895508, Face restoration: GFPGAN, Size: 512×512, Model hash: 7460a6fa, Batch size: 6, Batch pos: 0
Prompt: Portrait digital art of Bill Murray from Scrooged (Arcane). wearing a suit, Christmas,
Negative prompt: cartoon, 3d, ugly face, (disfigured), (bad art), (deformed), (poorly drawn), (extra limbs), strange colours, blurry, boring, sketch, lacklustre, repetitive, cropped, hands
Steps: 40, Sampler: DDIM, CFG scale: 13, Seed: 3408805356, Face restoration: GFPGAN, Size: 512×512, Model hash: 7460a6fa, Batch size: 6, Batch pos: 0
by James | Dec 21, 2022 | Random Links, Writing
[et_pb_section fb_built=”1″ _builder_version=”4.16″ global_colors_info=”{}” theme_builder_area=”post_content” custom_padding=”0px|||||”][et_pb_row _builder_version=”4.16″ background_size=”initial” background_position=”top_left” background_repeat=”repeat” global_colors_info=”{}” theme_builder_area=”post_content”][et_pb_column type=”4_4″ _builder_version=”4.16″ custom_padding=”|||” global_colors_info=”{}” custom_padding__hover=”|||” theme_builder_area=”post_content”][et_pb_text _builder_version=”4.19.4″ background_size=”initial” background_position=”top_left” background_repeat=”repeat” hover_enabled=”0″ global_colors_info=”{}” theme_builder_area=”post_content” custom_padding=”||0px|||” sticky_enabled=”0″]
What is the full story
[/et_pb_text][et_pb_text _builder_version=”4.19.4″ background_size=”initial” background_position=”top_left” background_repeat=”repeat” hover_enabled=”0″ global_colors_info=”{}” theme_builder_area=”post_content” custom_padding=”||0px|||” sticky_enabled=”0″]
So recently the writer Kris Kashtanova used Midjournery AI to generate all the images in a comic, and then just recently the U.S. Copyright Office appears to be backtracking on its decision to grant protection to an AI-generated comic book. now on the face of it this seems like a no-brainer of course he shouldn’t be able to copyright the images generated by a machine, no matter how carefully they were obliged to type the prompts in to get the images they desired for the story (no mean feat).
However, there is a contentious element here for me, it looks like the USPTO is removing the copyright from the entire book including the writing. Now perhaps this isn’t the case and I’m misreading things but even in a comic book art and writing are two separate elements. Kashtanova didn’t use ChatGPT to write the story or text in the comic, that was his own work, the book should still be copyrighted to him even if the art isn’t his.
It would be like me finding royalty free for use images online and then using them to illustrate my story, am I now in danger of losing my other rights, what about a talented painter who uses ChatGPT to help write a story with prompts and then paints the scenes, what happens to their rights?
[/et_pb_text][et_pb_text _builder_version=”4.19.4″ _module_preset=”default” global_colors_info=”{}” theme_builder_area=”post_content”]
The future is going to get even messier
Sometimes it is hard to realize the incredible progress we’ve made in terms of cheap, accessible computing power and AI. Then something like this comes along and people are shocked, but this is really just the beginning of this, what we are witnessing is the tip of the AI glacier that is about crash down on society.
In 10 years AI art generators will be an order of magnitude better, and our ability to prompt them with far more nuanced, faces, eyes, and body proportions are all hit or miss currently. In 10 or 20 years we’ll see fusions of deepfakes, art and media that will change a lot of industries, forever.
Let’s take an example I love the film Scrooged so in 20 years I think hey let’s watch that, but you know I’ll make it a cartoon, like Arcane and add labelling to the action scenes like in Scott Pilgrim vs the World, maybe change out the bad guy for Alan Rickman. Then maybe 20/30 minutes later, I watch my new personalised version of Scrooged.
That’s not hyperbole, that’s just where things are going to be on the route we’re currently heading. If you think that sounds crazy, remember that 20 years ago the best kind of phone you could have was a Nokia.
[/et_pb_text][et_pb_image src=”https://jamesrtyrrell.com/wp-content/uploads/2022/12/06683-3408805356-Portrait-digital-art-of-Bill-Murray-from-Scrooged-Arcane.-wearing-a-suit-Christmas.png” _builder_version=”4.19.4″ _module_preset=”default” theme_builder_area=”post_content” hover_enabled=”0″ sticky_enabled=”0″ alt=”Portrait digital art of Bill Murray from Scrooged (Arcane). wearing a suit, Christmas,” title_text=”Portrait digital art of Bill Murray from Scrooged (Arcane). wearing a suit, Christmas,”][/et_pb_image][et_pb_text _builder_version=”4.19.4″ _module_preset=”default” theme_builder_area=”post_content” hover_enabled=”0″ sticky_enabled=”0″]
What’s next
I’m going to write more about this because it really interests me and as I study and learn about it I’ll keep a running commentary on my blog.
It’s the first time in a long time I’ve seen a subject that’s blown me away like this so hopefully, that will translate into me getting back into writing fiction, I’ll just have to be careful how I illustrate it I guess ;).
[/et_pb_text][/et_pb_column][/et_pb_row][/et_pb_section]