Posts tagged ‘midjourney’

2023/01/18

The Klara Trilogy is done!

The story of Klara, written by me channeling the Collective Unconscious, illustrated by me using Midjourney, and narrated and set to music and videographed by the talented Karima Hoisan, is finally finished!

I originally thought it was finished at the end of the first forty-frame thing; and then when I did Part Two at about the same length, I thought it was finished; and now having struggled for months on Part Three I’m pretty sure it actually is done. :)

Having just watched Karima’s videos of all three parts in order (playlist here!), I’m glad various viewers convinced me not to stop at one or two parts. It’s pretty good!

And I say this with all modesty; I feel like this story came through me, more than like it is something that I did. The comments over in Karima’s weblog, and her narration, have suggested various meanings and facets to me that I hadn’t thought of before.

In terms of the experience of creating it, it’s been interesting to see the various phases of interaction with the AI tool. I started out Part One by creating various variations of the prompt “detailed surrealism” on the v3 engine on Midjourney, and then weaving the story around pretty much whatever came out.

It happens that in v3, that prompt pretty reliably produces scenes from a stylistically coherent universe, including the MJ Girl, who plays the part of Klara in the first two parts. In Part Two, I had a bit more of an idea of what I wanted to happen, in a general way, but continued using v3 and the same prompt. This required somewhat more work, because it would produce images that didn’t fit with the story I wanted, so I had to put those aside and make more. But the style was at least not much trouble.

Part Three was quite different. For plot reasons, being in basically a different reality, the style needed to be different. It was relatively easy to do that, by using the “test” and “testp” engines, either alone or by “remastering” images made under v3. But the resulting images, while different from those of the first two parts, weren’t nearly as consistent among themselves as those of parts one and two. So I had to play around a lot more with the workflows and the prompts, and produce quite a few more pictures, to get a reasonably consistent style.

The style of Part Three still shifts around quite a bit; the flavor of the city, the color of Klara’s hair, the cat’s fur, and many other things change somewhat from panel to panel, but I wanted a nice mixture of consistent and in flux; and that took work!

Then there was the Story issue. The beginning “recap” part of Part Three was relatively easy that way, summarizing the story of the first two parts from a different point of view. But then I quickly got stuck; I wanted to do something more satisfying and less random than I would get by letting the AI’s raw output drive the action. For whatever reason, it took me quite awhile to find the story thread that I liked, and then about as long to create (or obtain, if you prefer!) the images to go with it.

(The images still drove the narrative to some extent; for instance the firefly line, which I adore, was inspired by the image that goes with it, not vice-versa.)

But finally I finished! :) And Karima made the video in record time, and there it is! Woooo!

I keep feeling like I should make it into good PDFs, or something (even) more readable, and officially post links to that; maybe even have it printed somewhere onto atoms. On the other hand, without the narrative and music and video, it would hardly be the same… :)

2023/01/16

Little Imaginary Diagrams

I asked Midjourney for some simple proofs of the Pythagorean Theorem. The results make me happy. :)

(On the text side: GPT-2 and even GPT-3 might have hallucinated something interesting. ChatGPT would just error out a few times and then give a boring literal description of one in a condescending tone. My ability to be interested in ChatGPT as an interaction partner is severely limited by how boring it is. But anyway, back to the pictures!)

Presented without comment (beyond the alt text):

A geometric diagram with various lines and colored areas and illegible labels (some of which may be small integers). Amusingly, there do not appear to be any right triangles.
A geometric diagram with various lines and colored areas and labels. Some labels are illegible, but there is an 8, a 3, a 4, and a few 1's. Some of the colored areas contain brick patterns, and there is a random architectural arch and a few other map-like textures thrown in.
A comparatively simple geometric diagram of lines and colored areas. There is a right triangle labeled E textured in a pebbly pattern, a rectangle labelled with a G and a unfamiliar glyph, and various areas with fine blue stripes.
A relatively modern-looking flat geometrical diagram containing three triangles (two of them right triangles) in gradients of different colors, a large grey striped area, and various lines. There are labels that look vaguely numeric, but are basically unreadable.

I hope you find these at least as amusing, endearing, and/or thought-provoking as I do. :)

2022/12/26

December 26th, 2022

We made just 106 dumplings this year, plus another eight filled with Extra Sharp Cheddar Cheese (that was the little boy’s idea; they’re pretty good!). This is a smaller number than usual (drill back into prior years here). The small number was probably mostly because single units of ground meat from FreshDirect tend to weigh just a pound, whereas single units from the grocery in prior years were more like 1.25 to 1.4 pounds. (Although, come to think of it, just where did we get the ground meat last year? Not sure.) And also because grownups tend to put more meat in each dumpling, perhaps. But in any case, we are now all pleasantly full, and the little daughter and her BF are safely back in the urbanity.

What has occurred? I feel like things have occurred, to an extent. I am more on Mastodon now than on Twitter, and if you want to keep up with the images I’ve been making in Midjourney and so on, you’ll want my Pixelfed feed. I listed lots of various of these pointers back the other week (and wow having every chapter of the novel as a weblog post makes it hard to scroll through the weblog). When Elon “facepalm” Musk briefly prohibited linking from Twitter to Mastodon, I actually set up a LinkTree page with my links.

Someone must have said “they can still link to Mastodon via Linktree” in his hearing, because he then briefly prohibited linking to LinkTree. That caused me to set up my own Links page over on the neglected (and in fact apparently pretty much empty) theogeny.com; I should put back all the stuff that used to be there sometime!

Note how ossum that Links page is! When you move the cursor over it, the thing that the mouse is over that you will go to if you click (if any) changes color (although I drew the line at having it bouncily change size the way Linktree does). You can look at the page source, and see the lovely hand-coded CSS and HTML. :) It even validates! (w3c seems to have a change of mind about validation badges, which makes me a little sad, so there’s no little “valid HTML 5!” badge on the page that links to the verification of the claim, but hey.)

That reminded me of the One-Dimensional Cellular Automaton that I make in hand-coded CSS and HTML and JavaScript the other year; it vanished for a long time, even from my personal backups of davidchess.com, and I’d almost given up on finding it until I thought of the Internet Archive‘s Wayback Machine, and discovered that it had snapshotted that page exactly once, in February of 2012.

So after a bit of fiddling around, I can once again present the One-Dimensional Cellular Automaton for your amusement. The page source there is also quite readable, I tell myself.

Note that many other things on davidchess.com are currently / still broken, although in the process of bringing that page back, I also brought the main page back, so you can see the extremely retro rest of the site (working and otherwise), including the entries in this (well, sort of “this”) weblog between 1999 and 2011.

Oh yeah, we had Christmas! That was nice. :) I got lots of chocolate, and the little (not little anymore) boy gave me a digital image of Spennix (my WoW main) dressed like the pioneer in the Satisfactory game, with a perfect “Spennixfactory” logo. And wife and daughter both got me books: “The Hotel Bosphorus” (a murder mystery set in Istanbul, my current Bucket List destination, and involving a bookshop, so what could be better?) from M, and “Klara and the Sun” (which I’ve been meaning to get, but never had) from the little daughter. (She thought that maybe I already had it and that’s why Klara is called “Klara” in the Klara stories, but it was as far as I know a complete coincidence.)

I’m working away at Part Three of Klara, after she leaves the clockwork world, but it’s slow going. I have an actual plot in mind that I want to illustrate, and I’m using a different graphical style which necessitates a different Midjourney workflow that I haven’t quite optimized yet. But it’ll get done! Probably! :)

We close with a Seasonal Image for the Solstice…

A disc with abstract shapes of fir trees, decorations, planets, and whatnot around the edge. In the center a round shape with small spiked protrusions, perhaps the sun, sits atop what may be a tree trunk that projects upward from what may be the ground and some roots at the bottom of the image. Branches stick out of the perhaps-sun, and some stars and planets and a few more enigmatic shapes inhabit the spaces between the branches.

Here’s to the coming of the longer days! Or the cooler ones, to those on the flipside… :)

2022/12/13

Hemingway, by Midjourney

I now have like 190 images in the folder that Windows uses to pick desktop backgrounds from; building on the twenty-odd that I posted here the other day. They are fun! But I’m not going to post any more right now; right now, I’m going to post some images comparing the various Midjourney engines (which they have generously kept all of online). I’m going to use the prompt “Hemingway’s Paris in the rain”, because why not! We can do other prompts some other time.

For most of these (all but “test” and “testp” I think), it produced four images, and I chose one to make bigger. Otherwise (except as noted) these are all just one-shots on that prompt. I’m going to paste them in more or less full-size, and let WordPress do what it will. Click on an image might or might not bring up a larger version or something who knows.

Here is the quite historical v1:

A rather vague but definitely rainy image of Hemingway's Paris in the rain. There is a tall black tower to the left that may be inspired by the Eiffel Tower, but resembles it only vaguely.

Here, similarly, is v2:

Another vague and rainy, perhaps slightly less streaky, image of Hemingway's Paris in the rain. A possible bit of Eiffel Tower inspired tower shows over the buildings to the right.

I rather like both of these; they are impressionistic, which I like, and I suspect it’s mostly because that’s the best they can do in rendering things.

Here is “hd”, which may be the same thing as v1 or v2 I’m not sure; this particular image is more strongly monochrome and sort of vintage-looking photo-wise:

A somewhat blurry and rainy of an old city square with some people in it, some with umbrellas. Could be Hemingway's Paris; no towers evident.

Now v3, which is pretty much when I started using Midjourney; it’s interesting how impressionistic this is, given that we know v3 can also do rather more realistic stuff (all of this, for instance, was v3):

A rather impressionistic drawing, perhaps in charcoal, with a somewhat Eiffelish tower to the left. Definitely rain, likely Paris.

Between v3 and v4, we had this engine, lyrically named “test” (I used the additional “creative” flag, because why wouldn’t one?); one is getting a bit more photographic here:

A slightly less vague still image of Paris in the rain, black and white, umbrellas, and so on.

and here is the “testp” variant of “test”; the “p” is supposed to be for “photographic”; I used the “creative” flag here also. It’s not notably more photographic than “test” in this case; maybe it’s the rain:

Another rainy city street, monochrome, a few cars, shiny impressionistic pavement, townhouses.

Now brace yourself a bit :) because here is the first version of v4 (technically in terms of the current switches it’s “v 4” and “style 4a”):

A soft-edge realistic painting of a Paris street in the rain, in muted but glowing colors. A few people walking in the distance are vague but convincing shapes. The Eiffel Tower is visible in the distance.

Yeah, that’s quite a difference. We have colors, we have lanterns casting light, we have very definite chairs and awnings and things. But now, the current v4 (“style 4b” which is I think currently the v4 default):

A rather realistic painting of vintage Paris in the rain; a couple of old-style cards on the street, their headlights and the lights of the shops reflecting in the wet pavement. Shopfronts and awnings, people in identifiable clothing. There are words on a couple of the shopfronts, but they are unintelligible: something like PHASESILN for instance.

Yeah, that’s gotten rather realistic, hasn’t it? It’s even trying to spell out the signs on shopfronts, even if it hasn’t really mastered language. But those cars are extremely car-like and detailed compared to anything earlier.

Can this currently-fanciest engine give us something a bit more like the atmosphere of the older ones, if we want that? Basically yes, if we ask for it. Here is the latest v4 again, with “impressionistic” added to the prompt:

Yet another wet rainy city street scene, again in full convincing muted color, but more impressionistic than the last. Again we have people (and hats) and umbrellas and shopfronts, but no attempt at individual letters on signs.

I rather like that! And “monochrome” would make it monochrome, and so on.

It’s perhaps interesting that the more recent engines were less insistent that pictures of Paris include the Eiffel Tower. Possibly just the random number generator, given how tiny our sample is here, but possibly significant in some way.

So there we are, nine probably rather enormous pictures of Hemingway’s Paris in the rain, as conceived by various stages of development of the Midjourney AI, and with only very minimal human fiddling around (picking the prompt and the one to feature from each set of four, having the idea to compare the versions in the first place, and like that) by me.

Comments welcome as always, or just enjoy the bits. :)

2022/11/30

Free Desktop Wallpapers!

Haha, what a great title.

But yes, in fact I’ve been using good ol’ Midjourney to make some wallpapers, and figured out how to get Windows to permute among them as desktop backgrounds on this brand-new Framework laptop I have (I should write a long boring geeky entry about my old Windows laptop breaking and my replacing it with this lovely new thing whose only disadvantage is that I’m still running Windows on it ewww), and I thought I would share them here as the first of the promised (or threatened) posts with tons of images made with Midjourney.

I think I will just do it as a big WordPress Gallery thing? Which means WordPress will I dunno display them in some random layout, but I hope you can still get the actual images at full size by clicking through and rightclick-saving? Or whatever?

2022/11/26

Woot woot!

Graphs from NaNoWriMo, showing a steadish 2,000 words per day from the 1st to the 25th of November.

Kept the ol’ 2,000 words per day pretty constant during NaNoWriMo, except for a couple of days off that I made up for on the next weekend, so I made the goal of 50,000, and not by coincidence the end of the story, right there on the 25th (which was, let’s see, yeah, yesterday!). A nice feeling.

I think I like this year’s rather a lot. The little Midjourney pictures at the start of each Fling (where “Flings” really turned out to be Chapters) was fun, but I think not ultimately transformative; not a big deal. A few plot elements, some important, (the libraries, the plants, the fast sharp ships) came from the images, but without the images something else would I expect have sprung to mind and perhaps carried the same basic ideas, about meaning, and communication, identity and the symbol-grounding problem.

As a reminder; the whole thing can be read in order by clicking on the cover page here, and then clicking the bold link at the bottom of each Fling. I may be going through and fixing a few errors between now and the end of the month (although the relative inconvenience of doing that in WordPress may limit how much I do).

In other news, I’ve been on Twitter less, and on Mastodon / Fediverse more, prompted by the gross antics of the billionaire narcissist, but continuing just because it’s a more interesting place, with (so far?) more interesting and less upsetting communication going on. (It could be argued that given the State of Things, one ought to be upset; but so far I think the argument is flawed.)

I’ve been making tons and tons of images on Midjourney still (getting up near 20,000, the system tells me!) and they are still constantly improving the engine(s), which is very cool. I’ve been posting some of them on PixelFed (roughly, PixelFed is to Instagram as Mastodon is to Twitter), and also still on Twitter (the same ones, mostly). I have enough pictures that I love to fill many, many weblog posts, and I’m sure such posts will appear.

Here’s just one image for now that’s a total favorite; it’s called “Accord”:

A woman with a very long neck in foreground just left of center, looking to our right. Her hair extends fractally into infinity upper left. An infinite line of smaller women in dark clothes, all looking in the same direction, extends from her shoulder to the right, where a tower is dimly present through fog and insects. Two more of the smaller women stand behind her, eyes closed.

Is that amazing, or what? He said modestly.

In the legal domain, there is talk of a class-action suit against Microsoft / GitHub / OpenAI / Copilot, on something like the claim that training an AI on a piece of code requires the appropriate license from the owner of that code (or equivalent, as for public domain code or code you wrote yourself). The possibility of implications for AI art tools like Midjourney, and AI text generators like NovelAI, is clear, although there may also be significant differences. For instance, there seem to be various examples of exact plagiarism by Copilot, whereas as far as I’m aware no such thing exists for say Midjourney or NovelAI.

(There was at least one person persistently spamming Twitter and Reddit with a copy-pasted claim that GPT-3 plagiarizes, pointing at various things on the web that did not actually show, or generally even claim, that. I can’t find them today; perhaps OpenAI’s lawyers sent them a letter. Similarly I’ve been told by one person on Twitter (and at least one other who agreed with them) that for “[a]lmost all pieces I’ve seen thus far, I can point at and name the elements that came from individual artists, and often individual paintings or works”, but when I expressed interest and asked for a concrete example, they said roughly “I’ll get back to you tonight” and then went silent.)

It will be interesting to see what happens with this lawsuit. Somewhat sadly, I think that:

  • The most likely outcome is that they’ll just lose, because Microsoft is rich and individual Open Source contributors, even as a class, aren’t rich,
  • Second most likely, Microsoft will give some symbolic amount of money to something that will benefit some Open Source contributors a little and some lawyers a lot, and there will be no precedent-setting court decision,
  • Less likely, after some long wrangling process, something like the Private Copying Levy might be worked out, which is sort of like that last bullet, but more codified and involving more money, and possibly a precedent that there is a copyright violation at least potentially involved,
  • Even less likely, there would be some kind of opt-out process whereby a creator could indicate they didn’t want their stuff used to train AIs, and makers of AI engines would have to like re-generate their neural nets annually without the opted-art works,
  • And at the bottom, perhaps fairest in some sense but also least likely, a straightforward finding that AI Engine makers, at least ones that make money, really do need the right to copy and/or prepare derivative works of the things they train their engines on. So we’d get engines trained on just public domain works, things out of copyright, things posted under sufficiently permissive licenses, things they explicitly license, and so on. I would be fine with this, myself, but I wouldn’t bet on it happening.

We’ll see!

What else? That’s the main things I can think of. Oh, yeah, Thanksgiving was very nice; the four of us and the little daughter’s SO. We were (I was) especially lazy this year; beyond the HelloFresh pre-planned ingredients that we’ve used the last couple of years, this year we got the pre-planned pre-cooked just-needs-warming version from FreshDirect (ETOOMUCHFRESH). It wasn’t bad! And certainly easy. :) We also bought pre-made apple and pumpkin pies. I resist feeling guilty!!

Also my Windows laptop is broken (I’m not sure why or how; it behaves like a bad storage device, but both the HDD and the SSD seem perfectly readable when stuck into external USB things). Whatever’s wrong with it inside, it’s also vaguely falling apart, with cracked and broken keys, a non-functional direct Ethernet connection (on all connectors somehow), and some other stuff.

So I have an exciting new Framework laptop coming as an early Solstice present! (It’s supposedly in Alaska right now, on the way here in under a week or so.) Inspired, like so many other people, by Cory Doctorow’s glowing review. We’ll see if I am frustrated by the Intel graphics chipset. I’m pretty optimistic, as what I want to run isn’t like the latest AAA game; more like WoW and SecondLife and the GIMP and No Man’s Sky and Satisfactory. I might have to turn the resolution down some at worst I expect.

(In the meantime I’ve been using my phone and this tiny cheap Samsung Chromebook and just not using any of those programs; turns out my life doesn’t depend on any of them! The thing I’m most eager to do is get the GIMP going to work on Part 3 of Klara; in theory I could enable Linux on the Chromebook here and run the GIMP in that, but I rather doubt its CPU is up to it. Just typing this into the WordPress editor is lagging significantly just because I’m also watching YouTube and have a few dozen Chrome tabs open including like Discord and Mastodon and…)

There! :) Thanks for coming, and enjoy.

2022/11/05

A Saturday Morning in November

Midjourney V4 (well, an “alpha” version thereof) is out! As if I didn’t already have enough to play with.

That house, floating above the sea with some balloons and things, is typical of the results of my old favorite “detailed surrealism” prompt. And this:

is from the prompt “neutral prompt”. We can tentatively conclude that it likes cute fantasy houses. :)

Here is a v4 (alpha) Yeni Cavan scene:

which is pretty cool.

In other news, I’m over 8500 words into NaNoWriMo 2022 as of yesterday (I haven’t written anything yet today). I’ve also make a cover page for the book, which links to the first Fling, and each Fling links to the next, so you can start at the cover, and go through the whole thing in the right order by just clicking obvious things. This may partially atone for posting it as a bunch of weblog entries in the first place. :)

I made the cover image in (obviously) Midjourney, and then fiddled a little and put on some titles (and my Government Name!) in the GIMP. I note that the skills of professional cover designers are subtle and profound; the titles on my cover are obviously in the wrong place, a professional designer would put them in places that were so obviously in the right place that one wouldn’t even notice, and I have no idea what makes the difference.

Okay! Now I am off to make the header image for Fling Seven, and start writing. I think it will be more of Alissa’s story.

2022/10/31

Weirdness from the Copyright Office

A quickish update. I have said, and still believe, that things created using AI tools are just like anything else with respect to copyright. But recent events remind me that the Copyright Office is made up of people, and people are unpredictable, and US Copyright law is in many places a squashy mess made up of smaller squashy messes, so logic does not always apply.

Here is a currently-relevant set of data points:

  • I have registered the copyright on an image I made using MidJourney. I didn’t mention that I used MidJourney (or Chrome, or Windows) on the application form, because there was no place to put that; the form didn’t ask. The application for registration was granted routinely, without any complication.
    • I imagine there are hundreds / thousands of similar registrations from other people.
  • This person has registered the copyright on a work that they made using MidJourney (I think it was), and the work itself makes it clear that MidJourney was used. The application was afaik granted routinely, without any complication.
    • But now it appears that the copyright office has said “oh wait we didn’t notice that MidJourney thing, so we’re cancelling your registration”.
    • And the person is appealing, apparently with the help of MidJourney themselves. (Hm, they’ve also apparently deleted some of their tweets on the subject; lawyer’s advice perhaps.)
  • This person has applied apparently to register various images made with various workflows involving AI (dalle2 I think) to various extents, clearly stated, and rather than being just accepted or just rejected they’ve received emails from the copyright office asking them for details of what they did, and especially bizarrely suggesting that perhaps at least one of the works might have been “conceived” by the AI.
    • Which seems crazy, because the Copyright Office has generally had the opinion that software isn’t creative, and can’t (like) conceive things.

I suspect that things are just rather in disarray at the Copyright Office, and different examiners are doing different things, perhaps having gotten different memos on the subject, or just having their own different opinions about things. It will be interesting to see how the appeal mentioned above goes!

To me, it seems obvious that things created with AI tools should be prima facie registerable with the copyright office, just like photographs presumably are, and if someone wants to challenge based on some legal theory about either lack of creativity or derivative works or whatever, they can do that. The copyright office itself, I would think, would want to stay far away from any situation where they have to somehow evaluate themselves how many units of creativity are in each of the kazillions of applications they get daily.

On the other hand, the Copyright Office could simply issue some sort of guidance saying “We won’t register copyrights on works created with the significant use of an AI tool like dalle or MidJourney, so don’t bother asking” (and could even update the forms to have a question about it).

I think that would be dumb, and lead to court cases eventually that would either overturn that or at least cause a great deal of faffing about that they could have avoided.

But then people and government offices do dumb stuff all the time, so who knows! All is in flux…

And here is an image that I made using Midjourney. No matter what the Copyright Office thinks today. :)

2022/10/25

Figure Three

Another “fun corners of the AI’s network” post. These are all pretty much unfiltered and unretried and unmodified results with the prompt “figure three” with the current “test” or “testp” engine (v4 said to be coming soon!) on MidJourney. I have no comment except that I find them all wonderful. :)

(There are, typically, various women’s faces, and perhaps the word “figure” got us more sort-of-bodies than we would have gotten otherwise?)

2022/10/24

Klara, Part Two

Have you noticed, that sometimes one person is much more productive than another? :)

Due to my skilled collaborator on the first Klara video being one of those much more productive (than me) people, there is now a Part Two of Klara’s story, and that Part Two exists in the form of another amazing video on the You Tube!

detailed surrealism

Here is Karima’s post on the subject, and here is a direct pointer to the video itself (don’t forget to Like and Subscribe!). Images by me using MidJourney and the GIMP, words by me, voicing and everything else by Karima.

Given my comparatively relaxed productivity :) I may or may not put the largish (or even an edited smallish) pdf of Part Two up somewhere. Perhaps arranged with the one for Part One, in some organized way!

This is the end of Klara’s story for now, but one never knows; she may appear again, for Further Adventures, on other days. :)

I am still creating hundreds and hundreds of images; over fourteen thousand all together, MidJourney tells me. And NightCafe says I’ve done another “4.5K+” there. A handful in dalle2. Lots and lots in NovelAI because it is so fast, but it also doesn’t retain them or give any kind of count, so I don’t know! But let’s say around twenty thousand altogether. Rather a lot!

November is approaching, and I have no real idea what I might do NaNoWriMo-wise. Will I use Klara’s story in some way? Will I use MidJourney images? NovelAI words? Or just type a lot? Stay tuned! :) And enjoy these lovely videos in the meantime…

2022/10/15

Klara by Dale Innis & Karima Hoisan

Well, this is just too much fun. :) Very good Second Life friend and collaborator liked the little Klara piece so much that she voiced it and set it to the perfect music and made it into a rather wonderful YouTube! Definitely more accessible :) and more of an experience this way than the 327MB pdf file. Wooot!

Digital Rabbit Hole

Very excited to share with you all, this off-beat, pretty long (almost 10 minutes) surreal video collaboration with Dale Innis
Those of you who read me regularly, know that Dale Innis is a scripter friend who has collaborated with me and also with Natascha & I for the last 10 years and lately has been dabbling in all sorts of AI Art, especially MidJourney, which is a veritable game-changer in this blossoming field.
He showed me a pdf file of slides and a story-line, that he had made and I fell in love…fell obsessed, is a better word, to try to bring this to a way more people could see it.
This is how the project was born. I found, what we both agree, is the perfect music   Meditative Music and I made a voice-over and edited the slides into what you’ll see below.
This is a very slow-…

View original post 45 more words

2022/10/12

Klara’s Story (Part One)

So after I did “Ice Dreams” (50M pdf), as casually announced here, I did another graphic novel (to the extent that that phrase fits at all), or the first part of one, in a very different style and by a very different process.

For “Klara’s Story” (working title), I generated two-by-two grids of Midjourney images using the prompt “detailed surrealism” (a favorite of mine) and some variants thereof, and crafted some sort of story around the images (rather than using the AI to create images for a more-or-less known story).

I haven”t yet had the patience to pare it down at all, so here is the current like 327M pdf draft.

The huge size does make it a bit awkward and slow to deal with, but… there it is!

2022/10/09

More Visions of Yeni Cavan

I first found Yeni Cavan as a story and art venue, based on a bunch of words used as prompts in the pre-Stable Diffusion NightCafe, way back in February. Since then I’ve tried to find it in various other engines and things, casually and without much luck. But after playing with the engine flows and prompts and things some, here are some images from MidJourney that I rather like; sufficiently Yeni Cavanish, I’d say, although so far I miss the little random patches of bright purple neon and such. (Maybe I’ll try some of the other venues as well eventually.)

Yeni Cavan; interior room (image started in the –hd engine)
Yeni Cavan; room interior (love the comfy couch with the … circuit board? sitting on it)
Yeni Cavan; room interior (I’d like to be there yes)
Yeni Cavan; room interior (pure v3 I think)
Yeni Cavan; room interior (pure –hd I think; intricate!)
Yeni Cavan; detailed surrealism (whee!)
Yeni Cavan; adorable surreal bots
Yeni Cavan; more detailed surrealism!
Yeni Cavan; upstanding citizen
Yeni Cavan; City Waterfront
2022/10/01

AI Art and Copyright some more

I am losing track of the number of AI-based image-creation tools I have access to now. It’s not that huge a number, but it’s complicated! :) There’s at least:

  • good old ArtBreeder, which I haven’t used in ages, and which seems to have a potentially interesting new mode where you sketch a thing with a few shapes, and then type text telling the AI what to make it into,
  • MidJourney with the old V3 engine and the newer and lyrically named ‘test’ and ‘testp’ engines and mixmashes of those,
  • NightCafe, which was my main goto image tool quite some weeks, with the old Artistic and Coherent engines, but now also the new Stable Diffusion (SD) based “Stable” engine, and various workflows among those,
  • NovelAI which now does images as well as text; the images are also in a Discord bot, and it’s really fast; it uses some heuristic smut-blurrer (maybe just the standard SD one?) but the devs sort of promise they will eventually move it off of discord and then have few or no restrictions (similarly to their text generator),
  • and now I discover that I have access to Dall-E also, from OpenAI, which I have just barely begun to use (detailed surrealism).

The “you can’t copyright art made with AIs” meme seems to have withered (which is good since it’s not true, although nothing is certain), but my experiment to gather additional evidence against it has finally borne fruit (months before I expected it to, really): I have now registered my copyright in this masterpiece of mine:

A blonde porcelain doll and a worn teddy bear sit on a trunk, in a musty attic in light from the window

with the real actual US Copyright Office, who have sent me a real actual certificate testifying to it. The registration can also be found on the web (you have to go to that page and then search on Registration Number for “VA0002317843”; I have yet to find a permalink that persists, bizarrely).

I did it through LegalZoom rather than myself; it cost more (I think), but I was more confident that I was Doing It Right during the process. There were no questions about whether AI was involved, or about what software I used to create it, or anything like that. I did have to say that I’m the creator, of course, but since I am :) I don’t see a problem there.

Registering the copyright doesn’t mean it’s 100% correct, it just creates a legal presumption. Someone could still challenge it, arguing that I wasn’t really the creator at all. I think that would be very unlikely to succeed.

And in any case, here is a nice concrete counterexample to any remaining “you can’t copyright art produced with an AI” claims that might be floating around.

The image is, by the way, provided under the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license, so feel free to do anything allowed by that license. :) Knock yourself out! Figuratively!

Extremely generous friend Karima also continues updating the virtual world region “AI Dreams in Art” with things she likes from my Twitter feed, etc, so drop by! It is getting blushingly positive reviews on the Social Medias; apparently there are significant numbers of people who have heard a lot about this AI Art stuff, but never really seen any. They seem to like mine! :)

2022/09/10

A Photograph #MidJourney

As we’ve discussed, one of my favorite things is to give a text- or image-generating AI a vague and/or ambiguous prompt, and just see what happens. The results are sometimes kind of horrifying, but here I’m going to post a bunch of results that aren’t especially horrifying, and that are sometimes lovely.

The prompt for all of these is basically just “a photograph”. And what I really want to do (and I am realizing that there are various services out there that would let me do it without much fuss) is make a nice coffee-table book of these, accompanied by text produced by like NovelAI. Just because it would be neat.

What a world, eh?

2022/09/07

One Inside of Another #MidJourney

I continue having way too muchy fun making images with MidJourney (and NightCafe, and now some things that I can’t quite show off yet).

I’m realizing that I’m a little weird, in that most people seem to be interested in just how exactly they can get the tool to produce an image that they’re thinking of, whereas I am almost entirely into typing somewhat random ambiguous stuff, and seeing what fun things the AI responds with.

For instance, here’s a snapshot of a whole bunch of images made using the prompt “one inside of another” with various seeds and switches and engine flows and things:

two dozen rather varied and ominous images

I love all of these (there were some I didn’t love, and I didn’t upscale those, so they aren’t here). The first two got me:

It seemed like there was really something going on there.

These two are with a slightly different engine flow than most of the others, but are no less wonderful:

What’s going on here? Is the AI showing wild creativity? Is it just starting in a basically random place due to the vague prompt, and then drifting into some weird random local minimum from there? Is that different from showing wild creativity?

Clearly there are lots of pictures of faces (especially women’s faces) and rooms with windows in the training set, so we get lots of those, that makes sense. But why do we get two different images that are (inter alia) the face of a person holding (and/or confronting) some semi-abstract sharp object? Why are there two faces which are split in half vertically, and one half striped / pixelated?

And what are these?

One thing is certainly inside of another. Is that a coincidence? Or is the AI “aware” of it in some sense?

I feel like I could swim in this stuff forever! That is what I thought at first about the GPT-3 stuff, though, and that wasn’t true. :) Still, if it’s just that I’m still in the initial flush of excitement, it’s a very fun flush.

Oh, and somewhat relatedly, here is a stealth announcement of a new graphic novel (or perhaps picture book) based on MidJourney images. This time I generated many many images from the same (small set of related) prompts, four at a time, and then tried to construct a story that would make sense with them. Note that this version is like 327MB for some reason, so click with care: Klara, Part 1.

2022/08/28

Sunday in August

Brilliant title, eh? :) Various things have occurred! I will mention some of them.

There is now a full draft of my graphic novel(ette); it’s 40 pages, and about 50MB, so don’t expect your phone to pop it up very quickly. And also don’t expect it to be that good :) as I’ve never really written in this medium before, and it’s tough. In the most recent draft I removed considerable exposition which felt out of place, replacing it with images and short statements. Now I’m afraid the result is basically incomprehensible :) at least to anyone not already familiar with the SF tropes I’m touching on.

It was really fun to do, though! As I’ve mentioned, all of the art was done using MidJourney, and the compositing and most of the text was done in the GIMP. I got a few nice pieces of display text from cooltext.com; if I’d thought about it a little harder, I would have used something more interesting than Sans-Serif Bold (and sometimes Italic) from the GIMP font collection. (There’s a little Verdana, just on the copyright page at the end I think.)

This was the most fun when I was putting together the images that I’d already created that inspired the story in the first place. It was more frustrating when I needed a particular image and was trying to create it in MidJourney; it is sometimes a challenge to generate a specific thing! The water droplet at the very end, for instance, came after many, many attempts to make a crystal / water sphere that wasn’t sitting on a surface of some kind.

Other things! In order to get even more meta than this, we entered “Here is a short description of an image that has some interesting visual elements:” into NovelAI. It responded:

A man wearing a white t-shirt and blue jeans sits in his chair, staring at the television. His eyes are closed as he watches a show about two women discussing the weather. The screen reads ‘NBC News’ with a picture of a woman.

and I typed that into MidJourney, and got this:

Four rather fuzzy pictures containing a person and some TV screens

So that’s in some sense entirely AI-generated, using a human-designed procedure. It’s also really boring!

Let’s try again; this time NovelAI says:

A woman in a white dress, standing on a rocky beach. The ocean is behind her and the setting sun makes for a bright glare to one side of her face as she looks out into the water.

and MidJourney responds with (after a considerable delay because I am in relax mode, which is basically “nice -n 19”:

Four nice slightly impressionistic images of a woman standing on rocks by the water with the sun low.

which is quite nice (although again not exactly what the prompt says).

So there you are: the first two (or eight) images produced by a particular meta-algorithm using Modern AI Technology! :)

Other things are to a great extent prevented from occurring by the fact that it is Very Humid outside, and there are Pandemics and so on still. I went out to get bagels this morning, and I was like “yow, what is this very large humid windy room here?”. There’s a chance I’ll get into Manhattan next week; that will be quite a shock!

I have not been playing Video Games to speak of, because all of these AI stuff has been more interesting. There is all sorts of stuff to say about legal issues (Yes, content generated using an AI can be copyrighted by the human creator!) and societal issues (impact of AI on artists and art perhaps similar to impacts of photography on same?) and all like that there. But it is more fun to make cool pictures!

So in closing here is the one I used on the copyright page of the Graphic Novel(ette). Be well!

A surreal image of maybe a sheep standing in shallow water looking at maybe like a blimp made of sticks or something.
2022/08/22

So many AIs and images and stuff!

I was thinking of a post extending the legal thoughts from last time to talk about this widespread claim (based on the Thaler decisions that I mentioned briefly there) that “Artwork made with an AI can’t be copyrighted”. It’s all over the clickbait-website press, and it’s wrong. The rulings in question said that an AI can’t be the creator-in-fact of a work (in the U.S.) so someone can’t get copyright to a work based on being the “employer” of the creator-in-fact AI. But they say nothing about the obvious alternative that a human can be the creator (simpliciter) of a work make with an AI, just as a human can be the creator of a work made with Photoshop, or a paintbrush.

Heh heh, I guess I’ve already written a bit about that here now, haven’t I? But there are various arguments and counterarguments that one could talk about that I’m not going to.

Then there’s the fact that I’ve been generating So Many Images in Midjourney, which for a while there had pretty much entirely drawn me away from NightCafe. As well as those So Many Images, I’ve started to put a bunch of them together in the GIMP in the form of a sort of amateur manga or graphic story that attempts to have an actual plot and stuff; here’s a pdf of the story (the first 10 pages of it, which is all that currently exists), at considerably reduced resolution so it isn’t like over 30MB. Feedback welcome. :)

But then! By which I mean just today I think! NightCafe has become very interesting again, due to adding the Stable Diffusion engine. Which I have been using extensively, and have noted that:

  • It is kind of boring compared to the other engines I’ve used, in that it seems to usually take the simplest and most quotidian interpretation of a prompt, and create the most unremarkable (and, admittedly, sometimes impressively realistic!) image possible from it.
  • The right set of adjectives and so on can get more interesting results from it sometimes. The prompt prefix for Yeni Cavan, for instance, produces recognizably Yeni Cavan images, but somewhat less smoky and mysterious ones than Midjourney or the NightCafe Artistic engine do.
  • It has some kind of risible post-censorship blurring algorithm, and if a picture looks too naughty to that algorithm, it comes out with a very heavy blur applied. I have (accidentally) gotten one NFSW image that its filter didn’t detect, and on the other hand just including “in the style of Gauguin” in a prompt seems to pretty reliably produce just a blur. (“Well, yeah, he’s in the training set, but his stuff is really too naughty to output.”) I mean, /facepalm and all.
  • Update: when I reported a couple of very obvious porn-filter false positives, NightCafe support replied that the filter should be gone / optional in “a few days”. Very gratifyin’!
  • I wish NightCafe had an “effectively free, but might be slow” generation mode like Midjourney does. I’m running out of NightCafe credits after playing with Stable Diffusion for hours, and I’m near out of credits, and given the overall experience I will probably just to back to Midjourney now and make more images for the comic. :)

So that’s those things! But mostly it’s been lots of cool pictures. We will close with a recent one from Midjourney:

Atomic surrealism detailed render

and something that Stable Diffusion did (rather interestingly) with the same prompt:

Atomic surrealism detailed render

Stay surreal! :D

2022/08/14

Is it plagiarism? Is it copyright infringement?

So I’ve been producing so many images in Midjourney. I’ve been posting the best ones (or at least the ones I decide to post) in the Twitters; you can see basically all of them there (apologies if that link’s annoying to use for non-Twitterers). And an amazing friend has volunteered to curate a display of some of them in the virtual worlds (woot!), which is inexpressibly awesome.

Lots of people use “in the style of” or even “by” with an artist’s name in their Midjourney prompts. I’ve done it occasionally, mostly with Moebius because his style is so cool and recognizable. It did imho an amazing job with this “Big Sale at the Mall, by Moebius”:

“Big Sale at the Mall, by Moebius” by Midjourney

It captures the coloration and flatness characteristic of the artist, and also the feeling of isolation in huge impersonal spaces that his stuff often features. Luck? Coolness?

While this doesn’t particularly bother me for artists who are no longer living (although perhaps it should), it seems questionable for artists who are still living and producing, and perhaps whose works have been used without their permission and without compensation in training the AI. There was this interesting exchange on Twitter, for instance:

The Midjourney folks replied (as you can I hope see in the thread) that they didn’t think any of this particular artist’s works were in the training set, and that experimentally adding their name to a prompt didn’t seem to do anything to speak of; but what if it had? Does an artist have the right to say that their works which have been publicly posted, but are still under copyright of one kind or another, cannot be used to train AIs? Does this differ between jurisdictions? Where they do have such a right, do they have any means of monitoring or enforcing it?

Here’s another thread, about a new image-generating AI (it’s called “Stable Diffusion” or “Stability AI”, and you can look it up yourself; it’s in closed beta apparently and the cherrypicked images sure do look amazing!) which seems to offer an explicit list of artists, many still living and working, that it can forge, um, I mean, create in the style of:

So what’s the law?

That’s a good question! I posted a few guesses on that thread (apologies again if Twitter links are annoying). In particular (as a bulleted list for some reason):

  • One could argue that every work produced by an AI like this, is a derivative work of every copyrighted image that it was trained on.
  • An obvious counterargument would be that we don’t say that every work produced by a human artist is a derivative work of every image they’ve studied.
  • A human artist of course has many other inputs (life experience),
  • But arguably so does the AI, if only in the form of the not-currently-copyrighted works that it was also trained on (as well as the word associations and so on in the text part of the AI, perhaps).
  • One could argue that training a neural network on a corpus that includes a given work constitutes making a copy of that work; I can imagine a horrible tangle of technically wince-inducing arguments that reflect the “loading a web page on your computer constitutes making a copy!” arguments from the early days of the web. Could get messy!
  • Perhaps relatedly, the courts have found that people possess creativity / “authorship” that AIs don’t, in at least one imho badly-brought case on the subject: here. (I say “badly-brought” just because my impression is that the case was phrased as “this work is entirely computer generated and I want to copyright it as such”, rather than just “here is a work that I, a human, made with the help of a computer, and I want to assert / register my copyright”, which really wouldn’t even have required a lawsuit imho; but there may be more going on here than that.)
  • The simplest thing for a court to decide would be that an AI-produced work should be evaluated for violating copyright (as a derivative work) in the same way a human-produced work is: an expert looks at it, and decides whether it’s just too obviously close a knock-off.
  • A similar finding would be that an AI-produced work is judged that way, but under the assumption that AI-produced work cannot be “transformative” in the sense of adding or changing meaning or insights or expression or like that, because computers aren’t creative enough to do that. So it would be the same standard, but with one of the usual arguments for transformativity ruled out in advance for AI-produced works. I can easily see the courts finding that way, as it lets them use an existing (if still somewhat vague) standard, but without granting that computer programs can have creativity.
  • Would there be something illegal about a product whose sole or primary or a major purpose was to produce copyright-infringing derivative works? The DMCA might possibly have something to say about that, but as it’s mostly about bypassing protections (and there really aren’t any involved here), it’s more likely that rules for I dunno photocopiers or something would apply.

So whew! Having read some of the posts by working artists and illustrators bothered that their and their colleagues’ works are being used for profit in a way that might actively harm them (and having defended that side of the argument against one rather rude and rabid “it’s stupid to be concerned” person on the Twitter), I’m now feeling some more concrete qualms about the specific ability of these things to mimic current artists (and maybe non-current artists whose estates are still active).

It should be very interesting to watch the legal landscape develop in this area, especially given how glacially slowly it moves compared to the technology. I hope the result doesn’t let Big AI run entirely roughshod over the rights of individual creators; that would be bad for everyone.

But I’m still rather addicted to using the technology to make strange surreal stuff all over th’ place. :)

2022/08/01

Midjourney haunted by mysterious woman

I’ve apparently made over 500 images in Midjourney now, and posted quite many of them to the Twitter. It’s great fun, just like when I first found AIDungeon, NightCafe, etc. It remains to be seen how long this will last, and if I slack off after a while with the feeling that I’ve sort of scouted all the interesting parts of the conceptual space. That has certainly not happened yet. :)

In the Good Old Days of AIDungeon, there was a running joke on the subreddit about how characters named Count Grey and Karth and Kyros would constantly appear unbidden; eventually people (including me, hem hem) discovered the various stories in the specialization-set that contained those names, and eventually after that the devs added an optional filter to avoid outputs containing any of those names.

In recent days, I have discovered sort of the same kind of thing in Midjourney! There is a particular woman’s face that appears, well, much more often than any other face seems to (at least with the prompts I give, which may or may not be significant, see below) and sometimes in cases such as random-gibberish prompts where one wouldn’t especially expect a face at all.

The rest of this post will be just various pictures in which the mystery woman has appeared, from earliest to latest, with possibly-amusing observations and history in the captions.

Prompt: “The Sketches She Made Today” and that’s her upper-right and lower-left (#2 and #3)
I blew up #3 because it was such a striking face. Perhaps this is when she began slipping into my Midjourney images! Like a spirit!
Prompt: “The things we see when we close our eyes”; not necessarily her, but not obviously NOT her.
Prompt: “Le cœur a ses raisons que la raison ne connaît point”; that’s her, obviously, upper left
The prompt here was just “Realistic detailed portrait”. We notice a certain similarity!
Prompt: “tilsit commerath bex dunnig”; she appears as #3 even with a nonsense prompt!
Prompt: “Soft-focus steampunk portrait”
Prompt: “All of her faces are angry now”. Lower-right.
Prompt: “professional portrait in the library”; Lower right again!
Prompt: “Award-winning sepiatone photograph of an enigmatic face in repose”; lower left, obvs
Prompt: “Faces in the trees; dark ominous haunted; detailed image; fantasy, night, dream, eyes”; yipes!
Prompt: “pictures of Lordes”; lower-right once again at least
prompt: “concept art film noir, songs of love”; the female lead is clearly HER
“concept art film noir, Night Life”
“People All Over The World Have Seen This Woman In Their Dreams”
yes, that was actually the prompt, and this was one of the 4-up.
Prompt: “tuppy wup kazami ghent-blum plornish” I mean, come ON!
Prompt: “Manga Action Sequence; thrilling and detailed” and now we have a Manga version of That Face

There are a few others that might arguably have been Her as well, but I tried to stick to the most compelling examples.

Clearly we ought to have a name for this woman! I attempted to get the AI to reveal it:

Prompt: “A woman wearing a name tag”

… but it did not fall for our simple strategem.

So the mystery continues! Who is this woman? Someone who occurs especially often or especially notably in some training set? Or a sort of local minimum / maximum in the network’s energy space, based on what a typical face in the set looks like? (You will have to take my word that other faces, male or blonde or with a different nose, are much rarer in Midjourney outputs.) Inquiring Minds Want To Know!