![Observations and output from a month of learning the DALL·E AI image generator.](/img/blog/dalle.png)
30 Days of DALL·E
Observations and output from a month of learning the DALL·E AI image generator.
DALL·E is arguably the most advanced AI image generator available. Given a short description, or prompt, of an intended image, it can create four completely original works within seconds, from realistic photographs to abstract paintings and everything in between.
I was blown away when I saw the quality and detail of DALL·E's output. The implications of AI imagery for creative industries dawned on me instantly: illustrators, photographers, physical and digital artists, web and product designers, film and game producers, animators, marketers, and more.
Take a case study of commissioning a digital illustration for editorial use. Currently an agency would search through artists for the right aesthetics, write a brief, negotiate a price and timeline, wait for the first draft, provide feedback, wait for revisions, then pay the bill. This whole exercise, which once cost thousands in resources, can now be done in ten minutes for a dollar or two.
I decided to immediately learn everything I could about how the tech works and how I can adapt, utilise, and integrate it into my work. If you are in one of the industries above, I recommend doing the same.
So, every day in August, I created a new DALL·E image. I tried experimenting with a variety of subjects, themes, and styles, to get a broad understanding of capabilities and shortcomings.
Before starting, I thought of DALL·E as a useful commercial tool that could not replace real art and its exploration of the human condition and beyond (the divine, sublime, infinite, eternal). Now I am not so sure. Many pieces are unexpected, thought-provoking and awe-inspiring. Considering that DALL·E knows only what we have taught it, the outputs provide a deep reflection of us and the human experience through the lens of visual language.
A few more thoughts
Observations
The notes below apply only to my experience with DALL·E. There are other AI image generators with different strengths and weaknesses, including Stable Diffusion, Midjourney, Google Parti, Jasper Art, et al.
Ideas/Vibes > Specific Visions. When I had a precise outcome in mind, I was usually disappointed. DALL·E needs a whole dartboard to aim at, not just the bullseye. Some overly specific prompts that I could not get right were: “one small violet flower in the middle of an expansive sandy desert” and “a church with paint oozing down the walls and becoming colourless”.
It struggles with faces. Attempted portrait photographs with human faces often have wild deformities. Horror movie shit. I have seen amazing counter-examples, though, so this could have been down to bad prompts on my end.
Scale is finicky. At times I needed to be very specific about how large or small something should be. For example, when asking for "a mango tree beneath a vibrant rainbow", none of the pictures showed the complete tree, only a close-up of the branches.
It doesn't do words. Whenever text is in the output image (e.g. for a cartoon strip or newspaper ad), it will usually be in some weird old-English Latin gibberish language. I'm unsure whether this is intentional or not. I assume it is.
It struggles with things it hasn't seen before. I threw a few curve balls: uncommon or unlikely things like "a bridge that connects two ponds, arching over land". It produced only what it was familiar with: ordinary bridges that connected land over water.
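Since loose ideas worked better for me than precise visions, one way to work is to take a single subject and spin it into several prompt variations, then see which one lands. Below is a minimal sketch of that habit in Python. The helper name `build_prompts` and the scale hint "wide shot showing the whole tree" are my own assumptions for illustration; the style suffixes are borrowed from prompts in the gallery. It only builds strings and does not call DALL·E.

```python
from itertools import product


def build_prompts(subject, styles, hints=("",)):
    """Combine one loose subject with several styles and optional hints.

    Hypothetical helper: returns one prompt string per (style, hint) pair.
    """
    prompts = []
    for style, hint in product(styles, hints):
        # Skip empty parts so there are no stray commas.
        parts = [subject, hint, style]
        prompts.append(", ".join(p for p in parts if p))
    return prompts


variations = build_prompts(
    "a mango tree beneath a vibrant rainbow",
    styles=["digital art", "fantasy art", "classical painting"],
    # Assumed wording for a scale hint; adjust to taste.
    hints=["", "wide shot showing the whole tree"],
)

for prompt in variations:
    print(prompt)
```

Each variation can then be pasted into the generator by hand. The point is to give it the whole dartboard: many nearby prompts rather than one overly specific one.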
Other capabilities
I only experimented with DALL·E’s image generation via text prompts. Other capabilities out there now or on the horizon include:
- Sketch-to-image. Along with a text prompt, provide a rough sketch (think MS Paint quality) to give the engine a clearer starting point.
- Mending and manipulation (inpainting). Photoshop editing capabilities via text prompt. Touch up, replace, tweak any part of an existing image.
- Outpainting. Draw out from a seed image to change the context, environment, or level of detail, and create massive, expansive pieces.
- Character generation. Create human-like fakes or new fantasy characters for animated films or video games.
- GenArt + AI. Combine traditional generative art with an AI image engine to get crazy results.
- Photoshop and Figma plugins. Combine all of these features right inside one app. Here’s a sneak peek.
- Video. This prompt will soon work: “15 second video of three polar bears enjoying a Lipton Iced Tea in the sun”. There will be AI-generated films in cinemas. But we’re not there yet.
Ethics
There are ethical concerns around using artists’ names directly in prompts without permission, particularly for living artists. Also, an AI-generated artwork won first place at a state fair fine arts competition.
For me, it makes sense to clearly delineate what is AI and what is human generated. Every AI-generated image should carry some immutable metadata saying exactly what it is – for transparency, equality, and to bury the bullshit before it flies.
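To make the metadata idea a little more concrete, here is a rough sketch of a provenance record stored alongside an image. It is not an existing standard and it is not truly immutable: the field names and both helper functions are assumptions of mine. All it does is tie a label ("this is AI-generated, from this prompt") to the exact bytes of the file via a SHA-256 hash, so any later edit to the image is detectable. Anyone could still delete the record, so treat it as an illustration of the principle only.

```python
import hashlib
import json


def provenance_record(image_bytes, generator, prompt):
    """Build a hypothetical provenance record for an AI-generated image."""
    return {
        "sha256": hashlib.sha256(image_bytes).hexdigest(),
        "generator": generator,
        "prompt": prompt,
        "ai_generated": True,
    }


def matches(image_bytes, record):
    """Return True if the image bytes are the ones the record was made for."""
    return hashlib.sha256(image_bytes).hexdigest() == record["sha256"]


# Placeholder bytes standing in for a real image file.
fake_image = b"placeholder bytes standing in for a real image file"

record = provenance_record(
    fake_image,
    generator="DALL·E",
    prompt="A botanical illustration of a fish on parchment paper",
)

print(json.dumps(record, indent=2, ensure_ascii=False))
print(matches(fake_image, record))          # unchanged image
print(matches(fake_image + b"x", record))   # tampered image
```

A real scheme would also need the record to be signed and carried with the file, which this sketch leaves out.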
Beyond images
OpenAI, the company behind DALL·E, also makes GPT-3, a model that is all about natural language tasks. It’s already very capable and can copywrite, summarise, classify, and translate. You can use it now to make marketing material for your company, summarise lengthy articles, or brainstorm with it as a friend/scratchpad.
OpenAI is also building a prompt-to-code package, meaning developers might be working themselves out of a job. Or at least outsourcing mundane code to AI.
AI-generated music is also a thing. Personal choice, but I’m not paying attention yet. Nothing has wowed me in that space so far.
It's now time to level up your understanding, practise prompt-writing, and invent new ways to apply this technology.
Resources
- Lexica – search a database of already-written prompts (good if you’re out of free credits elsewhere).
- The DALL·E 2 Prompt Book
- List of AI art image synthesis tools
Gallery
![Action photograph of a cricket match where a player dives to catch a meat pie while wearing an akubra hat, overhead sunlight in 1976](/img/dalle/day-1.jpg)
![The red hot chili peppers band performing on stage indoors, where all performers are replaced by giant pepper vegetables 🌶️, dynamic and ecstatic mood, low-key, dramatic, red lighting](/img/dalle/day-2b.png)
![A yowie in the Australian forest with 12 beers, in the style of Hokusai](/img/dalle/day-3a.png)
![A tiny civilisation living on a mossy rock, fantasy art](/img/dalle/day-28b.png)
![20 black cockatoos perched on a long tree branch, in the style of Sidney Nolan](/img/dalle/day-14a.png)
![A jug of wine and a bottle of bread on a messy table, whimsical 1960s oil painting](/img/dalle/day-21a.png)
![A weeping willow hanging over a lone fisherman by a pond, vaporwave](/img/dalle/day-24b.png)
![ET riding a jetski into the moon, claymation](/img/dalle/day-25.gif)
![A hobo sleeping in the gutter, on a street where a bed with wheels is driving past, painting](/img/dalle/day-32d.png)
![A bridge that connects two ponds, arching over land, digital art](/img/dalle/day-33a.png)
![A giant blue groper leaping out of the water at Clovelly Beach, in the style of Arthur Streeton](/img/dalle/day-5.png)
![The ghost of electricity in the bones of a womans face, melancholic and subdued impressionist oil painting](/img/dalle/day-15a.png)
![Russian president looking into the mirror, surrealist melting clocks, like Dali](/img/dalle/day-9a.png)
![A koala driving a bulldozer into a house, colourful childs drawing in crayon](/img/dalle/day-10.png)
![3D render of jesus drinking a babycino in a busy nightclub, close-up in a dark room with strobe lighting](/img/dalle/day-6.png)
![Realistic close-up portrait of a child in pencil](/img/dalle/day-30b.png)
![A path to hell paved with good intentions, William Blake illuminated print](/img/dalle/day-16a.png)
![A yowie in the Australian forest with 12 beers, in the style of Hokusai](/img/dalle/day-3d.png)
![Russian president looking into the mirror, surrealist melting clocks, like Dali](/img/dalle/day-9b.png)
![Art Nouveau painting of hot air balloons drifting over a mountain range](/img/dalle/day-27b.png)
![A hyper-realistic movie poster showing a giant 5ft duck surrounded by countless tiny horses](/img/dalle/day-7a.png)
![Classified ad for jousting sticks in an archival newspaper](/img/dalle/day-34.png)
![1970s movie poster of a monkey in outerspace watching a supernova](/img/dalle/day-31a.png)
![Ned Kelly wearing armour in the australian bush, pixel art](/img/dalle/day-13a.png)
![20 black cockatoos perched on a long tree branch, in the style of Sidney Nolan](/img/dalle/day-14b.png)
![Art Nouveau painting of hot air balloons drifting over a mountain range](/img/dalle/day-27c.png)
![A rat swimming in a golden chalice, classical painting](/img/dalle/day-29b.png)
![A yowie in the Australian forest with 12 beers, in the style of Hokusai](/img/dalle/day-3c.png)
![Ancient greek fresco painting of people dabbing](/img/dalle/day-12b.png)
![The red hot chili peppers band performing on stage indoors, where all performers are replaced by giant pepper vegetables 🌶️, dynamic and ecstatic mood, low-key, dramatic, red lighting](/img/dalle/day-2a.png)
![The ghost of electricity in the bones of a womans face, melancholic and subdued impressionist oil painting](/img/dalle/day-15b.png)
![Gigantic steel sculpture, black and white photograph](/img/dalle/day-22.png)
![A path to hell paved with good intentions, William Blake illuminated print](/img/dalle/day-16b.png)
![20 black cockatoos perched on a long tree branch, in the style of Sidney Nolan](/img/dalle/day-14d.png)
![Old people sun bathing on concrete beside the fountain of youth, Northern Renaissance oil painting](/img/dalle/day-23b.png)
![An emu eating a chiko roll in the australian outback](/img/dalle/day-17.png)
![A hobo sleeping in the gutter, on a street where a bed with wheels is driving past, painting](/img/dalle/day-32c.png)
![A yowie in the Australian forest with 12 beers, in the style of Hokusai](/img/dalle/day-3b.png)
![A bridge that connects two ponds, arching over land, digital art](/img/dalle/day-33b.png)
![Ancient greek fresco painting of people dabbing](/img/dalle/day-12a.png)
![Snow falling on a cottage in a vast meadow, style of studio ghibli](/img/dalle/day-20.png)
![A jug of wine and a bottle of bread on a messy table, whimsical 1960s oil painting](/img/dalle/day-21b.png)
![Interior of a old quaint Australian home](/img/dalle/day-18.png)
![Old people sun bathing on concrete beside the fountain of youth, Northern Renaissance oil painting](/img/dalle/day-23a.png)
![A botanical illustration of a fish on parchment paper](/img/dalle/day-4.png)
![A weeping willow hanging over a lone fisherman by a pond, vaporwave](/img/dalle/day-24a.png)
![A pokemon dragon laying on a thousand dragonfruits](/img/dalle/day-26.png)
![20 black cockatoos perched on a long tree branch, in the style of Sidney Nolan](/img/dalle/day-14c.png)
![A hyper-realistic movie poster showing a giant 5ft duck surrounded by countless tiny horses](/img/dalle/day-7b.png)
![One small violet flower in the middle of an expansive sandy desert, distant film photograph in daylight f22](/img/dalle/day-8.png)
![Art Nouveau painting of hot air balloons drifting over a mountain range](/img/dalle/day-27a.png)
![A tiny civilisation living on a mossy rock, fantasy art](/img/dalle/day-28a.png)
![A rat swimming in a golden chalice, classical painting](/img/dalle/day-29a.png)
![A hobo sleeping in the gutter, on a street where a bed with wheels is driving past, painting](/img/dalle/day-32a.png)
![Realistic close-up portrait of a child in pencil](/img/dalle/day-30a.png)
![1970s movie poster of a monkey in outerspace watching a supernova](/img/dalle/day-31b.png)
![A hobo sleeping in the gutter, on a street where a bed with wheels is driving past, painting](/img/dalle/day-32b.png)
![A metropolis overrun by moss and vines and lizards, fantasy art](/img/dalle/day-35.png)
![An accurate topographical map of New Zealand](/img/dalle/day-36.png)
![A seagull that has been dipped in blue paint, style of Audubon](/img/dalle/day-37.png)