There has been a lot of talk about AI generated images and visual art. One of the projects that I found most impressive was DALL·E 2. This is a new AI system that can create realistic images and art from a description in natural language. Currently it is invite only so I entered my email to the waitlist. A few days later, the invite arrived. Instead of focusing on more meaningful work I fooled around with the system. Here’s what I got…
Natural language descriptions
As a primarily landscape photographer I wanted to test when will AI replace me. My first description that I typed in was “snow covered mountains with a rainbow and a flying unicorn”. And DALL·E 2 created this,
Clearly, I can keep my job for now?! Next I tried something more realistic, “beautiful waterfall with a lots of green moss and rocks in the foreground”
If these above are really AI generated, I am impressed! It looks real. So I continued with “autumn forest in the mist”
A bit dull so I added “autumn forest in the mist with mushrooms on the ground”
Impressive again! Let’s try something harder, “autumn forest in the mist with mushrooms on the ground and a deer in the back”
Less impressive?… I do like how the AI added the birds and how the mushrooms started to levitate?.
Uploading my own images
DALL·E 2 offers an option to upload your own images and then the system generates variations. This sounds interesting too and might be even greater test of the AI power. The AI has to “read” the image and then create meaningful alternatives. How did it work? Let’s see.
My image of Lake Bled in autumn seemed perfect example to test. The composition, the colours and the subject are all distinct and should be easy to replicate.
Next I wanted to test how DALL·E 2 generates people in the image. I uploaded one of my travel portraits and got this message.
Fair enough, it makes sense. So I tried with a more silhouetted person, without recognisable face. This time it worked.
The photographer and his camera in hand are both unnaturally skewed and distorted. The mountain layers are all very realistic and obviously changed.
Editing existing images
Once you upload your own images, there is an option to edit them. Simply by using brush on a part of the image and describing in text what you want to create in this area. Here are a few results…
As you can see, not very impressive. It’s fun, but not realistic by any means. The land rovers are quite okay, but the sheep or the birds are clearly fake. Here below are the original images I used to create the above variations.
DALL·E 2 – Conclusion
AI has definitely made a huge progress in the last few years. We’ve seen music composed by the computers, deep fake videos of famous dead people and now images being created by simply typing in text descriptions! I am not sure if AI is the right term as these are a set of algorithms that learn and improve with more and more data being fed to it. Anyways, the results I got with playing around were better than expected. There are many completely silly results, but sometimes the algorithm gets it really well.
I can see how useful this can become in the future. And how easy it will be to abuse it. The technology however is still in its infancy. At the moment I see it more ore less as a toy to experiment with. Maybe in a few years this will change and we will see a flood of amazing images that have nothing to do with reality. Then I believe, true photography craft will only become more valued.
What are your thoughts on these emerging AI image/art tools? Let me know what you think in the comments below. Thank you!
Thank you very much for sharing. Please keep me posted
Great article and examples!
Thank you Luka for the article. Impressive and fun if one treats it as a game but scary to think of the possible dark side to AI applications