AI&Creativity: Visual Anarchist Realism of CycleGAN (and how AI cheats on developers)

You already tried out transforming photos into a painting by Van Gogh. But have you ever imagined the other way round? I mean, transforming painting into photos? While the first attempt is close to a filter function (visuals are adjusted and style is transfered, so you can recognize specific artistic character), the second one appears to be nearly impossible. Bringing to reality an artistic phantasy. Pygmalion would be proud.
Wrong. It’s possible now. With CycleGAN.
The principle of this process is already containing in the title — and in the paper “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”. A huge difference to previous attemps is in following:
Image-to-image translation is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs. However, for many tasks, paired training data will not be available. We present an approach for learning to translate an image from a source domain X to a target domain Y in the absence of paired examples.
The cycled process can allow the transition between styles and other visual aspects of an image with consistent data coherence. In short: if you take image A and transform it with Deep Learned Adversarial Networks to image B, you can transform it back without data loss.
But there is more. Consistency is just a beginning. AI can recognize objects — and you can manipulate the recognition.
You can control the results with optional changing outputs. You can do wonders. The paper (PDF) and the website of the project shows the possibilities.
Here just some of them.
You can transform a painting to a photo.


GAN recognize the objects depicted by the artist — and interpret it with trained knowledge of photographic vision. Again, the nature elements and backgrounds (grass, water, sky) look photorealistic, already because the image libraries, which are used for training the Networks, contain the majority of backgrounds — along with specific objects in the foreground. Say, there are thousands of images containing dozen of dog races — but even if dogs are labelled according to their type, the backgrounds behind the dogs are generic and main.
You can even transfer the painting to the photo preserving color palette of the original:

Request: photographic vision of abstract painting. Will AI collapse or will we get some fascinating pseudo-realities? (OK, forget “pseudo”).
You can replace one object with another.
For example, semantic replacement of horse by a zebra (and vice versa):

You can even do ot with video footage:

Note the stripes of the zebra — the patterns remain consistent during the movements (even if other objects in foreground are also “zebrafied”)
Fruits are replaceable as well — even if the visual artefacts are present.

Well, nobody is perfect, and sometimes even AI does failures, like it doesn’t recognize the Russian President sitting on a horse, so…

You can change the season setting
Here: summer to winter.

The color palettes are adjusted and aligned to another season.
You can turn maps to aerial photos.
This is the newest development. Networks can intelligently read the map, recognize the topology and convert it to simulation of satellite photos:







