Neural Style Transfer MXNet

style target

Pick a content and a style:

content picture

What is this?

Neural Style Transfer is the process of taking the content of one image and fusing it with the style of another, for example famous grand masters paintings. The demo seen here is trained on the Image Transformation Network proposed by Johnson et. al. using MXNet and deployed using Amazon Elastic Container Service. The training code used is from the mxnet-fast-neural-style repository.

How does it work?

The Image Transformation Network proposed by Johnson et. al.

The Image Transformation Network for Neural Style Transfer is trained on a per style basis, (i.e) train one model per style image that can be used to transfer the style to any input content image. Figure 1 gives a brief overview of the Image Transformation Network training procedure.

It's all about the losses

Equation 1: Feature reconstruction loss

The goal is to learn weights for the image transformation network based on the feature reconstruction loss and the style reconstruction loss. The Feature reconstruction loss penalizes the Image Transformation Network for the features of the output image y^{^} deviating from the features of the input (the content image x), at particular layers of the VGG net. The feature reconstruction loss is presented in Equation 1.

The Style reconstruction loss is the squared Frobenius norm of the difference between the Gram matrices of the output and target style images. It is described in Equation 2. At high level, the style reconstruction loss penalizes the Image transformation network for the features of the generated image deviating from the features of the style image.

Equation 2: Style reconstruction loss

Note: More information about the use of Gram matrices for style reconstruction loss can be found in Demystifying Neural Style Transfer. To train the model end to end, the loss functions are combined as shown in Equation 3. It is the weighted sum of the feature reconstruction loss and the style reconstruction loss and other simpler loss functions such as the per pixel loss and the total variation regularization.

Equation 3: Perceptual losses for neural style transfer

Resources

Training code for fast neural style transfer
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
MXNet Github Repo
Configuring Docker container for MMS
Serverless Inference with MMS on Fargate
MXNet Gluon 60 minutes crash course: Get started with MXNet Gluon in this 60 minutes crash course
Icons made by www.flaticon.com is licensed by CC 3.0 BY

Acknowledgement

Demo based on SK's work

Questions?

Create an issue on the demo github repo
Reach out on twitter, linkedin or github

Demo built by Thomas Delteil

Neural Style Transfer with