Release Notes

Highlights

Add Apache MXNet backend support in Keras 2.1.6.
Supports Convolutional Neural Network (CNN) and experimental Recurrent Neural Network (RNN) training and inference.
Supports high-performance, distributed multi-GPU training of CNN and RNN networks.
Supports exporting a native MXNet model from a model trained with Keras-MXNet. This enables faster research with the Keras interface and high-performance, large-scale inference in production with the native MXNet engine. You can use any of the MXNet language bindings (Scala/Python/Julia/R/Perl) for inference on the exported model.
Add Keras benchmarking utility for performing CNN and RNN benchmarks with standard networks and datasets. Supports benchmarking on CPU, one GPU and multi-GPU distributed training.
Add keras.utils.to_channels_first() for easy conversion of channels_last data to channels_first.
PyPI package for keras-mxnet. Install with: pip install keras-mxnet.
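
The layout conversion mentioned above for keras.utils.to_channels_first() comes down to an axis permutation. The block below is a minimal NumPy sketch of that permutation, not the utility's actual implementation; the helper name channels_last_to_first and the 4D (batch, height, width, channels) input shape are assumptions made for illustration.

```python
import numpy as np


def channels_last_to_first(batch):
    """Reorder a 4D image batch from (N, H, W, C) to (N, C, H, W).

    Hypothetical helper for illustration only; it is not part of Keras.
    """
    batch = np.asarray(batch)
    if batch.ndim != 4:
        raise ValueError("expected a 4D batch, got %d dimensions" % batch.ndim)
    # Move the channel axis from the last position to position 1.
    # This permutes axes; it is not a reshape.
    return np.transpose(batch, (0, 3, 1, 2))


# Two 32x32 RGB images stored in channels_last layout.
images = np.zeros((2, 32, 32, 3), dtype="float32")
converted = channels_last_to_first(images)
print(converted.shape)  # (2, 3, 32, 32)
```

Note that a reshape to the same target shape would scramble pixel values; only a transpose keeps each pixel's channel values together.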

Keras-MXNet performs significantly better with the channels_first image_data_format; performance drops with the channels_last data format. See the tutorial.
Keras-MXNet CNN benchmarks show up to 3X performance improvement on GPUs. See the Benchmark Results document for more details.
Keras-MXNet RNN support is experimental. Performance on CPU is known to be up to 2X slower (Related issue). However, performance on GPU shows up to 2X improvement.
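
One way to apply the channels_first recommendation is through the Keras configuration file. The sketch below assumes the standard Keras config keys (backend, image_data_format) as stored in ~/.keras/keras.json, and assumes the backend name is "mxnet"; check both against the installation guide. The helper name and the default values shown are illustrative, and the block only builds the settings in memory without touching any file.

```python
import json


def with_mxnet_channels_first(config):
    """Return a copy of a Keras config dict set up for MXNet with channels_first.

    Hypothetical helper; the key names follow the usual keras.json layout.
    """
    updated = dict(config)  # leave the caller's dict untouched
    updated["backend"] = "mxnet"  # assumed backend name for Keras-MXNet
    updated["image_data_format"] = "channels_first"
    return updated


# Typical default contents of keras.json (assumed here, not read from disk).
default_config = {
    "floatx": "float32",
    "epsilon": 1e-07,
    "backend": "tensorflow",
    "image_data_format": "channels_last",
}

print(json.dumps(with_mxnet_channels_first(default_config), indent=4, sort_keys=True))
```

Data prepared for channels_last must also be converted to the channels_first layout; changing the setting alone does not reorder existing arrays.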

RNN with Keras-MXNet is experimental and does not support variable-length inputs or unroll=False. You need to pad input sequences to a static length, provide input_length, and set unroll=True. See the using RNN with Keras-MXNet tutorial for more details.
depthwise_conv2d and separable_conv2d operators are not supported.
18 Keras operators are not supported with the MXNet backend, including update operators, symbolic gradient, local_conv1d, local_conv2d, higher-order functions, and other operators such as cumsum, cumprod, stack, and ctc. See the Operators missing with MXNet backend GitHub issue for more details.
Sparse Tensors are not supported with MXNet backend in this release.
See the list of unsupported keras/examples.
Cross backend models are not supported. Training with TensorFlow backend and loading the Keras model with MXNet backend is not supported.
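
The RNN limitation above requires every input sequence to have the same static length. The block below is a dependency-free sketch of that padding step; the helper name pad_to_static_length and the choice to pad and truncate at the end are assumptions for illustration. Keras also ships keras.preprocessing.sequence.pad_sequences, which by default pads and truncates at the start instead.

```python
def pad_to_static_length(sequences, length, pad_value=0):
    """Pad or truncate each sequence so that all have exactly `length` items.

    Hypothetical helper for illustration; padding is appended at the end.
    """
    padded = []
    for seq in sequences:
        seq = list(seq)[:length]  # drop items beyond the static length
        padded.append(seq + [pad_value] * (length - len(seq)))
    return padded


# Variable-length token id sequences.
batch = [[5, 8, 2], [7], [4, 1, 9, 3, 6]]
print(pad_to_static_length(batch, 4))
# [[5, 8, 2, 0], [7, 0, 0, 0], [4, 1, 9, 3]]
```

The chosen static length (4 in this example) is the value to pass as input_length, together with unroll=True on the recurrent layer, as described in the RNN tutorial.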

MXNet backend performance drops significantly with the channels_last image_data_format. It is highly recommended to use the channels_first image_data_format. See the performance guide for more details.
The MXNet backend does not support the boolean type. For example, the in_topk operator with the MXNet backend uses 0/1 instead of boolean values. Issue
depthwise_conv2d supports depth_multiplier=1 only. Issue
LSTM layer fails if dropout is used. Issue
Models with Custom Loss are not serializable. Issue

Thanks to all the contributors for their contributions in this release:
@sandeep-krishnamurthy, @jiajiechen, @karan6181, @roywei, @kalyc