An adaptation of Lucid for visualizing features of the I3D video classification network. The network was pretrained on the Kinetics dataset (weights source).
At the time this code was developed, the n-dimensional real inverse Fourier transform was not differentiable in TensorFlow. To work around this, a function was written that turns the spectrum of a real signal into a Hermitian array, so that the complex inverse transform, which was differentiable, could be used instead. The source code can be found here.
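The idea behind the workaround can be illustrated with a minimal sketch. It is written in NumPy rather than TensorFlow so that it runs standalone, and it is not the code from this repository: the function name `hermitian_extend` and its `last_dim` parameter are hypothetical, and the sketch assumes every axis of the input is a transform axis (no batch or channel axes). The half-spectrum of a real signal is extended along the last axis with the conjugate of its mirrored entries, after which the ordinary complex inverse transform recovers the signal.

```python
import numpy as np


def hermitian_extend(half, last_dim):
    """Rebuild the full spectrum of a real signal from its half-spectrum.

    `half` is laid out like the output of np.fft.rfftn over all axes, and
    `last_dim` is the length of the last axis of the original real signal.
    Hypothetical helper for illustration only.
    """
    # Number of columns along the last axis that the half-spectrum omits.
    n_missing = last_dim - half.shape[-1]
    # The omitted column k mirrors column (last_dim - k), so take columns
    # 1..n_missing and reverse their order.
    tail = np.flip(half[..., 1:n_missing + 1], axis=-1)
    # On every other axis, map index k to (-k) mod n: reverse the axis,
    # then roll by one so that index 0 stays in place.
    for axis in range(half.ndim - 1):
        tail = np.roll(np.flip(tail, axis=axis), 1, axis=axis)
    # Hermitian symmetry: F[k] = conj(F[-k]).
    return np.concatenate([half, np.conj(tail)], axis=-1)


# Usage: recover a real 3-D signal with the complex inverse transform.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 5, 6))
half = np.fft.rfftn(x)
full = hermitian_extend(half, x.shape[-1])
recovered = np.fft.ifftn(full).real
```

In a TensorFlow version, the same steps would presumably map onto `tf.reverse`, `tf.roll`, `tf.math.conj`, and `tf.concat`, which is what would let gradients flow through the complex inverse transform.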
Click on the images to see a higher-resolution video.