We’re excited to release neon v1.5 with Python 2 and Python 3 support, support for Pascal GPUs (GTX 1080) and performance enhancements such as persistent RNN kernels (based on the paper by Greg Diamos at Baidu), bringing a 12x performance gain compared to v1.4.0.
Highlights from this release include:
RNN kernels benchmarked on Titan X with batch size 4 and 1152 activations. Public Baidu kernels from GitHub.
As always, you can grab this release from github at: https://github.com/NervanaSystems/neon