2.1
- Added transformer encoder and decoder (thanks to @Adversing)
- Added attention layer (thanks to @Adversing)
- 10x Speed up for Adam optimizer
- Added basics for CNNs
- Added
Vector
class to make dataset management easier - Added the basic gradient descent optimizer
- Added
ELU
,GELU
andSoftmax
activations
Full Changelog: 2.0.3...2.1