v0.0.12
- Larger GPT2 models can now work with long sequences in GPU without running out of memory.
- Neuron activations: ability to specify capturing activations from certain layers
Thanks to contributor @nostalgebraist
Thanks to contributor @nostalgebraist