
0.2.0 — faster training!

@proger released this 20 May 10:34
db7145f

@unixpickle fused the sequence reversal required by the backward pass into the kernel and vectorized its loads and stores. Training is 30-40 percent faster on a 3090.
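To illustrate why fusing the reversal can help, here is a minimal pure-Python sketch, not the actual kernel. It assumes the scan is a first-order linear recurrence `h[t] = a[t] * h[t-1] + x[t]`, which this release note does not state; the function names are hypothetical. Under that assumption the gradient with respect to `x` is the same kind of scan run from the last step to the first. One way to compute it is to reverse the inputs, run the forward scan, and reverse the result, which creates extra copies. The fused variant walks the indices backward instead.

```python
def scan_forward(a, x):
    """Assumed recurrence: h[t] = a[t] * h[t-1] + x[t], with h[-1] = 0."""
    h, prev = [], 0.0
    for a_t, x_t in zip(a, x):
        prev = a_t * prev + x_t
        h.append(prev)
    return h


def grad_x_with_flips(a, grad_h):
    """Gradient w.r.t. x via explicit reversed copies and the forward scan."""
    # The reverse recurrence at step t uses the gate of step t + 1.
    a_shift = a[1:] + [0.0]
    rev = scan_forward(a_shift[::-1], grad_h[::-1])  # two reversed copies
    return rev[::-1]                                 # one more reversal


def grad_x_fused(a, grad_h):
    """Same gradient, with the reversal folded into the loop indexing."""
    T = len(a)
    g = [0.0] * T
    carry = 0.0
    for t in range(T - 1, -1, -1):
        a_next = a[t + 1] if t + 1 < T else 0.0
        carry = grad_h[t] + a_next * carry
        g[t] = carry
    return g


a = [0.5, 2.0, 0.25]
grad_h = [1.0, 1.0, 1.0]
print(grad_x_fused(a, grad_h))
```

Both functions return the same values; the fused one simply avoids materializing the reversed sequences. In a GPU kernel the saving would come from skipping the separate reversal passes over memory.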
