Releases: mirkoruether/ann-cpp
Performance Level 4
CUDA support
Convolutional input layer
2.0 Conv and pooling layers are working
Performance Level 3
Performance measurement:
Hardware:
Laptop (ASUS FX753VE): Intel i7-7700HQ / NVIDIA GTX 1050Ti (Mobile)
Desktop: Intel i5 6600k / Palit Jetstream NVIDIA GTX 970
Configuration:
Neural Net: (784 | 30 | 10) fully connected
Precision: double
Training data: MNIST 60k
MiniBatchSize: 8
MiniBatchesPerEpoch: 7,500
Epochs: 5
Performance Level 3:
Parallelized:
- Feed forward and error calculation, by subdividing the mini-batch and processing the partial mini-batches in parallel
- Gradient calculation and parameter adjustment, by computing the bias and weight updates of each layer in parallel
Redesigned linear algebra with no memory allocation during training
x64 RelWithDebInfo (/MD /Zi /O2 /Ob1 /DNDEBUG)
Results:
Laptop: N/A
Desktop: 15.2s
Performance Level 2
Performance measurement:
Hardware:
Laptop (ASUS FX753VE): Intel i7-7700HQ / NVIDIA GTX 1050Ti (Mobile)
Desktop: Intel i5 6600k / Palit Jetstream NVIDIA GTX 970
Configuration:
Neural Net: (784 | 30 | 10) fully connected
Precision: double
Training data: MNIST 60k
MiniBatchSize: 8
MiniBatchesPerEpoch: 7,500
Epochs: 5
Performance Level 2:
Redesigned linear algebra with no memory allocation during training
x64 RelWithDebInfo (/MD /Zi /O2 /Ob1 /DNDEBUG)
Results:
Laptop: N/A
Desktop: 35.8s
Version 0.1 - Performance Level 0/1
Performance measurement:
Hardware:
Laptop (ASUS FX753VE): Intel i7-7700HQ / NVIDIA GTX 1050Ti (Mobile)
Desktop: Intel i5 6600k / Palit Jetstream NVIDIA GTX 970
Configuration:
Neural Net: (784 | 30 | 10) fully connected
Precision: double
Training data: MNIST 60k
MiniBatchSize: 8
MiniBatchesPerEpoch: 7,500
Epochs: 5
Performance Level 0:
x64 RelWithDebInfo (/MD /Zi /O2 /Ob1 /DNDEBUG)
Results:
Laptop: 136.1s
Desktop: 123.3s
Performance Level 1:
Parallel mini-batch error calculation
x64 RelWithDebInfo (/MD /Zi /O2 /Ob1 /DNDEBUG)
Results:
Laptop: 117.5s
Desktop: 101.7s