More accurate Winograd/Cook/Toom F(4x4, 3x3) transforms #224

andravin · 2016-04-03T04:10:28Z

Here are some more accurate transform matrices for the Winograd F(4x4, 3x3) kernel.

They are about 4X as accurate as the original transforms, and within a factor of 2X of the accuracy of direct convolution.

AT (the inverse transform) requires a couple more flops to compute than the original transform, because some of the values of +/-1 were replaced with rational numbers. I think the other transforms are the same number of flops.


AT =
⎡1   1      1     1      1    0⎤
⎢                              ⎥
⎢0  7/10  -7/10  3/2   -3/2   0⎥
⎢                              ⎥
⎢    49     49                 ⎥
⎢0  ───    ───   9/4    9/4   0⎥
⎢   100    100                 ⎥
⎢                              ⎥
⎢   343   -343                 ⎥
⎢0  ────  ─────  27/8  -27/8  1⎥
⎣   1000   1000                ⎦

G =
⎡ 400              ⎤
⎢ ───     0     0  ⎥
⎢ 441              ⎥
⎢                  ⎥
⎢-625   -125   -25 ⎥
⎢─────  ─────  ────⎥
⎢ 1078   308    88 ⎥
⎢                  ⎥
⎢-625    125   -25 ⎥
⎢─────   ───   ────⎥
⎢ 1078   308    88 ⎥
⎢                  ⎥
⎢  25     25    25 ⎥
⎢ ───    ───    ── ⎥
⎢ 198    132    88 ⎥
⎢                  ⎥
⎢  25   -25     25 ⎥
⎢ ───   ────    ── ⎥
⎢ 198   132     88 ⎥
⎢                  ⎥
⎣  0      0     1  ⎦

BT =
⎡441         -137              ⎤
⎢───    0    ─────    0    1  0⎥
⎢400           50              ⎥
⎢                              ⎥
⎢     -63                      ⎥
⎢ 0   ────   -9/4   7/10   1  0⎥
⎢      40                      ⎥
⎢                              ⎥
⎢      63                      ⎥
⎢ 0    ──    -9/4   -7/10  1  0⎥
⎢      40                      ⎥
⎢                              ⎥
⎢     -147   -49               ⎥
⎢ 0   ─────  ────    3/2   1  0⎥
⎢      200   100               ⎥
⎢                              ⎥
⎢      147   -49               ⎥
⎢ 0    ───   ────   -3/2   1  0⎥
⎢      200   100               ⎥
⎢                              ⎥
⎢      441          -137       ⎥
⎢ 0    ───     0    ─────  0  1⎥
⎣      400            50       ⎦

The text was updated successfully, but these errors were encountered:

andravin · 2016-04-04T18:32:01Z

This transform appears to be the same or slightly more accurate:

AT =
⎡1   1    1     1      1    0⎤
⎢                            ⎥
⎢   √2   -√2                 ⎥
⎢0  ──   ────   √2    -√2   0⎥
⎢   2     2                  ⎥
⎢                            ⎥
⎢0  1/2  1/2    2      2    0⎥
⎢                            ⎥
⎢   √2   -√2                 ⎥
⎢0  ──   ────  2⋅√2  -2⋅√2  1⎥
⎣   4     4                  ⎦

G =
⎡ 1     0     0  ⎤
⎢                ⎥
⎢      -√2       ⎥
⎢-2/3  ────  -1/3⎥
⎢       3        ⎥
⎢                ⎥
⎢       √2       ⎥
⎢-2/3   ──   -1/3⎥
⎢       3        ⎥
⎢                ⎥
⎢       √2       ⎥
⎢1/6    ──   1/3 ⎥
⎢       6        ⎥
⎢                ⎥
⎢      -√2       ⎥
⎢1/6   ────  1/3 ⎥
⎢       6        ⎥
⎢                ⎥
⎣ 0     0     1  ⎦

BT =
⎡1   0    -5/2   0    1  0⎤
⎢                         ⎥
⎢                √2       ⎥
⎢0  -√2    -2    ──   1  0⎥
⎢                2        ⎥
⎢                         ⎥
⎢               -√2       ⎥
⎢0   √2    -2   ────  1  0⎥
⎢                2        ⎥
⎢                         ⎥
⎢   -√2                   ⎥
⎢0  ────  -1/2   √2   1  0⎥
⎢    2                    ⎥
⎢                         ⎥
⎢    √2                   ⎥
⎢0   ──   -1/2  -√2   1  0⎥
⎢    2                    ⎥
⎢                         ⎥
⎣0   1     0    -5/2  0  1⎦

I had posted a variation of these matrices earlier that scaled some of the columns of AT and rows of G, but that seemed to hurt accuracy slightly.

scott-gray · 2016-04-30T01:00:41Z

This one is done is pending merge for release.

buttercutter · 2019-09-06T17:08:35Z

Could anyone advise how to obtain this improved version of winograd convolution from the original version which is described by equation (7) in the paper ?

andravin · 2019-09-06T21:32:40Z

Hi @ProMach

I generated the above transforms using an early version of the winCNN software, now available here: https://github.com/andravin/wincnn

Matrix A^T is a Vandermonde matrix, and the "polynomial roots" used by winCNN are just the numbers in the second row, leaving off the last column.

Since these matrices were released, others have researched possibly better transforms. Please refer to apache/tvm#3553

buttercutter · 2019-10-21T07:54:31Z

@andravin

I do not understand why the proposed toom-cook ALGORITHM 1 in the paper Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks does not need polynomial interpolation stage ?

buttercutter · 2020-03-29T08:23:51Z

Matrix AT is a Vandermonde matrix, and the "polynomial roots" used by winCNN are just the numbers in the second row, leaving off the last column.

@andravin What do you exactly mean by 'polynomial roots' ? and why are they located in the second row ?

jennifermyers added the enhancement label Apr 30, 2016

jennifermyers assigned scott-gray Apr 30, 2016

jennifermyers added this to the v1.5.0 milestone Jun 23, 2016

jennifermyers added the in progress label Jun 23, 2016

jennifermyers closed this as completed Jun 30, 2016

jennifermyers removed the in progress label Jun 30, 2016

andravin mentioned this issue Jul 19, 2019

Add Winograd matrices computation. apache/tvm#3553

Merged

This was referenced Aug 3, 2020

Derivation of matrices G and B jdnie/Winograd_study#1

Open

Winograd Convolution antkillerfarm/antkillerfarm.github.com#4

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More accurate Winograd/Cook/Toom F(4x4, 3x3) transforms #224

More accurate Winograd/Cook/Toom F(4x4, 3x3) transforms #224

andravin commented Apr 3, 2016

andravin commented Apr 4, 2016

scott-gray commented Apr 30, 2016

buttercutter commented Sep 6, 2019

andravin commented Sep 6, 2019

buttercutter commented Oct 21, 2019 •

edited

Loading

buttercutter commented Mar 29, 2020

More accurate Winograd/Cook/Toom F(4x4, 3x3) transforms #224

More accurate Winograd/Cook/Toom F(4x4, 3x3) transforms #224

Comments

andravin commented Apr 3, 2016

andravin commented Apr 4, 2016

scott-gray commented Apr 30, 2016

buttercutter commented Sep 6, 2019

andravin commented Sep 6, 2019

buttercutter commented Oct 21, 2019 • edited Loading

buttercutter commented Mar 29, 2020

buttercutter commented Oct 21, 2019 •

edited

Loading