cuBLAS convolution. FFT-based convolution is therefore slower than this method.
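
cuBLAS itself has no convolution routine, so the sketch below assumes "cuBLAS convolution" means lowering the convolution to a matrix product (the im2col approach) and handing that product to a GEMM call. It is a pure-Python illustration of the idea, not the original implementation: the helper names `im2col_1d`, `matmul`, and `conv1d_as_gemm` are hypothetical, and `matmul` only stands in for where a GEMM routine such as `cublasSgemm` would run on the GPU.

```python
def im2col_1d(signal, kernel_size):
    """Collect every sliding window of `signal` as one row of a patch matrix."""
    n_out = len(signal) - kernel_size + 1
    return [signal[i:i + kernel_size] for i in range(n_out)]


def matmul(a, b):
    """Plain matrix product; a stand-in for a GEMM call (assumption)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def conv1d_as_gemm(signal, kernels):
    """Valid-mode 1D convolution of one signal with several equal-length kernels.

    All kernels are applied in a single matrix product:
    (n_out x k) patches times (k x n_kernels) weights.
    """
    k = len(kernels[0])
    patches = im2col_1d(signal, k)
    # True convolution flips each kernel; without the flip this is correlation.
    flipped = [kernel[::-1] for kernel in kernels]
    # Transpose so that each kernel becomes one column of the weight matrix.
    weights = [list(col) for col in zip(*flipped)]
    return matmul(patches, weights)


if __name__ == "__main__":
    result = conv1d_as_gemm([1, 2, 3, 4], [[1, 0, -1], [1, 1, 1]])
    print(result)  # one row per output position, one column per kernel
```

The patch matrix repeats input samples, so this approach trades extra memory for one large, regular matrix product. Whether it beats an FFT-based method in practice depends on kernel size, batch size, and hardware.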
