When I include the spatialdepthwiseconvolution in the network, the GPU memory usage keep growing during training, then running out of memory.