 See above. The gemm_kernels get longer with concurrency (not really though).