When I run tensenflow resnet50 benchmark on tf_opt, a segmemtation fault may accur in "__cudaPopCallConfiguration" function at some point.