1. Add debug flags when creating the RawKernel:
compute_systemG_kernel = cp.RawKernel(lines, "compute_systemG_kernel", options=("-G", "--generate-line-info"))
2. Launch with:
cuda-gdb --args python train.py
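
Since -G turns off device-code optimizations, you may not want it on for every run. Here is a minimal sketch of one way to toggle the flags. The helper names (kernel_options, build_kernel) and the KERNEL_DEBUG environment variable are made up for illustration and are not part of CuPy; only cp.RawKernel and its options argument come from the steps above.

```python
import os


def kernel_options(debug=None):
    """Return the compile options for cp.RawKernel.

    If `debug` is None, fall back to the hypothetical KERNEL_DEBUG
    environment variable (an assumption, not a CuPy feature).
    """
    if debug is None:
        debug = os.environ.get("KERNEL_DEBUG") == "1"
    # Same flags as in step 1; an empty tuple means no extra options.
    return ("-G", "--generate-line-info") if debug else ()


def build_kernel(source, name, debug=None):
    """Compile `source` as a RawKernel, with debug flags only on request."""
    import cupy as cp  # imported lazily so the helper above works without a GPU

    return cp.RawKernel(source, name, options=kernel_options(debug))
```

With this in place, something like `KERNEL_DEBUG=1 cuda-gdb --args python train.py` would build the kernel with debug info, while a normal run would compile it without the flags.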