[Numpy-discussion] numpy.dot causes segfault after ctypes call to cudaMalloc