I need to copy 64-bit integer data from the host to the device memory.
Both of them are declared as unsigned __int64, and I used cudaMemcpyToSymbol().
When validated using Parallel Nsight, the copied data is displayed as a negative integer. I think the most significant bit of the lower 4 bytes is considered as a sign bit, which should not be.
Can someone give me any direction to come up with this?
source
share