Constant memory versus texture memory and global memory in CUDA

Question

I am trying to understand the differences between constant memory, texture memory, and global memory in CUDA.

I found the following related material, but it does not answer my question:

An article that looks at the performance implications of all three: http://forum.beyond3d.com/showthread.php?t=52510


thinkcool Nov 29 '11 at 6:53

1 answer

thinkcool · Accepted Answer · 2011-11-29T08:49:04+0000

Constant memory:

Kernel constants and arguments are stored here.

Slow, but with a cache (8 KB)

The constant cache is optimized for broadcast (all threads reading the same address)
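
To make the broadcast point concrete, here is a minimal sketch using `__constant__` memory. The names `coeffs`, `evalPoly` and `evalPolyOnGpu` are made up for illustration, and error checking is left out for brevity. Every thread in a warp reads the same `coeffs[k]` at the same moment, which is the access pattern the constant cache is meant to serve.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical example data: four polynomial coefficients in constant memory.
__constant__ float coeffs[4];

// Each thread evaluates c0 + c1*v + c2*v^2 + c3*v^3 for its own v.
// All threads of a warp read the same coeffs[k] together (broadcast).
__global__ void evalPoly(const float* x, float* y, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        float v = x[i];
        // Horner's rule
        y[i] = ((coeffs[3] * v + coeffs[2]) * v + coeffs[1]) * v + coeffs[0];
    }
}

// Illustrative host helper: uploads the coefficients, runs the kernel,
// and returns the results. Error checking is omitted in this sketch.
std::vector<float> evalPolyOnGpu(const std::vector<float>& x, const float (&c)[4]) {
    int n = static_cast<int>(x.size());
    std::vector<float> y(n);
    if (n == 0) return y;

    // Constant memory is written from the host with cudaMemcpyToSymbol.
    cudaMemcpyToSymbol(coeffs, c, sizeof(c));

    float *dx = nullptr, *dy = nullptr;
    cudaMalloc(&dx, n * sizeof(float));
    cudaMalloc(&dy, n * sizeof(float));
    cudaMemcpy(dx, x.data(), n * sizeof(float), cudaMemcpyHostToDevice);

    evalPoly<<<(n + 255) / 256, 256>>>(dx, dy, n);

    cudaMemcpy(y.data(), dy, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(dx);
    cudaFree(dy);
    return y;
}
```

If instead each thread indexed `coeffs` with a different, thread-dependent index, the reads would no longer be a broadcast and would likely be serialized, so that kind of lookup table is usually a poorer fit for constant memory.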

Texture memory:

Cache optimized for 2D spatial access pattern

Reads come with some extras, such as addressing modes and interpolation (filtering), that can be used at no extra cost.
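
As a rough sketch of the addressing modes and interpolation mentioned above, the following uses the texture object API (which postdates this answer and, as far as I know, needs compute capability 3.0 or later). The names `sampleTex` and `sampleImage` are hypothetical, and error checking is omitted. The texture is set up with clamp addressing and linear filtering, so out-of-range coordinates are clamped and fractional coordinates are interpolated by the hardware.

```cuda
#include <cuda_runtime.h>
#include <cstring>
#include <vector>

// Fetches one filtered sample per thread. Coordinates are unnormalized,
// so texel (i, j) has its center at (i + 0.5, j + 0.5).
__global__ void sampleTex(cudaTextureObject_t tex, const float2* pts, float* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = tex2D<float>(tex, pts[i].x, pts[i].y);
}

// Illustrative host helper: uploads a w-by-h float image (row-major) into a
// CUDA array, binds it to a texture object, and samples it at the given points.
std::vector<float> sampleImage(const std::vector<float>& img, int w, int h,
                               const std::vector<float2>& pts) {
    int n = static_cast<int>(pts.size());
    std::vector<float> out(n);
    if (n == 0) return out;

    // Textures with 2D locality are backed by a cudaArray.
    cudaChannelFormatDesc desc = cudaCreateChannelDesc<float>();
    cudaArray_t arr;
    cudaMallocArray(&arr, &desc, w, h);
    cudaMemcpy2DToArray(arr, 0, 0, img.data(), w * sizeof(float),
                        w * sizeof(float), h, cudaMemcpyHostToDevice);

    cudaResourceDesc res;
    memset(&res, 0, sizeof(res));
    res.resType = cudaResourceTypeArray;
    res.res.array.array = arr;

    cudaTextureDesc td;
    memset(&td, 0, sizeof(td));
    td.addressMode[0] = cudaAddressModeClamp;  // clamp out-of-range x
    td.addressMode[1] = cudaAddressModeClamp;  // clamp out-of-range y
    td.filterMode = cudaFilterModeLinear;      // bilinear interpolation
    td.readMode = cudaReadModeElementType;
    td.normalizedCoords = 0;

    cudaTextureObject_t tex = 0;
    cudaCreateTextureObject(&tex, &res, &td, nullptr);

    float2* dPts = nullptr;
    float* dOut = nullptr;
    cudaMalloc(&dPts, n * sizeof(float2));
    cudaMalloc(&dOut, n * sizeof(float));
    cudaMemcpy(dPts, pts.data(), n * sizeof(float2), cudaMemcpyHostToDevice);

    sampleTex<<<(n + 127) / 128, 128>>>(tex, dPts, dOut, n);

    cudaMemcpy(out.data(), dOut, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(dPts);
    cudaFree(dOut);
    cudaDestroyTextureObject(tex);
    cudaFreeArray(arr);
    return out;
}
```

Sampling halfway between texel centers returns a blend of the neighbouring texels without any arithmetic in the kernel. Note that the hardware interpolation weights have limited precision, so results should be compared with a tolerance.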

Global memory:

Slow and uncached (compute capability 1.0), cached (compute capability 2.0)

Requires sequential, aligned 16-byte reads and writes to be fast (coalesced reads/writes)
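
The sequential-access requirement can be illustrated with a small experiment. This is only a sketch: `scaleStrided` and `timeScale` are hypothetical names, error checking is omitted, and it assumes the stride is coprime to the array length so that every element is touched exactly once. With `stride == 1`, neighbouring threads touch neighbouring addresses; with a larger stride, the threads of a warp scatter across memory.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Thread i updates element (i * stride) % n.
// Assumption: gcd(stride, n) == 1, so the mapping is a permutation
// and no two threads touch the same element.
__global__ void scaleStrided(float* data, int n, int stride) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        int j = static_cast<int>((static_cast<long long>(i) * stride) % n);
        data[j] *= 2.0f;
    }
}

// Illustrative host helper: doubles every element of `host` on the GPU using
// the given stride and returns the kernel time in milliseconds.
float timeScale(std::vector<float>& host, int stride) {
    int n = static_cast<int>(host.size());
    if (n == 0) return 0.0f;

    float* d = nullptr;
    cudaMalloc(&d, n * sizeof(float));
    cudaMemcpy(d, host.data(), n * sizeof(float), cudaMemcpyHostToDevice);

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaEventRecord(start);
    scaleStrided<<<(n + 255) / 256, 256>>>(d, n, stride);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);

    cudaMemcpy(host.data(), d, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d);
    return ms;
}
```

Calling `timeScale(v, 1)` and `timeScale(v, 33)` on an array of, say, 1 << 20 floats gives the same data result either way; the strided version is expected to be slower, though by how much depends on the GPU generation and its caches.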