Compute capability 3.x devices (Kepler) have 4 warp schedulers per SM. On each cycle, each warp scheduler selects a warp and issues 1-2 instructions from that warp. The SM has only one load/store unit (LSU), which services L1 and shared memory requests, so only 1 of the 8 potential instructions per cycle can be issued to the LSU. As a result, bank conflicts between warps cannot occur.
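To make the arithmetic concrete, here is a rough sketch of the issue-slot count and of how a bank conflict would be counted for one warp's request, since that is the only request the LSU handles at a time. It is a simplified model, not a description of the actual hardware logic: the 32 banks and 4-byte bank width are assumptions (Kepler can also be configured for 8-byte banks), and `conflict_degree` is a hypothetical helper name.

```python
# Simplified model; constants marked "assumption" are not from the text above.
WARP_SCHEDULERS_PER_SM = 4       # from the text: 4 warp schedulers per SM
MAX_ISSUE_PER_SCHEDULER = 2      # from the text: 1-2 instructions per cycle
POTENTIAL_ISSUES_PER_CYCLE = WARP_SCHEDULERS_PER_SM * MAX_ISSUE_PER_SCHEDULER

WARP_SIZE = 32
NUM_BANKS = 32                   # assumption: 32 shared memory banks
BANK_WIDTH_BYTES = 4             # assumption: 4-byte bank mode


def conflict_degree(stride_words, word_bytes=4):
    """Return the largest number of distinct words that the lanes of ONE warp
    request from the same bank, for a strided access pattern.

    A result of 1 means no conflict; N means an N-way conflict in this model.
    Lanes reading the same word are counted once (broadcast).
    """
    words_per_bank = {}
    for lane in range(WARP_SIZE):
        byte_addr = lane * stride_words * word_bytes
        word = byte_addr // BANK_WIDTH_BYTES
        bank = word % NUM_BANKS
        words_per_bank.setdefault(bank, set()).add(word)
    return max(len(words) for words in words_per_bank.values())


if __name__ == "__main__":
    print("potential instructions per cycle:", POTENTIAL_ISSUES_PER_CYCLE)
    for stride in (0, 1, 2, 32):
        print("stride", stride, "->", conflict_degree(stride), "-way")
```

In this model the conflict degree depends only on the 32 addresses of a single warp. Because only one of the 8 potential instructions reaches the LSU per cycle, there is never a second warp's request to conflict with.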