In some CUDA materials, I often found the following words:
"At any time, only one of the warps is performed by SM."
Here I do not quite understand, since each SM can simultaneously launch hundreds or thousands of threads, why only one warp, which is 32 threads, can be executed at a time?
Thanks!
CUDA, , , SM 8 , 4 ( 4 ). , 4- SMT, 32 SM.
, GPU SM. 30, 30 x 32 warps = 960 , . , , , . 960 "" , 960 .
Tesla, . SM . warp warp (, ) . SM . CUDA , . /, ..
1.x (Tesla)
Compute Capability 2.0 (Fermi 1st Generation)
Compute Capability 2.1 (Fermi 2nd Generation)
3.x (Kepler)
Source: https://habr.com/ru/post/1527116/More articles:How to solve "Failed to create route1 route exception" in Apache Camel? - javaReport services report freezes only in Internet Explorer - internet-explorerCompleting a UI job - javaCUDA: банковские конфликты между различными перекосами? - shared-memoryClang auto variable template error - c ++Why doesn't git rm -rf get rid of a folder with a submodule in it? - gitpython, selenium, chromedriver 'selenium.common.exceptions.WebDriverException: Message: u'chrome is unavailable - pythonSelenium and wordpress: new post test - seleniumChange color based on background color with Sass - cssConEmu Commands in the Task - batch-fileAll Articles