Does anyone know the table or documentation that lists these pairs?
These pairs are over 9000 and all of them cannot be listed.
For instance:
VADD.F32 q0,q0,q1 VMUL.F32 q3,q0,q2
the first command writes the result in the 4th cycle, while the second command requires (q0) as the source in the 2nd cycle, since the source is not ready yet, there is a stall (or conveyor hole) between these two instructions.
To calculate these kiosks, you can use the following online tool:
http://pulsar.webshaker.net/ccc/result.php?lng=us
source share