Does the following code trigger an AVX-to-SSE transition? If so, how can I compute the horizontal sum of a __m256 vector without incurring that transition penalty?
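    __m256 x_swap = _mm256_permute2f128_ps(x, x, 1);  /* swap the two 128-bit lanes */
    x = _mm256_add_ps(x, x_swap);                     /* add the low and high lanes */
    x = _mm256_hadd_ps(x, x);                         /* pairwise sums within each lane */
    x = _mm256_hadd_ps(x, x);                         /* every element now holds the total */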
Thanks.