void store_nta(double *p, F64vec2 A);
Stores (non-temporal) the two double-precision floating-point values of A.
Requires a 16-byte aligned address.
Corresponding intrinsic: _mm_stream_pd
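As a minimal sketch of the alignment requirement, the following uses the corresponding intrinsic directly rather than the F64vec2 class, so it does not depend on the class library headers. The helper name stream_two_doubles is hypothetical and not part of the library, and an x86 target with SSE2 is assumed.

```cpp
#include <cassert>
#include <cstdint>
#include <emmintrin.h>  // SSE2: __m128d, _mm_set_pd, _mm_stream_pd

// Hypothetical helper (not part of the class library): writes {lo, hi}
// to p with a non-temporal store. p must be 16-byte aligned.
inline void stream_two_doubles(double *p, double lo, double hi) {
    // Check the 16-byte alignment requirement before storing.
    assert(reinterpret_cast<std::uintptr_t>(p) % 16 == 0);
    __m128d v = _mm_set_pd(hi, lo);  // element 0 = lo, element 1 = hi
    _mm_stream_pd(p, v);             // non-temporal store of both values
    _mm_sfence();                    // order the streaming store before later stores
}
```

A caller can satisfy the alignment requirement with a declaration such as alignas(16) double buf[2]; and then call stream_two_doubles(buf, 1.5, 2.5).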
void store_nta(float *p, F32vec4 A);
Stores (non-temporal) the four single-precision floating-point values of A.
Requires a 16-byte aligned address.
Corresponding intrinsic: _mm_stream_ps
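Non-temporal stores are typically used to write a large buffer that will not be read again soon. The sketch below fills a float buffer four elements at a time with the corresponding intrinsic. The helper name stream_fill is hypothetical, and the code assumes an x86 target with SSE, a 16-byte aligned destination, and a length that is a multiple of 4.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <xmmintrin.h>  // SSE: __m128, _mm_set1_ps, _mm_stream_ps, _mm_sfence

// Hypothetical helper (not part of the class library): fills n floats at
// dst with value using non-temporal stores.
// Assumes dst is 16-byte aligned and n is a multiple of 4.
inline void stream_fill(float *dst, std::size_t n, float value) {
    assert(reinterpret_cast<std::uintptr_t>(dst) % 16 == 0);
    assert(n % 4 == 0);
    const __m128 v = _mm_set1_ps(value);  // all four lanes hold value
    for (std::size_t i = 0; i < n; i += 4) {
        _mm_stream_ps(dst + i, v);        // dst + i stays 16-byte aligned
    }
    _mm_sfence();  // order the streaming stores before later stores
}
```

Each iteration advances by four floats (16 bytes), so every store address keeps the alignment the function requires.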