Cacheability Support Operations

Stores the two double-precision floating-point values of A to the address p using a non-temporal hint, which minimizes cache pollution. The address p must be 16-byte aligned.

void store_nta(double *p, F64vec2 A);
Corresponding intrinsic: _mm_stream_pd
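
As a minimal sketch of what this operation does, the example below calls the corresponding intrinsic directly instead of going through the F64vec2 class, so it builds with only the SSE2 intrinsics header. The helper name stream_two_doubles is hypothetical and not part of the class library, and the trailing _mm_sfence is an assumption about how a caller might order the store before later reads.

```cpp
#include <emmintrin.h>  // SSE2: _mm_stream_pd, _mm_set_pd (also pulls in _mm_sfence)
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of the class library): writes two doubles
// to p with a non-temporal store.
inline void stream_two_doubles(double* p, double lo, double hi) {
    // The destination must be 16-byte aligned.
    assert(reinterpret_cast<std::uintptr_t>(p) % 16 == 0);

    // _mm_set_pd takes the high element first, so lo lands in p[0].
    const __m128d v = _mm_set_pd(hi, lo);

    _mm_stream_pd(p, v);  // non-temporal store of both values
    _mm_sfence();         // order the streaming store before later memory operations
}
```

A caller would typically declare the destination with alignas(16), for example alignas(16) double buf[2]; followed by stream_two_doubles(buf, 1.5, 2.5);.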

Stores the four single-precision floating-point values of A to the address p using a non-temporal hint, which minimizes cache pollution. The address p must be 16-byte aligned.

void store_nta(float *p, F32vec4 A);
Corresponding intrinsic: _mm_stream_ps
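
Non-temporal stores are most often used when writing a large buffer that will not be read again soon. The sketch below illustrates that pattern with the corresponding intrinsic rather than the F32vec4 class. The function name fill_streaming and the restriction that count be a multiple of four are assumptions made for the example, not part of the class library.

```cpp
#include <xmmintrin.h>  // SSE: _mm_stream_ps, _mm_set1_ps, _mm_sfence
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical helper (not part of the class library): fills dst with value
// using non-temporal stores, four floats at a time.
inline void fill_streaming(float* dst, std::size_t count, float value) {
    // Each streaming store needs a 16-byte aligned address.
    assert(reinterpret_cast<std::uintptr_t>(dst) % 16 == 0);
    // Simplifying assumption for this sketch: no scalar tail to handle.
    assert(count % 4 == 0);

    const __m128 v = _mm_set1_ps(value);  // broadcast value to all four lanes
    for (std::size_t i = 0; i < count; i += 4) {
        _mm_stream_ps(dst + i, v);        // non-temporal store of four floats
    }
    _mm_sfence();  // order the streaming stores before later memory operations
}
```

Issuing a single fence after the loop, rather than one per store, keeps the ordering cost out of the inner loop.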