About the Classes
C++ Classes and SIMD Operations
Capabilities
Details About the Libraries
Hardware and Software Requirements
Arithmetic Operators
Cacheability Support Operations
Classes Quick Reference
Compare Operators
Conditional Select Operators for Fvec Classes
Constructors and Initialization
Conversions
Debugging
Fvec Notation Conventions
Load and Store Operators
Logical Operations
Minimum and Maximum Operators
Move Mask Operator
Overview: Floating-point Vector Classes
Programming Example
Unpack Operators for Fvec Operators
Addition and Subtraction Operators
Assignment Operator
Clear MMX(TM) State Operator
Comparison Operators
Conditional Select Operators
Conversions between Fvec and Ivec
Debug
Integer Intrinsics for Streaming SIMD Extensions
Logical Operators
Multiplication Operators
Overview: Integer Vector Classes
Pack Operator
Rules for Operators
Shift Operators
Terms, Conventions, and Syntax Defined
Unpack Operators
Controlling Inline Expansion of User Functions
Criteria for Inline Function Expansion
Complex Functions
Exponential Functions
Hyperbolic Functions
Intel math library
Miscellaneous Functions
Nearest Integer Functions
Remainder Functions
Special Functions
Trigonometric Functions
Intrinsics Cross-processor Implementation
Intrinsics for Implementation Across All IA
MMX(TM) Technology Intrinsics Implementation
Streaming SIMD Extensions 2 Intrinsics Implementation
Streaming SIMD Extensions Implementation
Alignment Support
Allocating and Freeing Aligned Memory Blocks
Overview: Data Alignment, Memory Allocation Intrinsics, and Inline Assembly
Conversion Intrinsics
Intrinsics for Itanium(R) Instructions
Load and Store
Lock and Atomic Operation Related Intrinsics
Multimedia Additions
Native Intrinsics for Itanium(R) Instructions
Operating System Related Intrinsics
Register Names for getReg library function and setReg library function
Floating-point Vector Intrinsics
Integer Vector Intrinsics
Macro Functions
Miscellaneous Intrinsics
Overview: New IA-32 Intrinsics
MMX(TM) Technology Compare Intrinsics
MMX(TM) Technology General Support Intrinsics
MMX(TM) Technology Intrinsics on Itanium Architecture
MMX(TM) Technology Logical Intrinsics
MMX(TM) Technology Packed Arithmetic Intrinsics
MMX(TM) Technology Set Intrinsics
MMX(TM) Technology Shift Intrinsics
Arithmetic Operations for the Streaming SIMD Extensions
Cacheability Support Using Streaming SIMD Extensions
Comparisons for the Streaming SIMD Extensions
Conversion Operations for the Streaming SIMD Extensions
Floating-point Intrinsics for Streaming SIMD Extensions
Integer Intrinsics Using Streaming SIMD Extensions
Load Operations for the Streaming SIMD Extensions
Logical Operations for the Streaming SIMD Extensions
Macro Function for Matrix Transposition
Macro Function for Shuffle Using Streaming SIMD Extensions
Macro Functions to Read and Write the Control Registers
Memory and Initialization Using Streaming SIMD Extensions
Miscellaneous Intrinsics Using Streaming SIMD Extensions
Overview: Streaming SIMD Extensions
Set Operations for Streaming SIMD Extensions
Store Operations for Streaming SIMD Extensions
Using Streaming SIMD Extensions on Itanium(R) Architecture
Comparison Operations for Streaming SIMD Extensions 2
Conversion Operations for Streaming SIMD Extensions 2
Floating-point Arithmetic Operations for Streaming SIMD Extensions 2
Load Operations for Streaming SIMD Extensions 2
Logical Operations for Streaming SIMD Extensions 2
Miscellaneous Operations for Streaming SIMD Extensions 2
Set Operations for Streaming SIMD Extensions 2
Store Operations for Streaming SIMD Extensions 2
Streaming SIMD Extensions 2 Floating-point Memory and Initialization Operations
Cacheability Support Operations for Streaming SIMD Extensions 2
Conversion Operations for Streaming SIMD Extensions 2
Integer Arithmetic Operations for Streaming SIMD Extensions 2
Integer Comparison Operations for Streaming SIMD Extensions 2
Integer Load Operations for Streaming SIMD Extensions 2
Integer Logical Operations for Streaming SIMD Extensions 2
Integer Memory and Initialization for Streaming SIMD Extensions 2
Integer Set Operations for Streaming SIMD Extensions 2
Integer Shift Operations for Streaming SIMD Extensions 2
Integer Store Operations for the Streaming SIMD Extensions 2
Macro Function for Shuffle
Miscellaneous Operations for Streaming SIMD Extensions 2
Auto-parallelization: Enabling, Options, and Environment Variables
OpenMP* Environment Variables
Auto-parallelization: Enabling, Options, and Environment Variables
OpenMP* Environment Variables
Compiling with OpenMP*, Directive Format, and Diagnostics
Example Function
Examples of OpenMP* Usage
Intel Extensions
OpenMP* Directives and Clauses
OpenMP* Environment Variables
OpenMP* Runtime Library Routines
OpenMP* Support Libraries
Overview: Intel Workqueuing Model
Overview: Parallelization with OpenMP*
Parallel Processing Thread Model
Floating-point Arithmetic Options for Itanum(R)-based Systems
Floating-point Arithmetic Precision
Auto CPU Dispatch
Combining Processor Optimization and Auto CPU Dispatch (IA-32 only)
Processor Optimization (IA-32 only)
Processor-specific Optimization (IA-32 only)
Absence of Loop-carried Memory Dependency with IVDEP Directive
Loop Transformations
Loop Unrolling
Overview: High-level Language Optimization
Analyzing the Effects of Multifile IPO
Compilation with Real Object Files
Creating a Multifile IPO Executable
Creating a Multifile IPO Executable with xild
Criteria for Inline Function Expansion
Interprocedural Optimization Options
Overview: Interprocedural Optimizations
Overview: Multifile IPO
Using -ip or -ipo with -Qoption Specifiers
Auto-parallelization Threshold Control and Diagnostics
Auto-parallelization: Enabling, Options, and Environment Variables
Compiling with OpenMP*, Directive Format, and Diagnostics
Example Function
Examples of OpenMP* Usage
Intel Extensions
OpenMP* Directives and Clauses
OpenMP* Environment Variables
OpenMP* Runtime Library Routines
OpenMP* Support Libraries
Overview: Auto-parallelization
Overview: Intel Workqueuing Model
Overview: Parallel Programming
Overview: Parallelization with OpenMP*
Parallel Processing Thread Model
Programming with Auto-parallelization
Basic PGO Options
Code-coverage Tool
Dumping and Resetting Profile Information
Dumping Profile Information
Environment Variable
Example of Profile-guided Optimization
Interval Profile Dumping
Overview: Profile-guided Optimizations
PGO API Support Overview
PGO Environment Variables
Profile-guided Optimizations Methodology
Resetting the Dynamic Profile Counters
Test-prioritization Tool
Using profmerge to Relocate the Source Files
Data Dependence
Language Support and Directives
Loop Constructs
Loop Exit Conditions
Loop Interchange and Subscripts: Matrix Multiply
Loop Parallelization and Vectorization
Overview: Vectorization
Some Vectorization Examples
Statements in the Loop Body
Stripmining and Cleanup
Types of Loops Vectorized
Vectorization Key Programming Guidelines
Vectorizer Options