Intel Compiler Options
Linux* and Mac OS* X
Windows*
Description
-O3
/O3
Enables aggressive optimization for code speed. Recommended for code with loops that perform substantial calculations or process large data sets.
Enables -O2 (Linux and Mac OS X) or /O2 (Windows) optimizations, as well as more aggressive optimizations, including prefetching, scalar replacement, cache blocking, and loop and memory access transformations.
As compared to -O2 (Linux) or /O2 (Windows), the optimizations enabled by this option often result in faster program execution, but can slow down code execution in some cases. Using this option may result in longer compilation times.
Use this option when loops perform a large number of floating-point calculations or process large data sets.
-O2 (or -O)
/O2
Affects code speed. This is the default option; the compiler uses this optimization level if you do not specify anything.
This option enables the following capabilities for performance gain: inlining intrinsic functions, constant propagation, copy propagation, dead-code elimination, global register allocation, global instruction scheduling and control speculation, loop unrolling, optimized code selection, partial redundancy elimination, strength reduction/induction variable simplification, variable renaming, exception handling optimizations, tail recursions, peephole optimizations, structure assignment lowering optimizations, and dead store elimination.
For IA-32 and Intel 64 architectures:
Enables certain optimizations for speed, such as vectorization.
-O1
/O1
Affects code size and locality. Disables specific optimizations.
-fast
/fast
Enables a collection of common, recommended optimizations for run-time performance. Can introduce architecture dependency.
-O0
/Od
Disables optimization. Use this for rapid compilation while debugging an application.
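The option pairs in the table above can be captured in a small lookup for build scripts that target both platforms. The following is a minimal sketch, not an official tool: the helper names `to_windows` and `build_command` and the source file `kernel.c` are hypothetical, and the driver names `icc` (Linux) and `icl` (Windows) are assumptions not stated in this document.

```python
# Option pairs copied from the table above:
# Linux / Mac OS X spelling -> Windows spelling.
LINUX_TO_WINDOWS = {
    "-O3": "/O3",
    "-O2": "/O2",
    "-O": "/O2",      # the table lists -O as an alternative spelling of -O2
    "-O1": "/O1",
    "-fast": "/fast",
    "-O0": "/Od",     # note: the Windows spelling is /Od, not /O0
}


def to_windows(options):
    """Translate Linux/Mac OS X options to their Windows spelling.

    Raises KeyError for any option that is not in the table above.
    """
    return [LINUX_TO_WINDOWS[opt] for opt in options]


def build_command(driver, options, sources):
    """Assemble a command line as a list of arguments (hypothetical helper)."""
    return [driver] + list(options) + list(sources)


if __name__ == "__main__":
    linux_opts = ["-O3"]
    # Driver names and file name are assumptions for illustration only.
    print(build_command("icc", linux_opts, ["kernel.c"]))
    print(build_command("icl", to_windows(linux_opts), ["kernel.c"]))
```

Because -O2 is the default level according to the table, a script like this only needs to pass an optimization option when it wants something other than the default, for example `-O0` / `/Od` for fast debug builds.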
Linux OS and Mac OS X
Windows OS
Description
-axSSE4.2
/QaxSSE4.2
Can generate Intel® SSE4 Efficient Accelerated String and Text Processing instructions supported by Intel® Core™ i7 processors. Can generate Intel® SSE4 Vectorizing Compiler and Media Accelerator, Intel® SSSE3, SSE3, SSE2, and SSE instructions and it can optimize for the Intel® Core™ processor family.
-axSSE4.1
/QaxSSE4.1
Can generate Intel® SSE4 Vectorizing Compiler and Media Accelerator instructions for Intel processors. Can generate Intel® SSSE3, SSE3, SSE2, and SSE instructions and it can optimize for Intel® 45nm Hi-k next generation Intel® Core™ microarchitecture. This replaces value S, which is deprecated.
Mac OS X: IA-32 and Intel® 64 architectures.
-axSSSE3
/QaxSSSE3
Can generate Intel® SSSE3, SSE3, SSE2, and SSE instructions for Intel processors and it can optimize for the Intel® Core™2 Duo processor family. This replaces value T, which is deprecated.
Mac OS X: IA-32 architecture.
-axSSE3
/QaxSSE3
Can generate Intel® SSE3, SSE2, and SSE instructions for Intel processors and it can optimize for processors based on Intel® Core™ microarchitecture and Intel NetBurst® microarchitecture. This replaces value P, which is deprecated.
Mac OS X: IA-32 architecture.
-axSSE2
/QaxSSE2
Can generate Intel® SSE2 and SSE instructions for Intel processors, and it can optimize for Intel® Pentium® 4 processors, Intel® Pentium® M processors, and Intel® Xeon® processors with Intel® SSE2.
Linux and Windows: IA-32 architecture
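The rows above follow a regular pattern: the Windows option is the Linux/Mac OS X value with `/Qax` in place of `-ax`, and each value can generate its own instruction set plus every older one listed before it. The sketch below encodes that pattern under stated assumptions: the function names `ax_option` and `may_generate` are hypothetical, and `SSE4.1` and `SSE4.2` are used as shorthand for the "Vectorizing Compiler and Media Accelerator" and "Efficient Accelerated String and Text Processing" instruction groups named in the descriptions.

```python
# -ax values from the table above, ordered from oldest to newest instruction set.
AX_LEVELS = ["SSE2", "SSE3", "SSSE3", "SSE4.1", "SSE4.2"]


def ax_option(level):
    """Return the (Linux/Mac OS X, Windows) spelling for one -ax value."""
    if level not in AX_LEVELS:
        raise ValueError("unknown -ax value: " + level)
    return ("-ax" + level, "/Qax" + level)


def may_generate(level):
    """List the instruction sets the table says a given -ax value can generate.

    Every row includes plain SSE, then all levels up to and including `level`.
    """
    index = AX_LEVELS.index(level)  # raises ValueError for unknown values
    return ["SSE"] + AX_LEVELS[: index + 1]


if __name__ == "__main__":
    for level in AX_LEVELS:
        linux_opt, windows_opt = ax_option(level)
        print(linux_opt, windows_opt, may_generate(level))
```

This only models the option names and the instruction sets listed in the descriptions; it does not model the per-platform architecture notes (for example, the rows marked "Mac OS X: IA-32 architecture").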