CPU2006 Result Flag Description

Compilers: IBM XL C/C++ Advanced Edition for Linux V9.0 and XL Fortran Advanced Edition for Linux V11.1

Base Optimization Flags

C benchmarks

- -O5
- COPTIMIZE
- Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option.
  
  -O5 is equivalent to the following flags
  - -O4
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- Includes:
  - -O4
    - -O3
      
      -O2
      
      -O
      
      -qhot=level=0
    - -qipa=level=1
    - -qarch=auto
    - -qtune=auto
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- -qnoenablevmx
- COPTIMIZE
- Disables generation of vector instructions for processors that support them.
- -lhugetlbfs
- EXTRA_CLIBS
- Link with libhugetlbfs.so. This enables heap to be backed by the 16 Megabyte pages.

C++ benchmarks

- -O5
- CXXOPTIMIZE
- Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option.
  
  -O5 is equivalent to the following flags
  - -O4
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- Includes:
  - -O4
    - -O3
      
      -O2
      
      -O
      
      -qhot=level=0
    - -qipa=level=1
    - -qarch=auto
    - -qtune=auto
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- -qrtti
- CXXOPTIMIZE
- Cause the C++ compiler to generate Run Time Type Identification code for exception handling and for use by the typeid and dynamic_cast operators.
- -qnoenablevmx
- CXXOPTIMIZE
- Disables generation of vector instructions for processors that support them.
- -qstaticlink
- CXXOPTIMIZE
- Controls how shared and non-shared runtime libraries are linked into an application. When -qstaticlink is in effect, the compiler links only static libraries with the object file named in the invocation. When -qnostaticlink is in effect, the compiler links shared libraries with the object file named in the invocation. This option provides the ability to specify linking rules that are equivalent to those implied by the GNU options -static, -static-libgcc, and -shared-libgcc, used singly and in combination.

Fortran benchmarks

- -O5
- FOPTIMIZE
- Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option.
  
  -O5 is equivalent to the following flags
  - -O4
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- Includes:
  - -O4
    - -O3
      
      -O2
      
      -O
      
      -qhot=level=0
    - -qipa=level=1
    - -qarch=auto
    - -qtune=auto
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- -qsmallstack=dynlenonheap
- FOPTIMIZE
- Causes the Fortran compiler to allocate dynamic arrays on the heap instead of the stack

-qalias=nostd
FOPTIMIZE

 qalias=ansi | noansi
   If ansi is specified, type-based aliasing is
   used during optimization, which restricts the
   lvalues that can be safely used to access a
   data object. The default is ansi for the xlc,
   xlC, and c89 commands. This option has no
   effect unless you also specify the -O option.

 qalias=std |nostd
   Indicates whether the compilation units contain
   any non-standard aliasing (see Compiler Reference
   for more information). If so, specify nostd.

- -qnoenablevmx
- FOPTIMIZE
- Disables generation of vector instructions for processors that support them.
- -B/usr/share/libhugetlbfs/
- FOPTIMIZE
- Determines substitute path names for XL Fortran executables such as the compiler, assembler, linker, and preprocessor. It can be used in combination with the -t option, which determines which of these components are affected by -B.

-tl
FOPTIMIZE

Applies the prefix specified by the -B option to the designated components.

Parameter	Description	Executable name
a	Assembler	as
b	Low-level optimizer	xlfcode
c	Compiler front end	xlfentry
d	Disassembler	dis
F	C preprocessor	cpp
h	Array language optimizer	xlfhot
I	High-level optimizer, compile step	ipa
l	Linker	ld
z	Binder	bolt

- -Wl,--hugetlbfs-link=BDT
- FOPTIMIZE
- Pass the --hugetlbfs-link=BDT flag to the linker so that the text, initialized data, and BSS segments of the application are backed by hugepages.

Benchmarks using both Fortran and C

- -O5
- COPTIMIZE, FOPTIMIZE
- Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option.
  
  -O5 is equivalent to the following flags
  - -O4
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- Includes:
  - -O4
    - -O3
      
      -O2
      
      -O
      
      -qhot=level=0
    - -qipa=level=1
    - -qarch=auto
    - -qtune=auto
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- -qnoenablevmx
- COPTIMIZE, FOPTIMIZE
- Disables generation of vector instructions for processors that support them.
- -qsmallstack=dynlenonheap
- FOPTIMIZE
- Causes the Fortran compiler to allocate dynamic arrays on the heap instead of the stack

-qalias=nostd
FOPTIMIZE

 qalias=ansi | noansi
   If ansi is specified, type-based aliasing is
   used during optimization, which restricts the
   lvalues that can be safely used to access a
   data object. The default is ansi for the xlc,
   xlC, and c89 commands. This option has no
   effect unless you also specify the -O option.

 qalias=std |nostd
   Indicates whether the compilation units contain
   any non-standard aliasing (see Compiler Reference
   for more information). If so, specify nostd.

- -B/usr/share/libhugetlbfs/
- FOPTIMIZE
- Determines substitute path names for XL Fortran executables such as the compiler, assembler, linker, and preprocessor. It can be used in combination with the -t option, which determines which of these components are affected by -B.

-tl
FOPTIMIZE

Applies the prefix specified by the -B option to the designated components.

Parameter	Description	Executable name
a	Assembler	as
b	Low-level optimizer	xlfcode
c	Compiler front end	xlfentry
d	Disassembler	dis
F	C preprocessor	cpp
h	Array language optimizer	xlfhot
I	High-level optimizer, compile step	ipa
l	Linker	ld
z	Binder	bolt

- -Wl,--hugetlbfs-link=BDT
- FOPTIMIZE
- Pass the --hugetlbfs-link=BDT flag to the linker so that the text, initialized data, and BSS segments of the application are backed by hugepages.

Peak Optimization Flags

C benchmarks

433.milc

- -qpdf1
- PASS1_CFLAGS, PASS1_LDFLAGS
- The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations.
- -qpdf2
- PASS2_CFLAGS, PASS2_LDFLAGS
- The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization.
- -O5
- OPTIMIZE
- Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option.
  
  -O5 is equivalent to the following flags
  - -O4
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- Includes:
  - -O4
    - -O3
      
      -O2
      
      -O
      
      -qhot=level=0
    - -qipa=level=1
    - -qarch=auto
    - -qtune=auto
  - -qipa=level=2
  - -qarch=auto
  - -qtune=auto
- -qnoenablevmx
- OPTIMIZE
- Disables generation of vector instructions for processors that support them.
- -B/usr/share/libhugetlbfs/
- OPTIMIZE
- Determines substitute path names for XL Fortran executables such as the compiler, assembler, linker, and preprocessor. It can be used in combination with the -t option, which determines which of these components are affected by -B.

-tl
OPTIMIZE

Applies the prefix specified by the -B option to the designated components.

Parameter	Description	Executable name
a	Assembler	as
b	Low-level optimizer	xlfcode
c	Compiler front end	xlfentry
d	Disassembler	dis
F	C preprocessor	cpp
h	Array language optimizer	xlfhot
I	High-level optimizer, compile step	ipa
l	Linker	ld
z	Binder	bolt

- -Wl,--hugetlbfs-link=BDT
- OPTIMIZE
- Pass the --hugetlbfs-link=BDT flag to the linker so that the text, initialized data, and BSS segments of the application are backed by hugepages.

470.lbm

- -O3
- OPTIMIZE
- Performs additional optimizations that are memory intensive, compile-time intensive, and may change the semantics of the program slightly, unless -qstrict is specified. We recommend these optimizations when the desire for run-time speed improvements outweighs the concern for limiting compile-time resources.
  
  -O3 is equivalent to the following flags
  - -O2
  - -qhot=level=0
- Includes:
  - -O2
    - -O
      
      -O2
  - -qhot=level=0
- -qarch=pwr6e
- OPTIMIZE
- Produces object code containing instructions that will run on the specified processors. "auto" selects the processor the complile is being done on. "pwr5x" is the POWER5+ processor.
  Supported values for this flag are
  - auto
  - pwr6e
  - pwr6
  - pwr5x
  - pwr5
  - pwr4
  - ppc970
- -qtune=pwr6
- OPTIMIZE
- Specifies the architecture system for which the executable program is optimized. This includes instruction scheduling and cache setting. The supported values for suboption are:

-B/usr/share/libhugetlbfs/ OPTIMIZE Determines substitute path names for XL Fortran executables such as the compiler, assembler, linker, and preprocessor. It can be used in combination with the -t option, which determines which of these components are affected by -B. -tl OPTIMIZE Applies the prefix specified by the -B option to the designated components. Parameter Description Executable name a Assembler as b Low-level optimizer xlfcode c Compiler front end xlfentry d Disassembler dis F C preprocessor cpp h Array language optimizer xlfhot I High-level optimizer, compile step ipa l Linker ld z Binder bolt -Wl,--hugetlbfs-link=BDT OPTIMIZE Pass the --hugetlbfs-link=BDT flag to the linker so that the text, initialized data, and BSS segments of the application are backed by hugepages. -q64 COPTIMIZE Generates 64 bit ABI binaries. The default is to generate 32 bit binaries.

482.sphinx3 -Wl,-q LDCFLAGS Pass the -q flag to the linker causing the final executable to have the relocation information. -qpdf1 PASS1_CFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_CFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O4 OPTIMIZE Perform optimizations for maximum performance. This includes interprocedural analysis on all of the objects presented on the "link" step. -O4 is equivalent to the following flags -O3 -qipa=level=1 -qarch=auto -qtune=auto Includes: -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -lhugetlbfs EXTRA_LIBS Link with libhugetlbfs.so. This enables heap to be backed by the 16 Megabyte pages. C++ benchmarks 444.namd -qpdf1 PASS1_CXXFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_CXXFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O3 OPTIMIZE Performs additional optimizations that are memory intensive, compile-time intensive, and may change the semantics of the program slightly, unless -qstrict is specified. We recommend these optimizations when the desire for run-time speed improvements outweighs the concern for limiting compile-time resources. -O3 is equivalent to the following flags -O2 -qhot=level=0 Includes: -O2 -O -O2 -qhot=level=0 -qarch=pwr6e OPTIMIZE Produces object code containing instructions that will run on the specified processors. "auto" selects the processor the complile is being done on. "pwr5x" is the POWER5+ processor. Supported values for this flag are auto Use the processor on which the program is compiled. pwr6e The POWER6 processor in "Enhanced" mode based systems. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -qtune=pwr6 OPTIMIZE Specifies the architecture system for which the executable program is optimized. This includes instruction scheduling and cache setting. The supported values for suboption are: auto Use the processor on which the program is compiled. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. 447.dealII -O5 OPTIMIZE Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option. -O5 is equivalent to the following flags -O4 -qipa=level=2 -qarch=auto -qtune=auto Includes: -O4 -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -qipa=level=2 -qarch=auto -qtune=auto -qrtti OPTIMIZE Cause the C++ compiler to generate Run Time Type Identification code for exception handling and for use by the typeid and dynamic_cast operators. -qnoenablevmx OPTIMIZE Disables generation of vector instructions for processors that support them. -qstaticlink OPTIMIZE Controls how shared and non-shared runtime libraries are linked into an application. When -qstaticlink is in effect, the compiler links only static libraries with the object file named in the invocation. When -qnostaticlink is in effect, the compiler links shared libraries with the object file named in the invocation. This option provides the ability to specify linking rules that are equivalent to those implied by the GNU options -static, -static-libgcc, and -shared-libgcc, used singly and in combination. -Wl,--whole-archive /usr/lib/libsmartheap.a EXTRA_LIBS Instructs the linker to include every object file in /usr/lib/libhugetlbfs.a, rather than searching the library for the required object files. -Wl,--no-whole-archive EXTRA_LIBS Turn off the effect of the --whole-archive flag. 450.soplex -qpdf1 PASS1_CXXFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_CXXFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O4 OPTIMIZE Perform optimizations for maximum performance. This includes interprocedural analysis on all of the objects presented on the "link" step. -O4 is equivalent to the following flags -O3 -qipa=level=1 -qarch=auto -qtune=auto Includes: -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -qstrict OPTIMIZE Turns off aggressive optimizations which have the potential to alter the semantics of your program. -qstrict sets -qfloat=nofltint:norsqrt. -qnostrict sets -qfloat=rsqrt. This option is only valid with -O2 or higher optimization levels. Default: o -qnostrict at -O3 or higher. o -qstrict otherwise. -lhugetlbfs EXTRA_LIBS Link with libhugetlbfs.so. This enables heap to be backed by the 16 Megabyte pages. 453.povray -qpdf1 PASS1_CXXFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_CXXFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O5 OPTIMIZE Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option. -O5 is equivalent to the following flags -O4 -qipa=level=2 -qarch=auto -qtune=auto Includes: -O4 -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -qipa=level=2 -qarch=auto -qtune=auto -lsmartheap EXTRA_LIBS Link with MicroQuill's SmartHeap (32-bit) library for Linux on POWER. This is a library that optimizes calls to new, delete, malloc and free. Fortran benchmarks 410.bwaves -O5 OPTIMIZE Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option. -O5 is equivalent to the following flags -O4 -qipa=level=2 -qarch=auto -qtune=auto Includes: -O4 -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -qipa=level=2 -qarch=auto -qtune=auto -qsmallstack=dynlenonheap OPTIMIZE Causes the Fortran compiler to allocate dynamic arrays on the heap instead of the stack -lhugetlbfs EXTRA_LIBS Link with libhugetlbfs.so. This enables heap to be backed by the 16 Megabyte pages. 416.gamess -qpdf1 PASS1_FFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_FFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O5 OPTIMIZE Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option. -O5 is equivalent to the following flags -O4 -qipa=level=2 -qarch=auto -qtune=auto Includes: -O4 -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -qipa=level=2 -qarch=auto -qtune=auto -qalias=nostd OPTIMIZE qalias=ansi | noansi If ansi is specified, type-based aliasing is used during optimization, which restricts the lvalues that can be safely used to access a data object. The default is ansi for the xlc, xlC, and c89 commands. This option has no effect unless you also specify the -O option. qalias=std |nostd Indicates whether the compilation units contain any non-standard aliasing (see Compiler Reference for more information). If so, specify nostd. -qnoenablevmx OPTIMIZE Disables generation of vector instructions for processors that support them. 434.zeusmp -qpdf1 PASS1_FFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_FFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O3 OPTIMIZE Performs additional optimizations that are memory intensive, compile-time intensive, and may change the semantics of the program slightly, unless -qstrict is specified. We recommend these optimizations when the desire for run-time speed improvements outweighs the concern for limiting compile-time resources. -O3 is equivalent to the following flags -O2 -qhot=level=0 Includes: -O2 -O -O2 -qhot=level=0 -qarch=pwr6e OPTIMIZE Produces object code containing instructions that will run on the specified processors. "auto" selects the processor the complile is being done on. "pwr5x" is the POWER5+ processor. Supported values for this flag are auto Use the processor on which the program is compiled. pwr6e The POWER6 processor in "Enhanced" mode based systems. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -qtune=pwr6 OPTIMIZE Specifies the architecture system for which the executable program is optimized. This includes instruction scheduling and cache setting. The supported values for suboption are: auto Use the processor on which the program is compiled. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -qxlf90=nosignedzero OPTIMIZE -qxlf90= Determines whether the compiler provides the Fortran 90 or the Fortran 95 level of support for certain aspects of the language. can be one of the following: signedzero | nosignedzero Determines how the SIGN(A,B) function handles signed real 0.0. In addition, determines whether negative internal values will be prefixed with a minus when formatted output would produce a negative sign zero. autodealloc | noautodealloc Determines whether the compiler deallocates allocatable arrays that are declared locally without either the SAVE or the STATIC attribute and have a status of currently allocated when the subprogram terminates. oldpad | nooldpad When the PAD=specifier is present in the INQUIRE statement, specifying -qxlf90=nooldpad returns UNDEFINED when there is no connection, or when the connection is for unformatted I/O. This behavior conforms with the Fortran 95 standard and above. Specifying -qxlf90=oldpad preserves the Fortran 90 behavior. Default: o signedzero, autodealloc and nooldpad for the xlf95, xlf95_r, xlf95_r7 and f95 invocation commands. o nosignedzero, noautodealloc and oldpad for all other invocation commands. -B/usr/share/libhugetlbfs/ OPTIMIZE Determines substitute path names for XL Fortran executables such as the compiler, assembler, linker, and preprocessor. It can be used in combination with the -t option, which determines which of these components are affected by -B. -tl OPTIMIZE Applies the prefix specified by the -B option to the designated components. Parameter Description Executable name a Assembler as b Low-level optimizer xlfcode c Compiler front end xlfentry d Disassembler dis F C preprocessor cpp h Array language optimizer xlfhot I High-level optimizer, compile step ipa l Linker ld z Binder bolt -Wl,--hugetlbfs-link=BDT OPTIMIZE Pass the --hugetlbfs-link=BDT flag to the linker so that the text, initialized data, and BSS segments of the application are backed by hugepages. 437.leslie3d -O3 OPTIMIZE Performs additional optimizations that are memory intensive, compile-time intensive, and may change the semantics of the program slightly, unless -qstrict is specified. We recommend these optimizations when the desire for run-time speed improvements outweighs the concern for limiting compile-time resources. -O3 is equivalent to the following flags -O2 -qhot=level=0 Includes: -O2 -O -O2 -qhot=level=0 -qarch=pwr6e OPTIMIZE Produces object code containing instructions that will run on the specified processors. "auto" selects the processor the complile is being done on. "pwr5x" is the POWER5+ processor. Supported values for this flag are auto Use the processor on which the program is compiled. pwr6e The POWER6 processor in "Enhanced" mode based systems. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -qtune=pwr6 OPTIMIZE Specifies the architecture system for which the executable program is optimized. This includes instruction scheduling and cache setting. The supported values for suboption are: auto Use the processor on which the program is compiled. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -B/usr/share/libhugetlbfs/ OPTIMIZE Determines substitute path names for XL Fortran executables such as the compiler, assembler, linker, and preprocessor. It can be used in combination with the -t option, which determines which of these components are affected by -B. -tl OPTIMIZE Applies the prefix specified by the -B option to the designated components. Parameter Description Executable name a Assembler as b Low-level optimizer xlfcode c Compiler front end xlfentry d Disassembler dis F C preprocessor cpp h Array language optimizer xlfhot I High-level optimizer, compile step ipa l Linker ld z Binder bolt -Wl,--hugetlbfs-link=BDT OPTIMIZE Pass the --hugetlbfs-link=BDT flag to the linker so that the text, initialized data, and BSS segments of the application are backed by hugepages. -q64 FOPTIMIZE Generates 64 bit ABI binaries. The default is to generate 32 bit binaries. 459.GemsFDTD -qpdf1 PASS1_FFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_FFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O5 OPTIMIZE Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option. -O5 is equivalent to the following flags -O4 -qipa=level=2 -qarch=auto -qtune=auto Includes: -O4 -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -qipa=level=2 -qarch=auto -qtune=auto -B/usr/share/libhugetlbfs/ OPTIMIZE Determines substitute path names for XL Fortran executables such as the compiler, assembler, linker, and preprocessor. It can be used in combination with the -t option, which determines which of these components are affected by -B. -tl OPTIMIZE Applies the prefix specified by the -B option to the designated components. Parameter Description Executable name a Assembler as b Low-level optimizer xlfcode c Compiler front end xlfentry d Disassembler dis F C preprocessor cpp h Array language optimizer xlfhot I High-level optimizer, compile step ipa l Linker ld z Binder bolt -Wl,--hugetlbfs-link=BDT OPTIMIZE Pass the --hugetlbfs-link=BDT flag to the linker so that the text, initialized data, and BSS segments of the application are backed by hugepages. -q64 FOPTIMIZE Generates 64 bit ABI binaries. The default is to generate 32 bit binaries. 465.tonto -qpdf1 PASS1_FFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_FFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O5 OPTIMIZE Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option. -O5 is equivalent to the following flags -O4 -qipa=level=2 -qarch=auto -qtune=auto Includes: -O4 -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -qipa=level=2 -qarch=auto -qtune=auto -lessl EXTRA_LIBS Link the Engineering and Scientifc Subroutine Library (ESSL), libessl.so. ESSL is a collection of subroutines providing a wide range of performance-tuned mathematical functions for many common scientific and engineering applications. The mathematical subroutines are divided into nine computational areas: Linear Algebra Subprograms Matrix Operations Linear Algebraic Equations Eigensystem Analysis Fourier Transforms, Convolutions, Correlations and Related Computations Sorting and Searching Interpolation Numerical Quadrature Random Number Generation -lsmartheap EXTRA_LIBS Link with MicroQuill's SmartHeap (32-bit) library for Linux on POWER. This is a library that optimizes calls to new, delete, malloc and free. -lxlf90_r EXTRA_LIBS Link the Fortran runtime library libxlf90_r.so which is required by libessl.so. Benchmarks using both Fortran and C 435.gromacs -Wl,-q LDFFLAGS Pass the -q flag to the linker causing the final executable to have the relocation information. -O2 OPTIMIZE Performs a set of optimizations that are intended to offer improved performance without an unreasonable increase in time or storage that is required for compilation. Includes: -O -O2 -O -qarch=pwr6e OPTIMIZE Produces object code containing instructions that will run on the specified processors. "auto" selects the processor the complile is being done on. "pwr5x" is the POWER5+ processor. Supported values for this flag are auto Use the processor on which the program is compiled. pwr6e The POWER6 processor in "Enhanced" mode based systems. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -qtune=pwr6 OPTIMIZE Specifies the architecture system for which the executable program is optimized. This includes instruction scheduling and cache setting. The supported values for suboption are: auto Use the processor on which the program is compiled. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -lhugetlbfs EXTRA_LIBS Link with libhugetlbfs.so. This enables heap to be backed by the 16 Megabyte pages. 436.cactusADM -Wl,-q LDFFLAGS Pass the -q flag to the linker causing the final executable to have the relocation information. -qpdf1 PASS1_CFLAGS, PASS1_FFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_CFLAGS, PASS2_FFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O2 OPTIMIZE Performs a set of optimizations that are intended to offer improved performance without an unreasonable increase in time or storage that is required for compilation. Includes: -O -O2 -O -qarch=pwr6e OPTIMIZE Produces object code containing instructions that will run on the specified processors. "auto" selects the processor the complile is being done on. "pwr5x" is the POWER5+ processor. Supported values for this flag are auto Use the processor on which the program is compiled. pwr6e The POWER6 processor in "Enhanced" mode based systems. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -qtune=pwr6 OPTIMIZE Specifies the architecture system for which the executable program is optimized. This includes instruction scheduling and cache setting. The supported values for suboption are: auto Use the processor on which the program is compiled. pwr6 The POWER6 processor based systems. pwr5x The POWER5+ processor based systems. pwr5 The POWER5 processor based systems. pwr4 The POWER4 processor based systems. ppc970 The PPC970 processor based systems. -lhugetlbfs EXTRA_LIBS Link with libhugetlbfs.so. This enables heap to be backed by the 16 Megabyte pages. 454.calculix -qpdf1 PASS1_CFLAGS, PASS1_FFLAGS, PASS1_LDFLAGS The option used in the first pass of a profile directed feedback compile that causes pdf information to be generated. The profile directed feedback optimization gathers data on both exectuion path and data values. It does not use hardware counters, nor gather any data other than path and data values for PDF specific optimizations. -qpdf2 PASS2_CFLAGS, PASS2_FFLAGS, PASS2_LDFLAGS The option used in the second pass of a profile directed feedback compile that causes PDF information to be utilized during optimization. -O4 OPTIMIZE Perform optimizations for maximum performance. This includes interprocedural analysis on all of the objects presented on the "link" step. -O4 is equivalent to the following flags -O3 -qipa=level=1 -qarch=auto -qtune=auto Includes: -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -B/usr/share/libhugetlbfs/ OPTIMIZE Determines substitute path names for XL Fortran executables such as the compiler, assembler, linker, and preprocessor. It can be used in combination with the -t option, which determines which of these components are affected by -B. -tl OPTIMIZE Applies the prefix specified by the -B option to the designated components. Parameter Description Executable name a Assembler as b Low-level optimizer xlfcode c Compiler front end xlfentry d Disassembler dis F C preprocessor cpp h Array language optimizer xlfhot I High-level optimizer, compile step ipa l Linker ld z Binder bolt -Wl,--hugetlbfs-link=BDT OPTIMIZE Pass the --hugetlbfs-link=BDT flag to the linker so that the text, initialized data, and BSS segments of the application are backed by hugepages. 481.wrf -O5 OPTIMIZE Perform optimizations for maximum performance. This includes maximum interprocedural analysis on all of the objects presented on the "link" step. This level of optimization will increase the compiler's memory usage and compile time requirements. -O5 Provides all of the functionality of the -O4 option, but also provides the functionality of the -qipa=level=2 option. -O5 is equivalent to the following flags -O4 -qipa=level=2 -qarch=auto -qtune=auto Includes: -O4 -O3 -O2 -O -qhot=level=0 -qipa=level=1 -qarch=auto -qtune=auto -qipa=level=2 -qarch=auto -qtune=auto -qnoenablevmx OPTIMIZE Disables generation of vector instructions for processors that support them. -qalias=nostd FOPTIMIZE qalias=ansi | noansi If ansi is specified, type-based aliasing is used during optimization, which restricts the lvalues that can be safely used to access a data object. The default is ansi for the xlc, xlC, and c89 commands. This option has no effect unless you also specify the -O option. qalias=std |nostd Indicates whether the compilation units contain any non-standard aliasing (see Compiler Reference for more information). If so, specify nostd. -lhugetlbfs EXTRA_LIBS Link with libhugetlbfs.so. This enables heap to be backed by the 16 Megabyte pages.

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.

CPU2006 Flag DescriptionIBM Corporation IBM System p 570 (4.7 GHz, 1 core, RedHat)

Base Compiler Invocation

Peak Compiler Invocation

Base Portability Flags

Peak Portability Flags

Base Optimization Flags

Peak Optimization Flags

Base Other Flags

Peak Other Flags

Implicitly Included Flags

CPU2006 Flag Description
IBM Corporation IBM System p 570 (4.7 GHz, 1 core, RedHat)