hpc2021 Result Flag Description

Test sponsored by Lenovo Global Technology

Base Compiler Invocation

C benchmarks

- mpicc
- CC, LD
- The OpenMPI C driver configured for use with the NVIDIA HPC C compiler (nvc).

C++ benchmarks

- mpicxx
- CXX, LD
- The OpenMPI C++ driver configured for use with the NVIDIA HPC C++ compiler (nvc++).

Fortran benchmarks

- mpif90
- FC, LD
- The OpenMPI Fortran driver configured for use with the NVIDIA HPC Fortran compiler (nvfortran).

Base Portability Flags

521.miniswp_t

- -DUSE_KBA
- BENCH_CFLAGS
- Uses KBA to configure benchmark. FIXME.
- -DUSE_ACCELDIR
- BENCH_CFLAGS
- Uses ACCELDIR to configure benchmark. FIXME.

532.sph_exa_t

- -DSPEC_USE_LT_IN_KERNELS
- BENCH_CXXFLAGS
- Uses lookup tables for critical functions instead of computing them in-place using math functions.
- --c++17
- mpicxx
- CXXPORTABILITY
- Use C++ 17 language features.

Base Optimization Flags

C benchmarks

- -Mfprelaxed
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- Includes:
- -Mnouniform
- mpicc, mpicxx,mpif90
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -Mstack_arrays
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Place automatic arrays on the stack.
- -fast
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Chooses generally optimal flags for the target platform.
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Mlre
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
- -acc=gpu
- mpicc,mpicxx,mpif90
- OPTIMIZE
- Enable OpenACC directives targeting NVIDIA GPUs
- -DSPEC_ACCEL_AWARE_MPI
- OPTIMIZE
- Definition of this macro indicates that the MPI implementation supports accelerator device-to-device transfers. Used in conjuction when using OpenACC or OpenMP w/ target offload.

C++ benchmarks

- -Mfprelaxed
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- Includes:
- -Mnouniform
- mpicc, mpicxx,mpif90
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -Mstack_arrays
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Place automatic arrays on the stack.
- -fast
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Chooses generally optimal flags for the target platform.
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Mlre
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
- -acc=gpu
- mpicc,mpicxx,mpif90
- OPTIMIZE
- Enable OpenACC directives targeting NVIDIA GPUs
- -DSPEC_ACCEL_AWARE_MPI
- OPTIMIZE
- Definition of this macro indicates that the MPI implementation supports accelerator device-to-device transfers. Used in conjuction when using OpenACC or OpenMP w/ target offload.

Fortran benchmarks

- -DSPEC_ACCEL_AWARE_MPI
- OPTIMIZE
- Definition of this macro indicates that the MPI implementation supports accelerator device-to-device transfers. Used in conjuction when using OpenACC or OpenMP w/ target offload.
- -Mfprelaxed
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Instructs the compiler to use relaxed precision in the calculation of some intrinsic functions. Can result in improved performance at the expense of numerical accuracy.
- Includes:
- -Mnouniform
- mpicc, mpicxx,mpif90
- OPTIMIZE
- The numerical method used when computing the residual iterations of a vectorized (SIMD) loop may be different than used in the vectorized loop. Using this option may lead for fast but less numerically consistent results.
- -Mstack_arrays
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Place automatic arrays on the stack.
- -fast
- mpicc, mpicxx,mpif90
- OPTIMIZE
- Chooses generally optimal flags for the target platform.
- Includes:
  - -O2
    - -O1
  - -Munroll=c:1
    - -Munroll
  - -Mautoinline
  - -Mlre
  - -Mvect=sse
    - -Mvect
      
      -Mvect=assoc
      
      -Mvect=altcode
  - -Mcache_align
  - -Mflushz
- -acc=gpu
- mpicc,mpicxx,mpif90
- OPTIMIZE
- Enable OpenACC directives targeting NVIDIA GPUs

Base Other Flags

C benchmarks

- -w
- mpicc, mpicxx, mpif90
- OPTIMIZE
- Disable warning messages.

C++ benchmarks

- -w
- mpicc, mpicxx, mpif90
- OPTIMIZE
- Disable warning messages.

Fortran benchmarks

- -w
- mpicc, mpicxx, mpif90
- OPTIMIZE
- Disable warning messages.

Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.

For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact info@spec.org
Copyright 2021-2023 Standard Performance Evaluation Corporation
Tested with SPEC hpc2021 v1.0.1.
Report generated on 2023-08-25 18:57:04 by SPEC hpc2021 flags formatter v1.0.3 .

hpc2021 Flag Description

Test sponsored by Lenovo Global Technology

Compilers: PGI Accelerator Fortran/C/C++ Server

Operating systems: Linux

Base Compiler Invocation

C benchmarks

C++ benchmarks

Fortran benchmarks

Base Portability Flags

521.miniswp_t

532.sph_exa_t

Base Optimization Flags

C benchmarks

C++ benchmarks

Fortran benchmarks

Base Other Flags

C benchmarks

C++ benchmarks

Fortran benchmarks

Implicitly Included Flags

	Indicates that the flag description came from the user flags file.
	Indicates that the flag description came from the suite-wide flags file.
	Indicates that the flag description came from a per-benchmark flags file.