CPU2017 Flag Description
Compal Electronics, Inc. SR230-2 (Intel Xeon 6787P)

Test sponsored by Compal Inc.

Copyright © 2016 Intel Corporation. All Rights Reserved.


Base Compiler Invocation

C benchmarks

C++ benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

Benchmarks using both C and C++

Benchmarks using Fortran, C, and C++


Peak Compiler Invocation

C benchmarks

C++ benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

Benchmarks using both C and C++

Benchmarks using Fortran, C, and C++


Base Portability Flags

503.bwaves_r

507.cactuBSSN_r

508.namd_r

510.parest_r

511.povray_r

519.lbm_r

521.wrf_r

526.blender_r

527.cam4_r

538.imagick_r

544.nab_r

549.fotonik3d_r

554.roms_r


Peak Portability Flags

503.bwaves_r

507.cactuBSSN_r

508.namd_r

510.parest_r

511.povray_r

519.lbm_r

521.wrf_r

526.blender_r

527.cam4_r

538.imagick_r

544.nab_r

549.fotonik3d_r

554.roms_r


Base Optimization Flags

C benchmarks

C++ benchmarks

Fortran benchmarks

Benchmarks using both Fortran and C

Benchmarks using both C and C++

Benchmarks using Fortran, C, and C++


Peak Optimization Flags

C benchmarks

519.lbm_r

538.imagick_r

544.nab_r

C++ benchmarks

508.namd_r

510.parest_r

Fortran benchmarks

503.bwaves_r

549.fotonik3d_r

554.roms_r

Benchmarks using both Fortran and C

Benchmarks using both C and C++

511.povray_r

526.blender_r

Benchmarks using Fortran, C, and C++


Implicitly Included Flags

This section contains descriptions of flags that were included implicitly by other flags, but which do not have a permanent home at SPEC.


Commands and Options Used to Submit Benchmark Runs

submit= MYMASK=`printf '0x%x' $((1<<$SPECCOPYNUM))`; /usr/bin/taskset $MYMASK $command
When running multiple copies of benchmarks, the SPEC config file feature submit is used to cause individual jobs to be bound to specific processors. This specific submit command, using taskset, is used for Linux64 systems without numactl.
Here is a brief guide to understanding the specific command which will be found in the config file:
submit= numactl --localalloc --physcpubind=$SPECCOPYNUM $command
When running multiple copies of benchmarks, the SPEC config file feature submit is used to cause individual jobs to be bound to specific processors. This specific submit command is used for Linux64 systems with support for numactl.
Here is a brief guide to understanding the specific command which will be found in the config file:

Shell, Environment, and Other Software Settings

numactl --interleave=all "runspec command"
Launching a process with numactl --interleave=all sets the memory interleave policy so that memory will be allocated using round robin on nodes. When memory cannot be allocated on the current interleave target fall back to other nodes.
KMP_STACKSIZE
Specify stack size to be allocated for each thread.
KMP_AFFINITY
Syntax: KMP_AFFINITY=[<modifier>,...]<type>[,<permute>][,<offset>]
The value for the environment variable KMP_AFFINITY affects how the threads from an auto-parallelized program are scheduled across processors.
It applies to binaries built with -qopenmp and -parallel (Linux and Mac OS X) or /Qopenmp and /Qparallel (Windows).
modifier:
    granularity=fine Causes each OpenMP thread to be bound to a single thread context.
type:
    compact Specifying compact assigns the OpenMP thread <n>+1 to a free thread context as close as possible to the thread context where the <n> OpenMP thread was placed.
    scatter Specifying scatter distributes the threads as evenly as possible across the entire system.
permute: The permute specifier is an integer value controls which levels are most significant when sorting the machine topology map. A value for permute forces the mappings to make the specified number of most significant levels of the sort the least significant, and it inverts the order of significance.
offset: The offset specifier indicates the starting position for thread assignment.

Please see the Thread Affinity Interface article in the Intel Composer XE Documentation for more details.

Example: KMP_AFFINITY=granularity=fine,scatter
Specifying granularity=fine selects the finest granularity level and causes each OpenMP or auto-par thread to be bound to a single thread context.
This ensures that there is only one thread per core on cores supporting HyperThreading Technology
Specifying scatter distributes the threads as evenly as possible across the entire system.
Hence a combination of these two options, will spread the threads evenly across sockets, with one thread per physical core.

Example: KMP_AFFINITY=compact,1,0
Specifying compact will assign the n+1 thread to a free thread context as close as possible to thread n.
A default granularity=core is implied if no granularity is explicitly specified.
Specifying 1,0 sets permute and offset values of the thread assignment.
With a permute value of 1, thread n+1 is assigned to a consecutive core. With an offset of 0, the process's first thread 0 will be assigned to thread 0.
The same behavior is exhibited in a multisocket system.
OMP_NUM_THREADS
Sets the maximum number of threads to use for OpenMP* parallel regions if no other value is specified in the application. This environment variable applies to both -qopenmp and -parallel (Linux and Mac OS X) or /Qopenmp and /Qparallel (Windows). Example syntax on a Linux system with 8 cores: export OMP_NUM_THREADS=8
OMP_STACKSIZE
The OMP_STACKSIZE environment variable controls the size of the stack for threads created by the OpenMP implementation
Set stack size to unlimited
The command "ulimit -s unlimited" is used to set the stack size limit to unlimited.
Free the file system page cache
The command "echo 3> /proc/sys/vm/drop_caches" is used to free up the filesystem page cache as well as reclaimable slab objects like dentries and inodes.
MALLOC_CONF
Used for Jemalloc tuning at runtime. MALLOC_CONF=retain:true will retain unused virtual memory for later resue rather than discarding it.

Red Hat Specific features

Transparent Huge Pages
On RedHat EL 6 and later, Transparent Hugepages increase the memory page size from 4 kilobytes to 2 megabytes. Transparent Hugepages provide significant performance advantages on systems with highly contended resources and large memory workloads. If memory utilization is too high or memory is badly fragmented which prevents hugepages being allocated, the kernel will assign smaller 4k pages instead.
Hugepages are used by default unless the /sys/kernel/mm/redhat_transparent_hugepage/enabled field is changed from its RedHat EL6 default of 'always'.

Operating System Tuning Parameters

OS Tuning

ulimit:

is a command used to set or check user limits on system resources such as memory, CPU, and the number of open files.Below are common usages of ulimit:

irqbalance:

irqbalance is a Linux background service that distributes hardware interrupts across multiple CPU cores to prevent overloading a single core and improve system performance.

Performance Governors (Linux):

are one of Linux's CPU frequency scaling mechanisms, used to determine how the CPU frequency should be managed.Simply put, it controls "how fast the CPU should run under different conditions."Common CPU governors include::

--governor,-g :

When set to performance, the CPU will always operate at its maximum frequency to deliver the highest computing performance.This will improve overall system performance.

Many companies execute the following command when conducting system performance testing to ensure that the CPU operates at its maximum frequency.

tuned-adm:

is a command-line tool used to manage performance tuning settings on Linux systems. It allows users to select predefined tuning profiles that automatically adjust CPU, power saving, I/O, and network parameters according to the system’s intended usage, optimizing either performance or energy efficiency.The following four are the most commonly used parameters:


Firmware / BIOS / Microcode Settings

Enable LP [Global] (Default = ALL LPs):

Enable LP [Global] to represent the number of logical processors (LP). LP is a term that is common for cloud instances to represent a vCPU.Values for this BIOS option can be:

LLC dead line alloc (Default = Enable):

It is a BIOS setting that controls whether a CPU cache line, when it is about to be written back to memory (i.e., a dead-line), is still allowed to be allocated into the LLC (Last Level Cache, typically L3 Cache).Values for this BIOS option can be:

Energy Efficient Turbo (Default = Enable):

is a technology that allows the processor to automatically boost its frequency based on workload while maintaining energy efficiency.Values for this BIOS option can be:

Enhanced Halt State (C1E) (Default = Enable):

is a mechanism that allows the processor to automatically reduce voltage and frequency during idle periods to enter a low-power state, while still enabling a quick return to full operation to balance power saving and performance.Values for this BIOS option can be:

LLC Prefetch (Default = Disable):

The LLC prefetcher is a feature that allows the processor, under a non-inclusive cache architecture, to prefetch data directly into the Last Level Cache (LLC) to improve memory access efficiency, though in some cases, disabling it may actually enhance performance.Values for this BIOS option can be:

AMP Prefetch (Default = Enable):

The AMP prefetcher is a mechanism that predicts future memory access patterns based on delta sequences between cache accesses and prefetches data into the mid-level cache (MLC), enhancing data retrieval efficiency without issuing LLC prefetches.Values for this BIOS option can be:

Patrol Scrub (Default = Enable at End of POST):

is a background memory scanning and correction mechanism. It periodically and proactively scans each bit of system memory to detect correctable ECC errors, and attempts to automatically repair them when such errors are found.Values for this BIOS option can be:

Homeless Prefetch (Default = Auto):

is a mechanism that allows data to be prefetched into the L2 cache when cache resources are limited to reduce latency, but it may impose a burden on uncore resources under certain workloads.Values for this BIOS option can be:

DCU Streamer Prefetcher (Default = Enable):

is an L1 data cache prefetcher. Recommended default setting is Enabled. In some cases, setting this option to disabled can improve performance. Values for this BIOS option can be:

Package C State (Default = Auto):

The system automatically manages C-state transitions based on workload and platform policies.Values for this BIOS option can be:

ENERGY_PERF_BIAS_CFG mode (Default = Balanced Performance):

It is a hint provided by the processor to the BIOS or operating system to guide the trade-off between power efficiency and performance.Values for this BIOS option can be:

Virtual Numa (Default = Disable):

Divide physical NUMA nodes into evenly sized virtual NUMA nodes in ACPI table. This may improve Windows performance on CPUs with more than 64 logical processors.Values for this BIOS option can be:

Hardware P-States (Default = Native Mode):

(HWP) are the key components of HWPM which deals with P-state control. HWP works by freeing the operating system from making direct frequency decisions.Values for this BIOS option can be:

Hardware Prefetcher(Default=Enable):

Hardware Prefetcher: (a.k.a. MLC Streamer Prefetcher) is an L2 cache prefetcher. Recommended default setting is Enabled. In some cases, setting this option to disabled can improve performance. Values for this BIOS option can be:

Adjacent Cache Prefetch(Default=Enable):

Adjacent Cache Prefetch: (a.k.a. MLC Spatial Prefetcher) is an L2 cache prefetcher. Recommended default setting is Enabled. In some cases, setting this option to disabled can improve performance. Values for this BIOS option can be:

DCU IP Prefetcher(Default=Enable):

DCU IP Prefetcher: the DCU Instruction Pointer (IP) prefetcher is an L1 cache prefetcher. Recommended default setting is Enabled. In some cases, setting this option to disabled can improve performance. Values for this BIOS option can be:

Turbo Mode(Default=Enable):

Turbo Boost Technology requires a PC with a processor with Intel Turbo Boost Technology capability. Intel Turbo Boost Technology performance varies depending on hardware, software, and overall system configuration.Values for this BIOS option can be:

Power Performance Tuning(Default=OS Controls EPB):

Power Performance Tuning is a BIOS setting that determines whether the processor’s performance and power efficiency preferences are managed by the operating system, BIOS, or an external controller.Values for this BIOS option can be:

Performance Mode(Default=Balanced):

It is a common setting in server BIOS used to control the system’s overall performance optimization strategy and power behavior.Values for this BIOS option can be:

Latency Optimized Mode(Default=Disable):

The highest performance mode (default on previous generations) where core and uncore frequencies are running up to their maximum limits within the RAPL budget. This mode is not performance-per-Watt optimized across the load line. Values for this BIOS option can be:

NUMA(Default=Enable):

Each CPU (or each group of CPUs) has its own "local memory," which can be accessed faster. Accessing "remote memory" across CPUs is slower. When NUMA is enabled, the operating system can arrange thread execution and memory allocation based on the "physical locality" of CPUs and memory, reducing latency and improving performance. Values for this BIOS option can be:

Opportunistic LLC to SF Migration(Default=Disable):

When data is present in the LLC (Last Level Cache, such as L3 Cache), the system may opportunistically migrate it to the Snoop Filter (SF) to optimize and accelerate inter-core data sharing.Values for this BIOS option can be:

Last updated May 7, 2025.


Flag description origin markings:

[user] Indicates that the flag description came from the user flags file.
[suite] Indicates that the flag description came from the suite-wide flags file.
[benchmark] Indicates that the flag description came from a per-benchmark flags file.

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2017/flags/Compal-Platform-Flags-Linux-Intel_V1.0.html,
http://www.spec.org/cpu2017/flags/Intel-ic2024-official-linux64.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2017/flags/Compal-Platform-Flags-Linux-Intel_V1.0.xml,
http://www.spec.org/cpu2017/flags/Intel-ic2024-official-linux64.xml.


For questions about the meanings of these flags, please contact the tester.
For other inquiries, please contact info@spec.org
Copyright 2017-2025 Standard Performance Evaluation Corporation
Tested with SPEC CPU2017 v1.1.9.
Report generated on 2025-06-02 14:36:45 by SPEC CPU2017 flags formatter v5178.