SPEC CPU(R)2017 Floating Point Rate Result

Inspur Corporation
Inspur OB3160A5 (AMD EPYC 7513)

CPU2017 License: 3358                    Test date:             Aug-2021
Test sponsor:    Inspur Corporation      Hardware availability: Jun-2021
Tested by:       Inspur Corporation      Software availability: Mar-2021

                   Base      Base      Base      Peak      Peak      Peak
Benchmarks       Copies  Run Time      Rate    Copies  Run Time      Rate
--------------- ------- --------- ---------   ------- --------- ---------
503.bwaves_r         64      1973       325 *      16       448       358 S
503.bwaves_r         64      1970       326 S      16       448       358 *
503.bwaves_r         64      1976       325 S      16       448       358 S
507.cactuBSSN_r      64       316       256 S      64       301       269 S
507.cactuBSSN_r      64       315       258 *      64       304       266 *
507.cactuBSSN_r      64       310       261 S      64       307       264 S
508.namd_r           64       379       160 S      64       369       165 S
508.namd_r           64       379       161 *      64       372       163 *
508.namd_r           64       374       163 S      64       374       163 S
510.parest_r         64      1265       132 *      64      1272       132 *
510.parest_r         64      1272       132 S      64      1285       130 S
510.parest_r         64      1264       132 S      64      1262       133 S
511.povray_r         64       624       240 S      64       614       244 S
511.povray_r         64       620       241 *      64       607       246 S
511.povray_r         64       607       246 S      64       613       244 *
519.lbm_r            64       835      80.8 S      32       411      82.0 S
519.lbm_r            64       837      80.6 S      32       412      81.9 *
519.lbm_r            64       836      80.7 *      32       414      81.5 S
521.wrf_r            64       869       165 S      32       323       222 *
521.wrf_r            64       869       165 *      32       326       220 S
521.wrf_r            64       861       167 S      32       318       225 S
526.blender_r        64       404       241 *      64       399       244 S
526.blender_r        64       405       241 S      64       400       244 *
526.blender_r        64       396       246 S      64       402       242 S
527.cam4_r           64       506       221 S      64       491       228 S
527.cam4_r           64       507       221 *      64       497       225 S
527.cam4_r           64       508       220 S      64       493       227 *
538.imagick_r        64       180       884 *      64       182       876 S
538.imagick_r        64       181       880 S      64       180       883 *
538.imagick_r        64       177       900 S      64       179       889 S
544.nab_r            64       332       325 *      64       326       331 S
544.nab_r            64       332       325 S      64       326       331 *
544.nab_r            64       350       308 S      64       326       330 S
549.fotonik3d_r      64      2238       111 S      32      1095       114 S
549.fotonik3d_r      64      2237       111 *      32      1095       114 S
549.fotonik3d_r      64      2236       112 S      32      1095       114 *
554.roms_r           64      1347      75.5 S      32       548      92.8 S
554.roms_r           64      1326      76.7 *      32       554      91.8 *
554.roms_r           64      1318      77.2 S      32       558      91.1 S
===========================================================================
503.bwaves_r         64      1973       325 *      16       448       358 *
507.cactuBSSN_r      64       315       258 *      64       304       266 *
508.namd_r           64       379       161 *      64       372       163 *
510.parest_r         64      1265       132 *      64      1272       132 *
511.povray_r         64       620       241 *      64       613       244 *
519.lbm_r            64       836      80.7 *      32       412      81.9 *
521.wrf_r            64       869       165 *      32       323       222 *
526.blender_r        64       404       241 *      64       400       244 *
527.cam4_r           64       507       221 *      64       493       227 *
538.imagick_r        64       180       884 *      64       180       883 *
544.nab_r            64       332       325 *      64       326       331 *
549.fotonik3d_r      64      2237       111 *      32      1095       114 *
554.roms_r           64      1326      76.7 *      32       554      91.8 *
 SPECrate(R)2017_fp_base            199
 SPECrate(R)2017_fp_peak            210


HARDWARE
--------
            CPU Name: AMD EPYC 7513
             Max MHz: 3650
             Nominal: 2600
             Enabled: 32 cores, 1 chip, 2 threads/core
           Orderable: 1 chip
            Cache L1: 32 KB I + 32 KB D on chip per core
                  L2: 512 KB I+D on chip per core
                  L3: 128 MB I+D on chip per chip, 32 MB shared / 8 cores
               Other: None
              Memory: 256 GB (8 x 32 GB 2Rx4 PC4-3200AA-R)
             Storage: 1 x 480 GB SATA SSD
               Other: None


SOFTWARE
--------
                  OS: Red Hat Enterprise Linux release 8.2 (Ootpa)
                      4.18.0-193.el8.x86_64
            Compiler: C/C++/Fortran: Version 3.0.0 of AOCC
            Parallel: No
            Firmware: Version 3.05.00 released Aug-2021
         File System: xfs
        System State: Run level 3 (multi-user)
       Base Pointers: 64-bit
       Peak Pointers: 64-bit
               Other: jemalloc: jemalloc memory allocator library v5.2.0
    Power Management: BIOS and OS set to prefer performance at the cost
                      of additional power usage.


Compiler Notes
--------------
The AMD64 AOCC Compiler Suite is available at
http://developer.amd.com/amd-aocc/


Submit Notes
------------
The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.
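
The two overall metrics in the results table can be cross-checked against the selected (*) per-benchmark rates, since SPEC CPU2017 overall metrics are geometric means of the per-benchmark results. The following is a minimal Python sketch, not part of the SPEC tools: the names BASE_RATES, PEAK_RATES and geometric_mean are illustrative, and the inputs are the rounded rates printed in this report, so the output is only expected to land close to the reported values.

```python
import math

# Selected ("*") rates copied from the summary rows of the results table.
# These are the rounded values as printed, not the tool's internal values.
BASE_RATES = {
    "503.bwaves_r": 325, "507.cactuBSSN_r": 258, "508.namd_r": 161,
    "510.parest_r": 132, "511.povray_r": 241, "519.lbm_r": 80.7,
    "521.wrf_r": 165, "526.blender_r": 241, "527.cam4_r": 221,
    "538.imagick_r": 884, "544.nab_r": 325, "549.fotonik3d_r": 111,
    "554.roms_r": 76.7,
}
PEAK_RATES = {
    "503.bwaves_r": 358, "507.cactuBSSN_r": 266, "508.namd_r": 163,
    "510.parest_r": 132, "511.povray_r": 244, "519.lbm_r": 81.9,
    "521.wrf_r": 222, "526.blender_r": 244, "527.cam4_r": 227,
    "538.imagick_r": 883, "544.nab_r": 331, "549.fotonik3d_r": 114,
    "554.roms_r": 91.8,
}


def geometric_mean(values):
    """Return the geometric mean of an iterable of positive numbers."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


base_estimate = geometric_mean(BASE_RATES.values())
peak_estimate = geometric_mean(PEAK_RATES.values())
print(f"estimated fp_base: {base_estimate:.1f}")
print(f"estimated fp_peak: {peak_estimate:.1f}")
```

Both estimates should fall within about one point of the reported 199 and 210; small differences are expected because the printed rates are already rounded.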


Operating System Notes
----------------------
'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152' was used to set environment locked pages in memory limit

runcpu command invoked through numactl i.e.:
numactl --interleave=all runcpu

Set dirty_ratio=8 to limit dirty cache to 8% of memory
Set swappiness=1 to swap only if necessary
Set zone_reclaim_mode=1 to free local node memory and avoid remote memory
sync then drop_caches=3 to reset caches before invoking runcpu

ASLR is disabled to reduce run-to-run issues.

dirty_ratio, swappiness, zone_reclaim_mode, drop_caches and ASLR were
all set using privileged echo (e.g. echo 1 > /proc/sys/vm/swappiness).

Transparent huge pages set to 'always' for Int Peak, FP Base/Peak (OS default)
Transparent huge pages set to 'madvise' for Int Base


Environment Variables Notes
---------------------------
Environment variables set by runcpu before the start of the run:
LD_LIBRARY_PATH =
     "/home/CPU2017/amd_rate_aocc300_milan_A_lib/64;/home/CPU2017/amd_rate_aocc300_milan_A_lib/32:"
MALLOC_CONF = "retain:true"


General Notes
-------------
Binaries were compiled on a system with 2x AMD EPYC 7742 CPU + 512GiB Memory using OpenSUSE 15.2

NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown)
is mitigated in the system as tested and documented.
Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1)
is mitigated in the system as tested and documented.
Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2)
is mitigated in the system as tested and documented.

jemalloc: configured and built with GCC v9.1.0 in Ubuntu 19.04 with -O3 -znver2 -flto
jemalloc 5.2.0 is available here:
https://github.com/jemalloc/jemalloc/releases/download/5.2.0/jemalloc-5.2.0.tar.bz2


Platform Notes
--------------
BIOS configuration:
NUMA nodes per socket set to NPS4
DRAM Scrub time set to Disable
cTDP Control set to Manual
cTDP set to 240
APBDIS set to 1

Sysinfo program /home/CPU2017/bin/sysinfo
Rev: r6622 of 2021-04-07 982a61ec0915b55891ef0e16acafc64d
running on localhost.localdomain Wed Jan 8 21:35:02 2020

SUT (System Under Test) info as seen by some common utilities.
For more information on this section, see
   https://www.spec.org/cpu2017/Docs/config.html#sysinfo

From /proc/cpuinfo
   model name : AMD EPYC 7513 32-Core Processor
      1  "physical id"s (chips)
      64 "processors"
   cores, siblings (Caution: counting these is hw and system dependent. The following
   excerpts from /proc/cpuinfo might not be reliable. Use with caution.)
      cpu cores : 32
      siblings  : 64
      physical 0: cores 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
      25 26 27 28 29 30 31

From lscpu from util-linux 2.32.1:
     Architecture:        x86_64
     CPU op-mode(s):      32-bit, 64-bit
     Byte Order:          Little Endian
     CPU(s):              64
     On-line CPU(s) list: 0-63
     Thread(s) per core:  2
     Core(s) per socket:  32
     Socket(s):           1
     NUMA node(s):        4
     Vendor ID:           AuthenticAMD
     CPU family:          25
     Model:               1
     Model name:          AMD EPYC 7513 32-Core Processor
     Stepping:            1
     CPU MHz:             2977.790
     CPU max MHz:         2600.0000
     CPU min MHz:         1500.0000
     BogoMIPS:            5190.25
     Virtualization:      AMD-V
     L1d cache:           32K
     L1i cache:           32K
     L2 cache:            512K
     L3 cache:            32768K
     NUMA node0 CPU(s):   0-7,32-39
     NUMA node1 CPU(s):   8-15,40-47
     NUMA node2 CPU(s):   16-23,48-55
     NUMA node3 CPU(s):   24-31,56-63
     Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
     pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp
     lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni
     pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx
     f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse
     3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext
     perfctr_llc mwaitx cpb cat_l3 cdp_l3 invpcid_single hw_pstate sme ssbd mba sev
     ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a
     rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc
     cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr wbnoinvd arat
     npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists
     pausefilter pfthreshold v_vmsave_vmload vgif umip pku ospke vaes vpclmulqdq
     rdpid overflow_recov succor smca

/proc/cpuinfo cache data
   cache size : 512 KB

From numactl --hardware
WARNING: a numactl 'node' might or might not correspond to a physical chip.
  available: 4 nodes (0-3)
  node 0 cpus: 0 1 2 3 4 5 6 7 32 33 34 35 36 37 38 39
  node 0 size: 64295 MB
  node 0 free: 64051 MB
  node 1 cpus: 8 9 10 11 12 13 14 15 40 41 42 43 44 45 46 47
  node 1 size: 64507 MB
  node 1 free: 64271 MB
  node 2 cpus: 16 17 18 19 20 21 22 23 48 49 50 51 52 53 54 55
  node 2 size: 64507 MB
  node 2 free: 64171 MB
  node 3 cpus: 24 25 26 27 28 29 30 31 56 57 58 59 60 61 62 63
  node 3 size: 64495 MB
  node 3 free: 64202 MB
  node distances:
  node   0   1   2   3
    0:  10  12  12  12
    1:  12  10  12  12
    2:  12  12  10  12
    3:  12  12  12  10

From /proc/meminfo
   MemTotal:       263992480 kB
   HugePages_Total:       0
   Hugepagesize:       2048 kB

/sbin/tuned-adm active
    Current active profile: throughput-performance

/sys/devices/system/cpu/cpu*/cpufreq/scaling_governor has
    performance

From /etc/*release* /etc/*version*
   os-release:
      NAME="Red Hat Enterprise Linux"
      VERSION="8.2 (Ootpa)"
      ID="rhel"
      ID_LIKE="fedora"
      VERSION_ID="8.2"
      PLATFORM_ID="platform:el8"
      PRETTY_NAME="Red Hat Enterprise Linux 8.2 (Ootpa)"
      ANSI_COLOR="0;31"
   redhat-release: Red Hat Enterprise Linux release 8.2 (Ootpa)
   system-release: Red Hat Enterprise Linux release 8.2 (Ootpa)
   system-release-cpe: cpe:/o:redhat:enterprise_linux:8.2:ga

uname -a:
   Linux localhost.localdomain 4.18.0-193.el8.x86_64 #1 SMP Fri Mar 27 14:35:58 UTC 2020
   x86_64 x86_64 x86_64 GNU/Linux

Kernel self-reported vulnerability status:

CVE-2018-12207 (iTLB Multihit):                       Not affected
CVE-2018-3620 (L1 Terminal Fault):                    Not affected
Microarchitectural Data Sampling:                     Not affected
CVE-2017-5754 (Meltdown):                             Not affected
CVE-2018-3639 (Speculative Store Bypass):             Mitigation: Speculative Store
                                                      Bypass disabled via prctl and
                                                      seccomp
CVE-2017-5753 (Spectre variant 1):                    Mitigation: usercopy/swapgs
                                                      barriers and __user pointer
                                                      sanitization
CVE-2017-5715 (Spectre variant 2):                    Mitigation: Full AMD retpoline,
                                                      IBPB: conditional, IBRS_FW,
                                                      STIBP: conditional, RSB filling
CVE-2020-0543 (Special Register Buffer Data Sampling): No status reported
CVE-2019-11135 (TSX Asynchronous Abort):              Not affected

run-level 3 Jan 8 13:08

SPEC is set to: /home/CPU2017
   Filesystem            Type  Size  Used Avail Use% Mounted on
   /dev/mapper/rhel-home xfs   392G   15G  378G   4% /home

From /sys/devices/virtual/dmi/id
    Vendor:         Inspur
    Product:        IIMS
    Product Family: Not specified
    Serial:         produceSN

Additional information from dmidecode 3.2 follows. WARNING: Use caution when you
interpret this section. The 'dmidecode' program reads system data which is "intended to
allow hardware to be accurately determined", but the intent may not be met, as there are
frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard.
  Memory:
    8x Samsung M393A4G43AB3-CWE 32 GB 2 rank 3200

BIOS:
   BIOS Vendor:       American Megatrends Inc.
   BIOS Version:      3.05.00
   BIOS Date:         08/04/2021
   BIOS Revision:     5.22

(End of data from sysinfo program)


Compiler Version Notes
----------------------
==============================================================================
C               | 519.lbm_r(base, peak) 538.imagick_r(base, peak)
                | 544.nab_r(base, peak)
------------------------------------------------------------------------------
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
------------------------------------------------------------------------------

==============================================================================
C++             | 508.namd_r(base, peak) 510.parest_r(base, peak)
------------------------------------------------------------------------------
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
------------------------------------------------------------------------------

==============================================================================
C++, C          | 511.povray_r(base, peak) 526.blender_r(base, peak)
------------------------------------------------------------------------------
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
------------------------------------------------------------------------------

==============================================================================
C++, C, Fortran | 507.cactuBSSN_r(base, peak)
------------------------------------------------------------------------------
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
------------------------------------------------------------------------------

==============================================================================
Fortran         | 503.bwaves_r(base, peak) 549.fotonik3d_r(base, peak)
                | 554.roms_r(base, peak)
------------------------------------------------------------------------------
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
------------------------------------------------------------------------------

==============================================================================
Fortran, C      | 521.wrf_r(base, peak) 527.cam4_r(base, peak)
------------------------------------------------------------------------------
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
AMD clang version 12.0.0 (CLANG: AOCC_3.0.0-Build#78 2020_12_10) (based on
  LLVM Mirror.Version.12.0.0)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/AMD/aocc-compiler-3.0.0/bin
------------------------------------------------------------------------------
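
The NUMA layout reported by lscpu in the Platform Notes (four nodes under NPS4, each listed as a CPU range such as 0-7,32-39) can be sanity-checked by expanding the ranges and confirming that every one of the 64 logical CPUs appears exactly once. This is a small Python sketch for illustration only; expand_cpu_list and NODE_CPUS are hypothetical names, and the range strings are copied from the lscpu output in this report.

```python
# CPU lists per NUMA node, copied from the "NUMA nodeN CPU(s)" lines of lscpu.
NODE_CPUS = {
    0: "0-7,32-39",
    1: "8-15,40-47",
    2: "16-23,48-55",
    3: "24-31,56-63",
}


def expand_cpu_list(spec):
    """Expand a Linux-style CPU list such as '0-7,32-39' into a list of ints."""
    cpus = []
    for part in spec.split(","):
        if "-" in part:
            first, last = part.split("-")
            cpus.extend(range(int(first), int(last) + 1))
        else:
            cpus.append(int(part))
    return cpus


expanded = {node: expand_cpu_list(spec) for node, spec in NODE_CPUS.items()}
all_cpus = sorted(cpu for cpus in expanded.values() for cpu in cpus)

for node, cpus in expanded.items():
    print(f"node {node}: {len(cpus)} logical CPUs")
print("covers CPUs 0-63 exactly once:", all_cpus == list(range(64)))
```

Each node should report 16 logical CPUs, which matches the per-node CPU lists printed by numactl --hardware.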


Base Compiler Invocation
------------------------
C benchmarks:
     clang

C++ benchmarks:
     clang++

Fortran benchmarks:
     flang

Benchmarks using both Fortran and C:
     flang clang

Benchmarks using both C and C++:
     clang++ clang

Benchmarks using Fortran, C, and C++:
     clang++ clang flang


Base Portability Flags
----------------------
    503.bwaves_r: -DSPEC_LP64
 507.cactuBSSN_r: -DSPEC_LP64
      508.namd_r: -DSPEC_LP64
    510.parest_r: -DSPEC_LP64
    511.povray_r: -DSPEC_LP64
       519.lbm_r: -DSPEC_LP64
       521.wrf_r: -DSPEC_CASE_FLAG -Mbyteswapio -DSPEC_LP64
   526.blender_r: -funsigned-char -D__BOOL_DEFINED -DSPEC_LP64
      527.cam4_r: -DSPEC_CASE_FLAG -DSPEC_LP64
   538.imagick_r: -DSPEC_LP64
       544.nab_r: -DSPEC_LP64
 549.fotonik3d_r: -DSPEC_LP64
      554.roms_r: -DSPEC_LP64


Base Optimization Flags
-----------------------
C benchmarks:
     -m64 -flto -Wl,-mllvm -Wl,-region-vectorize
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=5
     -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000
     -fremap-arrays -mllvm -function-specialize
     -flv-function-specialization -mllvm -enable-gvn-hoist
     -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -z muldefs -lamdlibm -ljemalloc
     -lflang -lflangrti

C++ benchmarks:
     -m64 -std=c++98 -mno-adx -mno-sse4a
     -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto
     -Wl,-mllvm -Wl,-region-vectorize
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math
     -march=znver3 -fveclib=AMDLIBM -mllvm -enable-partial-unswitch
     -mllvm -unroll-threshold=100 -finline-aggressive
     -flv-function-specialization -mllvm -loop-unswitch-threshold=200000
     -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch
     -mllvm -extra-vectorizer-passes -mllvm -reduce-array-computations=3
     -mllvm -global-vectorize-slp=true
     -mllvm -convert-pow-exp-to-int=false -z muldefs -lamdlibm -ljemalloc
     -lflang -lflangrti

Fortran benchmarks:
     -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-region-vectorize
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -Hz,1,0x1 -O3 -ffast-math
     -march=znver3 -fveclib=AMDLIBM -Kieee -Mrecursive
     -mllvm -fuse-tile-inner-loop -funroll-loops
     -mllvm -extra-vectorizer-passes -mllvm -lsr-in-nested-loop
     -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3
     -mllvm -global-vectorize-slp=true -z muldefs -lamdlibm -ljemalloc
     -lflang -lflangrti

Benchmarks using both Fortran and C:
     -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-region-vectorize
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=5
     -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000
     -fremap-arrays -mllvm -function-specialize
     -flv-function-specialization -mllvm -enable-gvn-hoist
     -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -Hz,1,0x1 -Kieee -Mrecursive
     -mllvm -fuse-tile-inner-loop -funroll-loops
     -mllvm -extra-vectorizer-passes -mllvm -lsr-in-nested-loop
     -z muldefs -lamdlibm -ljemalloc -lflang -lflangrti

Benchmarks using both C and C++:
     -m64 -std=c++98 -mno-adx -mno-sse4a
     -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto
     -Wl,-mllvm -Wl,-region-vectorize
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=5
     -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000
     -fremap-arrays -mllvm -function-specialize
     -flv-function-specialization -mllvm -enable-gvn-hoist
     -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -mllvm -enable-partial-unswitch
     -mllvm -unroll-threshold=100 -finline-aggressive
     -mllvm -loop-unswitch-threshold=200000 -mllvm -reroll-loops
     -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes
     -mllvm -convert-pow-exp-to-int=false -z muldefs -lamdlibm -ljemalloc
     -lflang -lflangrti

Benchmarks using Fortran, C, and C++:
     -m64 -std=c++98 -mno-adx -mno-sse4a
     -Wl,-mllvm -Wl,-x86-use-vzeroupper=false -flto
     -Wl,-mllvm -Wl,-region-vectorize
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -O3 -ffast-math
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=5
     -mllvm -unroll-threshold=50 -mllvm -inline-threshold=1000
     -fremap-arrays -mllvm -function-specialize
     -flv-function-specialization -mllvm -enable-gvn-hoist
     -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -mllvm -enable-partial-unswitch
     -mllvm -unroll-threshold=100 -finline-aggressive
     -mllvm -loop-unswitch-threshold=200000 -mllvm -reroll-loops
     -mllvm -aggressive-loop-unswitch -mllvm -extra-vectorizer-passes
     -mllvm -convert-pow-exp-to-int=false -Hz,1,0x1 -Kieee -Mrecursive
     -mllvm -fuse-tile-inner-loop -funroll-loops
     -mllvm -lsr-in-nested-loop -z muldefs -lamdlibm -ljemalloc -lflang
     -lflangrti


Base Other Flags
----------------
C benchmarks:
     -Wno-unused-command-line-argument

C++ benchmarks:
     -Wno-unused-command-line-argument

Fortran benchmarks:
     -Wno-unused-command-line-argument

Benchmarks using both Fortran and C:
     -Wno-unused-command-line-argument

Benchmarks using both C and C++:
     -Wno-unused-command-line-argument

Benchmarks using Fortran, C, and C++:
     -Wno-unused-command-line-argument


Peak Compiler Invocation
------------------------
C benchmarks:
     clang

C++ benchmarks:
     clang++

Fortran benchmarks:
     flang

Benchmarks using both Fortran and C:
     flang clang

Benchmarks using both C and C++:
     clang++ clang

Benchmarks using Fortran, C, and C++:
     clang++ clang flang


Peak Portability Flags
----------------------
Same as Base Portability Flags


Peak Optimization Flags
-----------------------
C benchmarks:

 519.lbm_r: -m64 -flto -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7
     -mllvm -unroll-threshold=50 -fremap-arrays
     -flv-function-specialization -mllvm -inline-threshold=1000
     -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true
     -mllvm -function-specialize -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -lamdlibm -ljemalloc

 538.imagick_r: Same as 519.lbm_r

 544.nab_r: -m64 -flto -Wl,-mllvm -Wl,-region-vectorize
     -Wl,-mllvm -Wl,-function-specialize -Ofast -march=znver3
     -fveclib=AMDLIBM -fstruct-layout=7 -mllvm -unroll-threshold=50
     -fremap-arrays -flv-function-specialization
     -mllvm -inline-threshold=1000 -mllvm -enable-gvn-hoist
     -mllvm -global-vectorize-slp=true -mllvm -function-specialize
     -mllvm -enable-licm-vrp -mllvm -reduce-array-computations=3
     -lamdlibm -ljemalloc

C++ benchmarks:

 508.namd_r: -m64 -std=c++98 -mno-adx -mno-sse4a
     -Wl,-mllvm -Wl,-x86-use-vzeroupper=false
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast
     -march=znver3 -fveclib=AMDLIBM -finline-aggressive
     -mllvm -unroll-threshold=100 -flv-function-specialization
     -mllvm -enable-licm-vrp -mllvm -reroll-loops
     -mllvm -aggressive-loop-unswitch
     -mllvm -reduce-array-computations=3
     -mllvm -global-vectorize-slp=true -lamdlibm -ljemalloc

 510.parest_r: -m64 -std=c++98 -mno-adx -mno-sse4a
     -Wl,-mllvm -Wl,-x86-use-vzeroupper=false
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-suppress-fmas
     -Wl,-mllvm -Wl,-function-specialize -Ofast -march=znver3
     -fveclib=AMDLIBM -finline-aggressive
     -mllvm -unroll-threshold=100 -flv-function-specialization
     -mllvm -enable-licm-vrp -mllvm -reroll-loops
     -mllvm -aggressive-loop-unswitch
     -mllvm -reduce-array-computations=3
     -mllvm -global-vectorize-slp=true -lamdlibm -ljemalloc

Fortran benchmarks:

 503.bwaves_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast
     -march=znver3 -fveclib=AMDLIBM -Kieee -Mrecursive
     -mllvm -reduce-array-computations=3
     -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp
     -lamdlibm -ljemalloc -lflang -lflangrti

 549.fotonik3d_r: Same as 503.bwaves_r

 554.roms_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast
     -march=znver3 -fveclib=AMDLIBM -Kieee -Mrecursive
     -mllvm -reduce-array-computations=3
     -mllvm -global-vectorize-slp=true -mllvm -enable-licm-vrp
     -Hz,1,0x1 -mllvm -fuse-tile-inner-loop -lamdlibm -ljemalloc
     -lflang -lflangrti

Benchmarks using both Fortran and C:

 521.wrf_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7
     -mllvm -unroll-threshold=50 -fremap-arrays
     -flv-function-specialization -mllvm -inline-threshold=1000
     -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true
     -mllvm -function-specialize -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -Kieee -Mrecursive
     -lamdlibm -ljemalloc -lflang -lflangrti

 527.cam4_r: -m64 -Wl,-mllvm -Wl,-enable-X86-prefetching
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-force-vector-interleave=1 -Ofast
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7
     -mllvm -unroll-threshold=50 -fremap-arrays
     -flv-function-specialization -mllvm -inline-threshold=1000
     -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true
     -mllvm -function-specialize -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -O3 -ffast-math
     -funroll-loops -mllvm -extra-vectorizer-passes
     -mllvm -lsr-in-nested-loop -Mrecursive -lamdlibm -ljemalloc
     -lflang -lflangrti

Benchmarks using both C and C++:
     -m64 -std=c++98 -mno-adx -mno-sse4a
     -Wl,-mllvm -Wl,-x86-use-vzeroupper=false
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7
     -mllvm -unroll-threshold=50 -fremap-arrays
     -flv-function-specialization -mllvm -inline-threshold=1000
     -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true
     -mllvm -function-specialize -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -finline-aggressive
     -mllvm -unroll-threshold=100 -mllvm -reroll-loops
     -mllvm -aggressive-loop-unswitch -lamdlibm -ljemalloc

Benchmarks using Fortran, C, and C++:
     -m64 -std=c++98 -mno-adx -mno-sse4a
     -Wl,-mllvm -Wl,-x86-use-vzeroupper=false
     -Wl,-mllvm -Wl,-enable-licm-vrp -flto
     -Wl,-mllvm -Wl,-function-specialize
     -Wl,-mllvm -Wl,-align-all-nofallthru-blocks=6
     -Wl,-mllvm -Wl,-reduce-array-computations=3 -Ofast
     -march=znver3 -fveclib=AMDLIBM -fstruct-layout=7
     -mllvm -unroll-threshold=50 -fremap-arrays
     -flv-function-specialization -mllvm -inline-threshold=1000
     -mllvm -enable-gvn-hoist -mllvm -global-vectorize-slp=true
     -mllvm -function-specialize -mllvm -enable-licm-vrp
     -mllvm -reduce-array-computations=3 -mllvm -unroll-threshold=100
     -mllvm -loop-unswitch-threshold=200000 -finline-aggressive
     -mllvm -reroll-loops -mllvm -aggressive-loop-unswitch
     -mllvm -extra-vectorizer-passes
     -mllvm -convert-pow-exp-to-int=false -Kieee -Mrecursive
     -lamdlibm -ljemalloc -lflang -lflangrti


Peak Other Flags
----------------
C benchmarks:
     -Wno-unused-command-line-argument

C++ benchmarks:
     -Wno-unused-command-line-argument

Fortran benchmarks:
     -Wno-unused-command-line-argument

Benchmarks using both Fortran and C:
     -Wno-unused-command-line-argument

Benchmarks using both C and C++:
     -Wno-unused-command-line-argument

Benchmarks using Fortran, C, and C++:
     -Wno-unused-command-line-argument


The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2017/flags/aocc300-flags-A1.html
http://www.spec.org/cpu2017/flags/Inspur-Platform-Settings-amd-V1.1.html

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2017/flags/aocc300-flags-A1.xml
http://www.spec.org/cpu2017/flags/Inspur-Platform-Settings-amd-V1.1.xml


SPEC CPU and SPECrate are registered trademarks of the Standard Performance
Evaluation Corporation. All other brand and product names appearing in this
result are trademarks or registered trademarks of their respective holders.
---------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact info@spec.org.
Copyright 2017-2021 Standard Performance Evaluation Corporation
Tested with SPEC CPU(R)2017 v1.1.8 on 2020-01-08 21:35:02-0500.
Report generated on 2021-09-14 19:19:30 by CPU2017 text formatter v6255.
Originally published on 2021-09-14.