SPEC® CPU2017 Floating Point Rate Result

Copyright 2017-2018 Standard Performance Evaluation Corporation

Lenovo Global Technology

ThinkSystem SR950
(2.50 GHz, Intel Xeon Platinum 8180M)

SPECrate2017_fp_base = 96300

SPECrate2017_fp_peak = 97800

CPU2017 License: 9017 Test Date: Nov-2017
Test Sponsor: Lenovo Global Technology Hardware Availability: Sep-2017
Tested by: Lenovo Global Technology Software Availability: Sep-2017

Benchmark result graphs are available in the PDF report.

Hardware
CPU Name: Intel Xeon Platinum 8180M
  Max MHz.: 3800
  Nominal: 2500
Enabled: 224 cores, 8 chips, 2 threads/core
Orderable: 2,4,8 chips
Cache L1: 32 KB I + 32 KB D on chip per core
  L2: 1 MB I+D on chip per core
  L3: 38.5 MB I+D on chip per chip
  Other: None
Memory: 3 TB (96 x 32 GB 2Rx4 PC4-2666V-R)
Storage: 800 GB tmpfs
Other: None
Software
OS: SUSE Linux Enterprise Server 12 SP2 (x86_64)
Kernel 4.4.21-69-default
Compiler: C/C++: Version 18.0.0.128 of Intel C/C++
Compiler for Linux;
Fortran: Version 18.0.0.128 of Intel Fortran
Compiler for Linux
Parallel: No
Firmware: Lenovo BIOS Version PSE105X 1.00 released Aug-2017
File System: tmpfs
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 64-bit
Other: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
SPECrate2017_fp_base 96300
SPECrate2017_fp_peak 97800
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
503.bwaves_r 448 2307 1950 2302 1950 2303 1950 448 2305 1950 2299 1950 2297 1960
507.cactuBSSN_r 448 604 939 603 941 603 941 448 610 930 610 930 610 929
508.namd_r 448 470 905 473 900 472 902 448 467 911 467 912 466 914
510.parest_r 448 2338 501 2356 497 2372 494 448 2370 494 2361 496 2364 496
511.povray_r 448 728 1440 727 1440 725 1440 448 607 1720 608 1720 610 1710
519.lbm_r 448 989 477 989 477 989 477 448 991 477 991 477 990 477
521.wrf_r 448 1164 862 1175 854 1175 854 448 1189 844 1188 844 1185 847
526.blender_r 448 548 1240 549 1240 550 1240 448 545 1250 546 1250 546 1250
527.cam4_r 448 684 1150 686 1140 687 1140 448 677 1160 680 1150 683 1150
538.imagick_r 448 575 1940 572 1950 572 1950 448 576 1930 573 1940 572 1950
544.nab_r 448 451 1670 446 1690 446 1690 448 437 1720 438 1720 438 1720
549.fotonik3d_r 448 2755 634 2756 634 2754 634 448 2757 633 2754 634 2755 634
554.roms_r 448 1650 432 1651 431 1644 433 448 1645 433 1641 434 1645 433

Submit Notes

 The numactl mechanism was used to bind copies to processors. The config file option 'submit'
 was used to generate numactl commands to bind each copy to a specific processor.
 For details, please see the config file.

Operating System Notes

 Stack size set to unlimited using "ulimit -s unlimited"
 Tmpfs filesystem can be set with:
  mount -t tmpfs -o size=800g tmpfs /home
 Process tuning setting:
  echo 50000     > /proc/sys/kernel/sched_cfs_bandwidth_slice_us
  echo 240000000 > /proc/sys/kernel/sched_latency_ns
  echo 5000000   > /proc/sys/kernel/sched_migration_cost_ns
  echo 100000000 > /proc/sys/kernel/sched_min_granularity_ns
  echo 150000000 > /proc/sys/kernel/sched_wakeup_granularity_ns

General Notes

Environment variables set by runcpu before the start of the run:
LD_LIBRARY_PATH = "/home/cpu2017.1.0.2.ic18.0/lib/ia32:/home/cpu2017.1.0.2.ic18.0/lib/intel64"
LD_LIBRARY_PATH = "$LD_LIBRARY_PATH:/home/cpu2017.1.0.2.ic18.0/je5.0.1-32:/home/cpu2017.1.0.2.ic18.0/je5.0.1-64"
 Binaries compiled on a system with 1x Intel Core i7-4790 CPU + 32GB RAM
 memory using Redhat Enterprise Linux 7.4
 Transparent Huge Pages enabled by default
 Prior to runcpu invocation
 Filesystem page cache synced and cleared with:
 sync; echo 3>       /proc/sys/vm/drop_caches
 runcpu command invoked through numactl i.e.:
 numactl --interleave=all runcpu <etc>

Platform Notes

BIOS configuration:
Choose Operating Mode set to Maximum Performance
SNC set to Enable
Hardware Prefetcher set to Disable
DCU Streamer Prefetcher set to Disable
MONITORMWAIT set to Enable
Execute Disable Bit set to Disable
Trusted Execution Technology set to Enable
Per Core Pstate set to Disable
XPT Prefetcher set to Enable
Stale AtoS set to Enable
LLC Deadline Alloc set to Enable
 Sysinfo program /home/cpu2017.1.0.2.ic18.0/bin/sysinfo
 Rev: r5797 of 2017-06-14 96c45e4568ad54c135fd618bcc091c0f
 running on linux-boxi Fri Nov 10 06:14:48 2017

 SUT (System Under Test) info as seen by some common utilities.
 For more information on this section, see
    https://www.spec.org/cpu2017/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : Intel(R) Xeon(R) Platinum 8180M CPU @ 2.50GHz
       8  "physical id"s (chips)
       448 "processors"
    cores, siblings (Caution: counting these is hw and system dependent. The following
    excerpts from /proc/cpuinfo might not be reliable.  Use with caution.)
       cpu cores : 28
       siblings  : 56
       physical 0: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 1: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 2: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 3: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 4: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 5: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 6: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 7: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30

 From lscpu:
      Architecture:          x86_64
      CPU op-mode(s):        32-bit, 64-bit
      Byte Order:            Little Endian
      CPU(s):                448
      On-line CPU(s) list:   0-447
      Thread(s) per core:    2
      Core(s) per socket:    28
      Socket(s):             8
      NUMA node(s):          16
      Vendor ID:             GenuineIntel
      CPU family:            6
      Model:                 85
      Model name:            Intel(R) Xeon(R) Platinum 8180M CPU @ 2.50GHz
      Stepping:              4
      CPU MHz:               2494.150
      BogoMIPS:              4988.30
      Virtualization:        VT-x
      L1d cache:             32K
      L1i cache:             32K
      L2 cache:              1024K
      L3 cache:              39424K
      NUMA node0 CPU(s):     0-3,7-9,14-17,21-23,224-227,231-233,238-241,245-247
      NUMA node1 CPU(s):     4-6,10-13,18-20,24-27,228-230,234-237,242-244,248-251
      NUMA node2 CPU(s):     28-31,35-37,42-45,49-51,252-255,259-261,266-269,273-275
      NUMA node3 CPU(s):     32-34,38-41,46-48,52-55,256-258,262-265,270-272,276-279
      NUMA node4 CPU(s):     56-59,63-65,70-73,77-79,280-283,287-289,294-297,301-303
      NUMA node5 CPU(s):     60-62,66-69,74-76,80-83,284-286,290-293,298-300,304-307
      NUMA node6 CPU(s):     84-87,91-93,98-101,105-107,308-311,315-317,322-325,329-331
      NUMA node7 CPU(s):     88-90,94-97,102-104,108-111,312-314,318-321,326-328,332-335
      NUMA node8 CPU(s):
      112-115,119-121,126-129,133-135,336-339,343-345,350-353,357-359
      NUMA node9 CPU(s):
      116-118,122-125,130-132,136-139,340-342,346-349,354-356,360-363
      NUMA node10 CPU(s):
      140-143,147-149,154-157,161-163,364-367,371-373,378-381,385-387
      NUMA node11 CPU(s):
      144-146,150-153,158-160,164-167,368-370,374-377,382-384,388-391
      NUMA node12 CPU(s):
      168-171,175-177,182-185,189-191,392-395,399-401,406-409,413-415
      NUMA node13 CPU(s):
      172-174,178-181,186-188,192-195,396-398,402-405,410-412,416-419
      NUMA node14 CPU(s):
      196-199,203-205,210-213,217-219,420-423,427-429,434-437,441-443
      NUMA node15 CPU(s):
      200-202,206-209,214-216,220-223,424-426,430-433,438-440,444-447
      Flags:                 fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
      pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp
      lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc
      aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg
      fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes
      xsave avx f16c rdrand lahf_lm abm 3dnowprefetch ida arat epb pln pts dtherm intel_pt
      tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2
      erms invpcid rtm cqm mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd
      avx512bw avx512vl xsaveopt xsavec xgetbv1 cqm_llc cqm_occup_llc

 /proc/cpuinfo cache data
    cache size : 39424 KB

 From numactl --hardware  WARNING: a numactl 'node' might or might not correspond to a
 physical chip.
   available: 16 nodes (0-15)
   node 0 cpus: 0 1 2 3 7 8 9 14 15 16 17 21 22 23 224 225 226 227 231 232 233 238 239 240
   241 245 246 247
   node 0 size: 192984 MB
   node 0 free: 192370 MB
   node 1 cpus: 4 5 6 10 11 12 13 18 19 20 24 25 26 27 228 229 230 234 235 236 237 242 243
   244 248 249 250 251
   node 1 size: 193528 MB
   node 1 free: 192982 MB
   node 2 cpus: 28 29 30 31 35 36 37 42 43 44 45 49 50 51 252 253 254 255 259 260 261 266
   267 268 269 273 274 275
   node 2 size: 193528 MB
   node 2 free: 184527 MB
   node 3 cpus: 32 33 34 38 39 40 41 46 47 48 52 53 54 55 256 257 258 262 263 264 265 270
   271 272 276 277 278 279
   node 3 size: 193528 MB
   node 3 free: 188017 MB
   node 4 cpus: 56 57 58 59 63 64 65 70 71 72 73 77 78 79 280 281 282 283 287 288 289 294
   295 296 297 301 302 303
   node 4 size: 193528 MB
   node 4 free: 193008 MB
   node 5 cpus: 60 61 62 66 67 68 69 74 75 76 80 81 82 83 284 285 286 290 291 292 293 298
   299 300 304 305 306 307
   node 5 size: 193528 MB
   node 5 free: 193010 MB
   node 6 cpus: 84 85 86 87 91 92 93 98 99 100 101 105 106 107 308 309 310 311 315 316 317
   322 323 324 325 329 330 331
   node 6 size: 193528 MB
   node 6 free: 192995 MB
   node 7 cpus: 88 89 90 94 95 96 97 102 103 104 108 109 110 111 312 313 314 318 319 320
   321 326 327 328 332 333 334 335
   node 7 size: 193528 MB
   node 7 free: 192781 MB
   node 8 cpus: 112 113 114 115 119 120 121 126 127 128 129 133 134 135 336 337 338 339
   343 344 345 350 351 352 353 357 358 359
   node 8 size: 193528 MB
   node 8 free: 192826 MB
   node 9 cpus: 116 117 118 122 123 124 125 130 131 132 136 137 138 139 340 341 342 346
   347 348 349 354 355 356 360 361 362 363
   node 9 size: 193528 MB
   node 9 free: 192949 MB
   node 10 cpus: 140 141 142 143 147 148 149 154 155 156 157 161 162 163 364 365 366 367
   371 372 373 378 379 380 381 385 386 387
   node 10 size: 193528 MB
   node 10 free: 192982 MB
   node 11 cpus: 144 145 146 150 151 152 153 158 159 160 164 165 166 167 368 369 370 374
   375 376 377 382 383 384 388 389 390 391
   node 11 size: 193528 MB
   node 11 free: 192987 MB
   node 12 cpus: 168 169 170 171 175 176 177 182 183 184 185 189 190 191 392 393 394 395
   399 400 401 406 407 408 409 413 414 415
   node 12 size: 193528 MB
   node 12 free: 193020 MB
   node 13 cpus: 172 173 174 178 179 180 181 186 187 188 192 193 194 195 396 397 398 402
   403 404 405 410 411 412 416 417 418 419
   node 13 size: 193528 MB
   node 13 free: 193013 MB
   node 14 cpus: 196 197 198 199 203 204 205 210 211 212 213 217 218 219 420 421 422 423
   427 428 429 434 435 436 437 441 442 443
   node 14 size: 193528 MB
   node 14 free: 193004 MB
   node 15 cpus: 200 201 202 206 207 208 209 214 215 216 220 221 222 223 424 425 426 430
   431 432 433 438 439 440 444 445 446 447
   node 15 size: 193523 MB
   node 15 free: 193008 MB
   node distances:
   node   0   1   2   3   4   5   6   7   8   9  10  11  12  13  14  15
     0:  10  20  20  20  20  20  20  20  20  20  20  20  20  20  20  20
     1:  20  10  20  20  20  20  20  20  20  20  20  20  20  20  20  20
     2:  20  20  10  20  20  20  20  20  20  20  20  20  20  20  20  20
     3:  20  20  20  10  20  20  20  20  20  20  20  20  20  20  20  20
     4:  20  20  20  20  10  20  20  20  20  20  20  20  20  20  20  20
     5:  20  20  20  20  20  10  20  20  20  20  20  20  20  20  20  20
     6:  20  20  20  20  20  20  10  20  20  20  20  20  20  20  20  20
     7:  20  20  20  20  20  20  20  10  20  20  20  20  20  20  20  20
     8:  20  20  20  20  20  20  20  20  10  20  20  20  20  20  20  20
     9:  20  20  20  20  20  20  20  20  20  10  20  20  20  20  20  20
    10:  20  20  20  20  20  20  20  20  20  20  10  20  20  20  20  20
    11:  20  20  20  20  20  20  20  20  20  20  20  10  20  20  20  20
    12:  20  20  20  20  20  20  20  20  20  20  20  20  10  20  20  20
    13:  20  20  20  20  20  20  20  20  20  20  20  20  20  10  20  20
    14:  20  20  20  20  20  20  20  20  20  20  20  20  20  20  10  20
    15:  20  20  20  20  20  20  20  20  20  20  20  20  20  20  20  10

 From /proc/meminfo
    MemTotal:       3170207908 kB
    HugePages_Total:       0
    Hugepagesize:       2048 kB

 From /etc/*release* /etc/*version*
    SuSE-release:
       SUSE Linux Enterprise Server 12 (x86_64)
       VERSION = 12
       PATCHLEVEL = 2
       # This file is deprecated and will be removed in a future service pack or release.
       # Please check /etc/os-release for details about this release.
    os-release:
       NAME="SLES"
       VERSION="12-SP2"
       VERSION_ID="12.2"
       PRETTY_NAME="SUSE Linux Enterprise Server 12 SP2"
       ID="sles"
       ANSI_COLOR="0;32"
       CPE_NAME="cpe:/o:suse:sles:12:sp2"

 uname -a:
    Linux linux-boxi 4.4.21-69-default #1 SMP Tue Oct 25 10:58:20 UTC 2016 (9464f67)
    x86_64 x86_64 x86_64 GNU/Linux

 run-level 3 Nov 10 06:10

 SPEC is set to: /home/cpu2017.1.0.2.ic18.0
    Filesystem     Type   Size  Used Avail Use% Mounted on
    tmpfs          tmpfs  800G   11G  790G   2% /home

 Additional information from dmidecode follows.  WARNING: Use caution when you interpret
 this section. The 'dmidecode' program reads system data which is "intended to allow
 hardware to be accurately determined", but the intent may not be met, as there are
 frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard.
   BIOS Lenovo -[PSE105X-1.00]- 08/17/2017
   Memory:
    96x Samsung M393A4K40BB2-CTD 32 GB 2 rank 2666

 (End of data from sysinfo program)

Base Compiler Invocation

C benchmarks:

 icc 

C++ benchmarks:

 icpc 

Fortran benchmarks:

 ifort 

Benchmarks using both Fortran and C:

 ifort   icc 

Benchmarks using both C and C++:

 icpc   icc 

Benchmarks using Fortran, C, and C++:

 icpc   icc   ifort 

Base Portability Flags

503.bwaves_r:  -DSPEC_LP64 
507.cactuBSSN_r:  -DSPEC_LP64 
508.namd_r:  -DSPEC_LP64 
510.parest_r:  -DSPEC_LP64 
511.povray_r:  -DSPEC_LP64 
519.lbm_r:  -DSPEC_LP64 
521.wrf_r:  -DSPEC_LP64   -DSPEC_CASE_FLAG   -convert big_endian 
526.blender_r:  -DSPEC_LP64   -DSPEC_LINUX   -funsigned-char 
527.cam4_r:  -DSPEC_LP64   -DSPEC_CASE_FLAG 
538.imagick_r:  -DSPEC_LP64 
544.nab_r:  -DSPEC_LP64 
549.fotonik3d_r:  -DSPEC_LP64 
554.roms_r:  -DSPEC_LP64 

Base Optimization Flags

C benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3 

C++ benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3 

Fortran benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -nostandard-realloc-lhs   -align array32byte 

Benchmarks using both Fortran and C:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -nostandard-realloc-lhs   -align array32byte 

Benchmarks using both C and C++:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3 

Benchmarks using Fortran, C, and C++:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -nostandard-realloc-lhs   -align array32byte 

Base Other Flags

C benchmarks:

 -m64   -std=c11 

C++ benchmarks:

 -m64 

Fortran benchmarks:

 -m64 

Benchmarks using both Fortran and C:

 -m64   -std=c11 

Benchmarks using both C and C++:

 -m64   -std=c11 

Benchmarks using Fortran, C, and C++:

 -m64   -std=c11 

Peak Compiler Invocation

C benchmarks:

 icc 

C++ benchmarks:

 icpc 

Fortran benchmarks:

 ifort 

Benchmarks using both Fortran and C:

 ifort   icc 

Benchmarks using both C and C++:

 icpc   icc 

Benchmarks using Fortran, C, and C++:

 icpc   icc   ifort 

Peak Portability Flags

Same as Base Portability Flags

Peak Optimization Flags

C benchmarks:

519.lbm_r:  -prof-gen(pass 1)   -prof-use(pass 2)   -ipo   -xCORE-AVX2   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3 
538.imagick_r:  -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3 
544.nab_r:  Same as 519.lbm_r 

C++ benchmarks:

 -prof-gen(pass 1)   -prof-use(pass 2)   -ipo   -xCORE-AVX2   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3 

Fortran benchmarks:

503.bwaves_r:  -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -nostandard-realloc-lhs   -align array32byte 
549.fotonik3d_r:  Same as 503.bwaves_r 
554.roms_r:  -prof-gen(pass 1)   -prof-use(pass 2)   -ipo   -xCORE-AVX2   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -nostandard-realloc-lhs   -align array32byte 

Benchmarks using both Fortran and C:

 -prof-gen(pass 1)   -prof-use(pass 2)   -ipo   -xCORE-AVX2   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -nostandard-realloc-lhs   -align array32byte 

Benchmarks using both C and C++:

 -prof-gen(pass 1)   -prof-use(pass 2)   -ipo   -xCORE-AVX2   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3 

Benchmarks using Fortran, C, and C++:

 -prof-gen(pass 1)   -prof-use(pass 2)   -ipo   -xCORE-AVX2   -O3   -no-prec-div   -qopt-prefetch   -ffinite-math-only   -qopt-mem-layout-trans=3   -nostandard-realloc-lhs   -align array32byte 

Peak Other Flags

C benchmarks:

 -m64   -std=c11 

C++ benchmarks:

 -m64 

Fortran benchmarks:

 -m64 

Benchmarks using both Fortran and C:

 -m64   -std=c11 

Benchmarks using both C and C++:

 -m64   -std=c11 

Benchmarks using Fortran, C, and C++:

 -m64   -std=c11 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2017/flags/Intel-ic18.0-official-linux64.html,
http://www.spec.org/cpu2017/flags/Lenovo-Platform-Flags-V1.2-SKL-E.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2017/flags/Intel-ic18.0-official-linux64.xml,
http://www.spec.org/cpu2017/flags/Lenovo-Platform-Flags-V1.2-SKL-E.xml.