| CPU2006 license: | 4 | Test date: | May-2009 |
|---|---|---|---|
| Test sponsor: | SGI | Hardware Availability: | Mar-2009 |
| Tested by: | SGI | Software Availability: | Feb-2009 |
| Hardware | |
|---|---|
| CPU Name: | Intel Xeon X5570 |
| CPU Characteristics: | Quad Core, 2.93 GHz Intel Turbo Boost Technology up to 3.33 GHz |
| CPU MHz: | 2933 |
| FPU: | Integrated |
| CPU(s) enabled: | 16 cores, 4 chips, 4 cores/chip, 2 threads/core |
| CPU(s) orderable: | 1,2 chips per blade, 2-16384 blades |
| Primary Cache: | 32 KB I + 32 KB D on chip per core |
| Secondary Cache: | 256 KB I+D on chip per core |
| L3 Cache: | 8 MB I+D on chip per chip |
| Other Cache: | None |
| Memory: | 96 GB (2 x 12*4GB DDR3-1066 CL7 RDIMMs) |
| Disk Subsystem: | 13 TB Lustre Parallel Filesystem 1 Metadata Server and 6 Object Storage Servers 96 x 136 GB SAS (Seagate Cheetah 15000 rpm) |
| Other Hardware: | None |
| Software | |
|---|---|
| Operating System: | SUSE Linux Enterprise Server 10 (x86_64) SP2 with patch Linux kernel 20080917, Kernel 2.6.16.60-0.30-smp |
| Compiler: | Intel C++ and Fortran Compiler 11.0 for Linux Build 20090131 Package ID: l_cproc_p_11.0.080, l_cprof_p_11.0.080 |
| Auto Parallel: | No |
| File System: | lustre v1.6.7 over DDR Infiniband |
| System State: | Multi-user, run level 3 |
| Base Pointers: | 64-bit |
| Peak Pointers: | 32/64-bit |
| Other Software: | SGI ProPack 6 for Linux Service Pack 2 Binutils 2.18.50.0.7.20080502 |
| Benchmark | Base | Peak | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
| Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
| 410.bwaves | 32 | 1268 | 343 | 1273 | 342 | 1273 | 342 | 16 | 622 | 349 | 621 | 350 | 621 | 350 |
| 416.gamess | 32 | 1541 | 406 | 1542 | 406 | 1545 | 406 | 16 | 765 | 410 | 766 | 409 | 766 | 409 |
| 433.milc | 32 | 939 | 313 | 938 | 313 | 938 | 313 | 32 | 941 | 312 | 941 | 312 | 941 | 312 |
| 434.zeusmp | 32 | 713 | 408 | 707 | 412 | 713 | 408 | 32 | 703 | 414 | 677 | 430 | 704 | 413 |
| 435.gromacs | 32 | 584 | 391 | 583 | 392 | 580 | 394 | 32 | 566 | 404 | 569 | 401 | 563 | 406 |
| 436.cactusADM | 32 | 854 | 448 | 855 | 447 | 861 | 444 | 32 | 885 | 432 | 890 | 430 | 910 | 420 |
| 437.leslie3d | 32 | 1233 | 244 | 1238 | 243 | 1234 | 244 | 16 | 616 | 244 | 616 | 244 | 616 | 244 |
| 444.namd | 32 | 709 | 362 | 701 | 366 | 702 | 366 | 32 | 690 | 372 | 687 | 373 | 691 | 371 |
| 447.dealII | 32 | 651 | 562 | 645 | 568 | 647 | 566 | 32 | 602 | 608 | 622 | 588 | 604 | 606 |
| 450.soplex | 32 | 1003 | 266 | 1004 | 266 | 1005 | 266 | 16 | 474 | 281 | 474 | 282 | 474 | 281 |
| 453.povray | 32 | 320 | 532 | 320 | 531 | 320 | 532 | 32 | 269 | 633 | 267 | 639 | 266 | 640 |
| 454.calculix | 32 | 569 | 464 | 572 | 461 | 574 | 460 | 32 | 577 | 457 | 576 | 458 | 580 | 455 |
| 459.GemsFDTD | 32 | 1572 | 216 | 1571 | 216 | 1572 | 216 | 16 | 768 | 221 | 769 | 221 | 769 | 221 |
| 465.tonto | 32 | 773 | 407 | 782 | 402 | 781 | 403 | 32 | 769 | 409 | 740 | 425 | 745 | 422 |
| 470.lbm | 32 | 2053 | 214 | 2053 | 214 | 2053 | 214 | 16 | 990 | 222 | 991 | 222 | 991 | 222 |
| 481.wrf | 32 | 862 | 415 | 862 | 414 | 865 | 413 | 32 | 862 | 415 | 862 | 414 | 865 | 413 |
| 482.sphinx3 | 32 | 1615 | 386 | 1618 | 386 | 1618 | 385 | 32 | 1542 | 404 | 1543 | 404 | 1548 | 403 |
The config file option 'submit' was used. A submit.pl script was used to distribute benchmark copies across the 2 blades and to pin processes to cores using dplace. Each blade runs a separate instance of the operating system.
Adjacent cache line prefetch enabled System has 2 blades with 2 chips/blade.
| icc |
| icpc |
| ifort |
| icc ifort |
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
| 436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
| 437.leslie3d: | -DSPEC_CPU_LP64 |
| 444.namd: | -DSPEC_CPU_LP64 |
| 447.dealII: | -DSPEC_CPU_LP64 |
| 450.soplex: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 482.sphinx3: | -DSPEC_CPU_LP64 |
| -xSSE4.2 -ipo -O3 -no-prec-div -static |
| -xSSE4.2 -ipo -O3 -no-prec-div -static |
| -xSSE4.2 -ipo -O3 -no-prec-div -static |
| -xSSE4.2 -ipo -O3 -no-prec-div -static |
| icc | |
| 482.sphinx3: | icc -m32 |
| icpc | |
| 450.soplex: | icpc -m32 |
| ifort | |
| 437.leslie3d: | ifort -m32 |
| icc ifort |
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
| 436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
| 444.namd: | -DSPEC_CPU_LP64 |
| 447.dealII: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 433.milc: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -fno-alias |
| 470.lbm: | -xSSE4.2 -ipo -O3 -no-prec-div -static -opt-prefetch -auto-ilp32 |
| 482.sphinx3: | -xSSE4.2 -ipo -O3 -no-prec-div -static -unroll2 |
| 444.namd: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -fno-alias -auto-ilp32 |
| 447.dealII: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -ansi-alias -scalar-rep- |
| 450.soplex: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-malloc-options=3 |
| 453.povray: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll4 -ansi-alias |
| 410.bwaves: | -xSSE4.2 -ipo -O3 -no-prec-div -static -opt-prefetch |
| 416.gamess: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -Ob0 -ansi-alias -scalar-rep- |
| 434.zeusmp: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) |
| 437.leslie3d: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-malloc-options=3 -opt-prefetch |
| 459.GemsFDTD: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -Ob0 -opt-prefetch |
| 465.tonto: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll4 -auto |
| 435.gromacs: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-prefetch -auto-ilp32 |
| 436.cactusADM: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -opt-prefetch -auto-ilp32 |
| 454.calculix: | -xSSE4.2 -ipo -O3 -no-prec-div -static -auto-ilp32 |
| 481.wrf: | basepeak = yes |