SPEC® CFP2006 Result

Copyright 2006-2012 Standard Performance Evaluation Corporation

IBM Corporation

IBM System x3755 M3,
AMD Opteron 6262 HE

CPU2006 license: 11 Test date: Feb-2012
Test sponsor: IBM Corporation Hardware Availability: Dec-2011
Tested by: IBM Corporation Software Availability: Jul-2011
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6262 HE
CPU Characteristics: AMD Turbo CORE technology up to 2.90 GHz
CPU MHz: 1600
FPU: Integrated
CPU(s) enabled: 64 cores, 4 chips, 16 cores/chip
CPU(s) orderable: 2,4 chips
Primary Cache: 512 KB I on chip per chip,
64 KB I shared / 2 cores;
16 KB D on chip per core
Secondary Cache: 16 MB I+D on chip per chip, 2 MB shared / 2 cores
L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 8 cores
Other Cache: None
Memory: 256 GB (32 x 8 GB 2Rx4 PC3-10600R-9, ECC)
Disk Subsystem: 1 x 600 GB SATA, 15000 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.1,
Kernel 2.6.32-131.0.15.el6.x86_64
Compiler: C/C++/Fortran: Version 4.2.5.2 of x86 Open64
Compiler Suite (from AMD)
Auto Parallel: No
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 64 1563 557 1560 557 1556 559 64 1528 569 1532 568 1530 569
416.gamess 64 2857 439 2858 439 2856 439 64 2723 460 2725 460 2729 459
433.milc 64 1280 459 1281 459 1279 459 64 1093 538 1091 539 1091 539
434.zeusmp 64 957 608 959 607 959 608 64 930 626 931 625 928 627
435.gromacs 64 871 525 870 525 871 524 64 858 532 857 533 858 533
436.cactusADM 64 1058 723 1060 722 1060 722 64 996 768 997 767 995 769
437.leslie3d 64 1561 385 1562 385 1562 385 64 1289 467 1289 467 1290 466
444.namd 64 1152 445 1144 448 1144 449 64 1141 450 1143 449 1135 452
447.dealII 64 738 992 738 992 741 988 64 738 992 735 996 736 995
450.soplex 64 1535 348 1254 426 1254 426 64 1311 407 1184 451 1090 490
453.povray 64 609 559 608 560 607 561 64 541 629 541 629 542 629
454.calculix 64 716 738 715 738 716 737 64 694 760 693 762 693 761
459.GemsFDTD 64 1914 355 1915 355 1916 354 64 1769 384 1765 385 1763 385
465.tonto 64 1188 530 1189 529 1190 529 64 1091 577 1091 577 1090 578
470.lbm 64 1186 742 1187 741 1186 742 64 1181 744 1183 743 1183 743
481.wrf 64 1122 637 1122 637 1124 636 64 1117 640 1109 645 1117 640
482.sphinx3 64 2176 573 2168 575 2168 575 64 2064 604 2047 609 2051 608

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set transparent_hugepage=never as a boot paramenter in /boot/grub/menu.lst
Set kernel/randomize_va_space=0 in /etc/sysctl.conf

Set vm/nr_hugepages=57344 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

Platform Notes

 BIOS settings:
 Operating Mode set to Performance Mode

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "896"
LD_LIBRARY_PATH = "/root/speccpu-rate-rev1104B1/amd1104-rate-libs-revB/32:/root/speccpu-rate-rev1104B1/amd1104-rate-libs-revB/64"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on a system with 2x AMD Opteron 6282SE chips + 64GB Memory using RHEL 6.1

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=bdver1   -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso 

C++ benchmarks:

 -march=bdver1   -Ofast   -static   -CG:load_exe=0   -OPT:malloc_alg=1   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -D__OPEN64_FAST_SET 

Fortran benchmarks:

 -march=bdver1   -Ofast   -LNO:blocking=off   -OPT:rsqrt=2   -OPT:unroll_size=256   -HP:bd=2m:heap=2m   -mso 

Benchmarks using both Fortran and C:

 -march=bdver1   -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso   -LNO:blocking=off   -OPT:rsqrt=2   -OPT:unroll_size=256 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=bdver1   -Ofast   -CG:movnti=1   -CG:locs_best=on   -HP:bdt=2m:heap=2m   -IPA:plimit=7000   -IPA:callee_limit=1200   -OPT:struct_array_copy=2   -OPT:alias=field_sensitive   -mso 
470.lbm:  -march=bdver1   -Ofast   -CG:cmp_peep=on   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -HP:bdt=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso 
482.sphinx3:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:malloc_alg=2   -CG:cmp_peep=on   -CG:local_sched_alg=2   -INLINE:aggressive=on   -LNO:prefetch=2   -LNO:prefetch_ahead=4   -mso 

C++ benchmarks:

444.namd:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=bdver1   -Ofast   -D__OPEN64_FAST_SET   -static   -INLINE:aggressive=on   -LNO:opt=0   -LNO:simd=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -CG:movext_icmp=off   -TENV:frame_pointer=off 
450.soplex:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:RO=1   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -fno-exceptions   -m32   -HP:bdt=2m:heap=2m   -WOPT:sib=on 
453.povray:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:pre_local_sched=off   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -OPT:transform=2   -OPT:alias=disjoint   -WOPT:aggcm=0 

Fortran benchmarks:

410.bwaves:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:Ofast   -OPT:treeheight=on   -LNO:blocking=off   -LNO:ignore_feedback=off   -LNO:fu=4   -LNO:loop_model_simd=on   -LNO:simd_rm_unity_remainder=on   -WOPT:aggstr=0   -HP:bdt=2m:heap=2m   -CG:cmp_peep=on 
416.gamess:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:simd=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -OPT:unroll_times_max=2   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -WOPT:sib=on 
434.zeusmp:  -march=bdver1   -Ofast   -LNO:blocking=off   -LNO:interchange=off   -HP:bdt=2m:heap=2m 
437.leslie3d:  -march=bdver1   -Ofast   -CG:pre_minreg_level=2   -LNO:simd=0   -LNO:fusion=2   -HP:bdt=2m:heap=2m   -mso 
459.GemsFDTD:  -march=bdver1   -Ofast   -OPT:unroll_size=0   -LNO:fission=2   -CG:load_exe=0   -CG:local_sched_alg=2   -HP 
465.tonto:  -march=bdver1   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP:bdt=2m:heap=2m 

Benchmarks using both Fortran and C:

435.gromacs:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:blocking=off   -LNO:prefetch=2   -HP   -CG:locs_shallow_depth=1   -CG:load_exe=0   -WOPT:sib=on 
454.calculix:  -march=bdver1   -Ofast   -OPT:unroll_size=256   -GRA:optimize_boundary=on   -HP:bdt=2m:heap=2m 
481.wrf:  -march=bdver1   -Ofast   -LNO:blocking=off   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -CG:load_exe=1   -HP   -WOPT:sib=on 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-425-flags-rate-revB.html,
http://www.spec.org/cpu2006/flags/amd-platform-rate-revB.20120103.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-425-flags-rate-revB.xml,
http://www.spec.org/cpu2006/flags/amd-platform-rate-revB.20120103.xml.