SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Supermicro (Test Sponsor: Advanced Micro Devices)

Supermicro A+ Server 1022G-NTF,
AMD Opteron 6168

SPECfp®2006 = 28.0

CPU2006 license: 49 Test date: Sep-2010
Test sponsor: Advanced Micro Devices Hardware Availability: Mar-2010
Tested by: Advanced Micro Devices Software Availability: May-2010
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6168
CPU Characteristics:
CPU MHz: 1900
FPU: Integrated
CPU(s) enabled: 24 cores, 2 chips, 12 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 12 MB I+D on chip per chip, 6 MB shared / 6 cores
Other Cache: None
Memory: 64 GB (16 x 4 GB 2Rx4 PC3-10600R-9, ECC)
Disk Subsystem: 1 x 250 GB SATA, 7200 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 5.4,
Advanced Platform with patch RHSA-2009:1670,
Kernel 2.6.18-164.9.1.el5
Compiler: x86 Open64 4.2.3.2 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18

Results Table

Benchmark Base Peak
Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 215 63.2  215 63.2  215 63.2  72.3 188   71.7 190   72.2 188  
416.gamess 1603 12.2  1612 12.1  1606 12.2  1469   13.3 1463   13.4 1462   13.4
433.milc 571 16.1  570 16.1  573 16.0  419   21.9 422   21.7 421   21.8
434.zeusmp 282 32.3  282 32.2  282 32.3  272   33.5 271   33.5 273   33.4
435.gromacs 700 10.2  701 10.2  701 10.2  543   13.2 542   13.2 543   13.2
436.cactusADM 161 74.2  164 73.0  161 74.3  110   108   110   109   110   108  
437.leslie3d 601 15.6  600 15.7  605 15.5  565   16.6 564   16.7 565   16.6
444.namd 821 9.77 821 9.77 821 9.77 748   10.7 744   10.8 746   10.7
447.dealII 591 19.4  593 19.3  601 19.0  523   21.9 522   21.9 521   21.9
450.soplex 681 12.2  681 12.2  682 12.2  602   13.8 599   13.9 604   13.8
453.povray 379 14.0  376 14.1  377 14.1  362   14.7 363   14.7 362   14.7
454.calculix 480 17.2  479 17.2  479 17.2  451   18.3 450   18.3 451   18.3
459.GemsFDTD 346 30.7  346 30.7  346 30.7  332   32.0 332   32.0 332   32.0
465.tonto 622 15.8  620 15.9  622 15.8  591   16.7 590   16.7 590   16.7
470.lbm 648 21.2  650 21.1  646 21.3  41.1 334   41.1 334   41.1 334  
481.wrf 383 29.2  384 29.1  383 29.2  383   29.2 384   29.1 383   29.2
482.sphinx3 986 19.8  979 19.9  977 19.9  948   20.6 943   20.7 950   20.5

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set vm/nr_hugepages=4000 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

cpuspeed stop was used to set the CPU frequency to its maximum.

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/root/work/cpu2006/amd1002mc-speed-libs-revA/64:/root/work/cpu2006/amd1002mc-speed-libs-revA/32"
O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23"
O64_OMP_SPIN_USER_LOCK = "true"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=barcelona   -Ofast   -HP:bdt=2m:heap=2m 

C++ benchmarks:

 -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

 -march=barcelona   -Ofast   -apo   -LNO:parallel_overhead=10000   -LNO:fusion_peeling_limit=0   -HP:bdt=2m:heap=2m 

Benchmarks using both Fortran and C:

 -march=barcelona   -Ofast   -HP:bdt=2m:heap=2m   -apo   -LNO:parallel_overhead=10000   -LNO:fusion_peeling_limit=0 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=barcelona   -Ofast   -apo   -CG:movnti=1   -CG:local_sched_alg=1   -CG:locs_shallow_depth=1   -CG:compute_to=on   -HP:bdt=2m:heap=2m   -LNO:prefetch=3 
470.lbm:  -march=barcelona   -Ofast   -mso   -apo   -CG:sse_cse_regs=0   -LNO:prefetch_ahead=4   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:compute_to=on   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -OPT:alias=restricted   -m3dnow   -IPA:inline=off 
482.sphinx3:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:malloc_alg=2   -CG:sse_cse_regs=0   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:local_sched_alg=1   -INLINE:aggressive=on 

C++ benchmarks:

444.namd:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -CG:compute_to=on   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m:heap=2m 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

410.bwaves:  -march=barcelona   -Ofast   -apo   -OPT:malloc_alg=2   -CG:use_prefetchnta=on   -CG:cmp_peep=on   -LNO:blocking=off   -LNO:prefetch=3   -LNO:prefetch_ahead=5   -LNO:ignore_feedback=off   -LNO:apo_use_feedback=on   -WOPT:aggstr=0 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:prefetch=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -march=barcelona   -Ofast   -apo   -LNO:blocking=off   -LNO:interchange=off   -LNO:fusion_peeling_limit=0   -OPT:treeheight=on   -OPT:unroll_size=256   -CG:cmp_peep=on   -CG:compute_to=on   -GRA:prioritize_by_density=on   -HP:bdt=2m:heap=2m 
437.leslie3d:  -march=barcelona   -Ofast   -apo   -OPT:unroll_size=256   -LNO:prefetch_ahead=4   -LNO:parallel_overhead=32768   -GRA:prioritize_by_density=on   -m3dnow   -HP:bdt=2m:heap=2m 
459.GemsFDTD:  -march=barcelona   -Ofast   -apo   -LNO:fission=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:local_sched_alg=1   -HP 
465.tonto:  -march=barcelona   -Ofast   -apo   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -apo   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -apo   -LANG:heap_allocation_threshold=1000   -LNO:prefetch_ahead=1   -HP:bdt=2m:heap=2m 
454.calculix:  -march=barcelona   -Ofast   -LNO:prefetch_ahead=30   -CG:load_exe=0   -CG:ptr_load_use=0   -CG:local_sched_alg=2   -CG:compute_to=on   -WOPT:unroll=2   -GRA:optimize_boundary=on   -HP:bdt=2m:heap=2m   -apo 
481.wrf:  basepeak = yes 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.html,
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.20101207.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.xml,
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.20101207.xml.