SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Supermicro (Test Sponsor: Advanced Micro Devices)

Supermicro A+ Server 2042G-6RF,
AMD Opteron 6166 HE

SPECfp®2006 = 27.2

CPU2006 license: 49 Test date: Sep-2010
Test sponsor: Advanced Micro Devices Hardware Availability: Feb-2011
Tested by: Advanced Micro Devices Software Availability: May-2010
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6166 HE
CPU Characteristics:
CPU MHz: 1800
FPU: Integrated
CPU(s) enabled: 48 cores, 4 chips, 12 cores/chip
CPU(s) orderable: 2,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 12 MB I+D on chip per chip, 6 MB shared / 6 cores
Other Cache: None
Memory: 128 GB (32 x 4 GB 2Rx4 PC3-10600R-9, ECC)
Disk Subsystem: 1 x 250 GB SATA, 7200 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 5.5,
Advanced Platform, Kernel 2.6.18-194.el5
Compiler: x86 Open64 4.2.3.2 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18

Results Table

Benchmark Base Peak
Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 243 56.0  242 56.1  242 56.2  64.4 211   63.9 213   64.4 211  
416.gamess 1695 11.6  1697 11.5  1717 11.4  1554   12.6 1554   12.6 1557   12.6
433.milc 617 14.9  617 14.9  617 14.9  452   20.3 453   20.3 451   20.4
434.zeusmp 302 30.1  301 30.2  302 30.2  292   31.2 290   31.4 291   31.3
435.gromacs 742 9.62 743 9.61 742 9.62 578   12.4 578   12.3 578   12.4
436.cactusADM 172 69.5  170 70.2  173 69.2  120   99.8 119   101   120   100  
437.leslie3d 658 14.3  659 14.3  658 14.3  611   15.4 613   15.3 608   15.5
444.namd 868 9.24 868 9.24 868 9.24 792   10.1 792   10.1 791   10.1
447.dealII 631 18.1  631 18.1  632 18.1  555   20.6 555   20.6 555   20.6
450.soplex 728 11.5  731 11.4  731 11.4  648   12.9 646   12.9 645   12.9
453.povray 396 13.4  396 13.4  395 13.5  381   14.0 384   13.9 382   13.9
454.calculix 509 16.2  512 16.1  509 16.2  484   17.0 480   17.2 479   17.2
459.GemsFDTD 387 27.4  388 27.4  388 27.4  368   28.8 368   28.9 368   28.8
465.tonto 697 14.1  696 14.1  697 14.1  668   14.7 669   14.7 671   14.7
470.lbm 697 19.7  696 19.7  694 19.8  25.2 545   24.9 552   25.0 550  
481.wrf 438 25.5  439 25.5  438 25.5  438   25.5 439   25.5 438   25.5
482.sphinx3 1040 18.7  1042 18.7  1048 18.6  1011   19.3 1009   19.3 1005   19.4

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set vm/nr_hugepages=8000 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

cpuspeed stop was used to set the CPU frequency to its maximum.

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/root/work/cpu2006/amd1002mc-speed-libs-revA/64:/root/work/cpu2006/amd1002mc-speed-libs-revA/32"
O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,
24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47"
O64_OMP_SPIN_USER_LOCK = "true"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=barcelona   -Ofast   -HP:bdt=2m:heap=2m 

C++ benchmarks:

 -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

 -march=barcelona   -Ofast   -apo   -LNO:parallel_overhead=10000   -LNO:fusion_peeling_limit=0   -HP:bdt=2m:heap=2m 

Benchmarks using both Fortran and C:

 -march=barcelona   -Ofast   -HP:bdt=2m:heap=2m   -apo   -LNO:parallel_overhead=10000   -LNO:fusion_peeling_limit=0 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=barcelona   -Ofast   -apo   -CG:movnti=1   -CG:local_sched_alg=1   -CG:locs_shallow_depth=1   -CG:compute_to=on   -HP:bdt=2m:heap=2m   -LNO:prefetch=3 
470.lbm:  -march=barcelona   -Ofast   -mso   -apo   -CG:sse_cse_regs=0   -LNO:prefetch_ahead=4   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:compute_to=on   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -OPT:alias=restricted   -m3dnow   -IPA:inline=off 
482.sphinx3:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:malloc_alg=2   -CG:sse_cse_regs=0   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:local_sched_alg=1   -INLINE:aggressive=on 

C++ benchmarks:

444.namd:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -CG:compute_to=on   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m:heap=2m 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

410.bwaves:  -march=barcelona   -Ofast   -apo   -OPT:malloc_alg=2   -CG:use_prefetchnta=on   -CG:cmp_peep=on   -LNO:blocking=off   -LNO:prefetch=3   -LNO:prefetch_ahead=5   -LNO:ignore_feedback=off   -LNO:apo_use_feedback=on   -WOPT:aggstr=0 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:prefetch=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -march=barcelona   -Ofast   -apo   -LNO:blocking=off   -LNO:interchange=off   -LNO:fusion_peeling_limit=0   -OPT:treeheight=on   -OPT:unroll_size=256   -CG:cmp_peep=on   -CG:compute_to=on   -GRA:prioritize_by_density=on   -HP:bdt=2m:heap=2m 
437.leslie3d:  -march=barcelona   -Ofast   -apo   -OPT:unroll_size=256   -LNO:prefetch_ahead=4   -LNO:parallel_overhead=32768   -GRA:prioritize_by_density=on   -m3dnow   -HP:bdt=2m:heap=2m 
459.GemsFDTD:  -march=barcelona   -Ofast   -apo   -LNO:fission=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:local_sched_alg=1   -HP 
465.tonto:  -march=barcelona   -Ofast   -apo   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -apo   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -apo   -LANG:heap_allocation_threshold=1000   -LNO:prefetch_ahead=1   -HP:bdt=2m:heap=2m 
454.calculix:  -march=barcelona   -Ofast   -LNO:prefetch_ahead=30   -CG:load_exe=0   -CG:ptr_load_use=0   -CG:local_sched_alg=2   -CG:compute_to=on   -WOPT:unroll=2   -GRA:optimize_boundary=on   -HP:bdt=2m:heap=2m   -apo 
481.wrf:  basepeak = yes 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.html,
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.20101207.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.xml,
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.20101207.xml.