SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Dell Inc.

PowerEdge M915 (AMD Opteron 6132 HE, 2.20 GHz)

CPU2006 license: 55 Test date: May-2011
Test sponsor: Dell Inc. Hardware Availability: Jun-2011
Tested by: Dell Inc. Software Availability: Jul-2010
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6132 HE
CPU Characteristics:
CPU MHz: 2200
FPU: Integrated
CPU(s) enabled: 16 cores, 2 chips, 8 cores/chip
CPU(s) orderable: 2,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 12 MB I+D on chip per chip, 6 MB shared / 4 cores
Other Cache: None
Memory: 64 GB (16 x 4 GB 2Rx4 PC3L-10600R-9, ECC)
Disk Subsystem: 1 x 146 GB 10000 RPM SAS
Other Hardware: None
Software
Operating System: SUSE Linux Enterprise Server 11 (x86_64),
Kernel 2.6.27.19-5-default
Compiler: x86 Open64 4.2.4 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 16 893 243 892 244 897 242 16 829 262 828 263 828 263
416.gamess 16 1355 231 1354 231 1355 231 16 1247 251 1249 251 1248 251
433.milc 16 903 163 904 163 903 163 16 711 207 711 207 711 207
434.zeusmp 16 707 206 709 205 711 205 16 695 210 695 210 695 209
435.gromacs 16 616 185 615 186 614 186 16 478 239 476 240 478 239
436.cactusADM 16 809 236 810 236 812 236 4 139 344 139 344 139 344
437.leslie3d 16 865 174 861 175 864 174 16 869 173 865 174 860 175
444.namd 16 699 184 697 184 695 185 16 651 197 649 198 649 198
447.dealII 16 580 316 576 318 573 319 16 521 351 525 348 527 347
450.soplex 16 1012 132 921 145 921 145 16 923 145 821 163 817 163
453.povray 16 331 257 327 260 328 260 16 294 290 292 292 293 291
454.calculix 16 484 273 485 272 486 271 16 459 287 457 289 458 288
459.GemsFDTD 16 1098 155 1103 154 1097 155 16 996 170 999 170 995 171
465.tonto 16 649 243 649 243 648 243 16 593 265 593 265 587 268
470.lbm 16 778 282 777 283 777 283 16 778 282 777 283 777 283
481.wrf 16 726 246 725 247 727 246 16 676 264 677 264 681 262
482.sphinx3 16 1050 297 1050 297 1047 298 16 1030 303 1029 303 1033 302

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set vm/nr_hugepages=14336 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "896"
LD_LIBRARY_PATH = "/root/cpu2006-1.1./amd1002-rate-libs-revC/64:/root/cpu2006-1.1./amd1002-rate-libs-revC/32"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64
Binaries were compiled on SLES10 SP2 with binutils 2.18 OMP_NUM_THREADS = "4"

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=barcelona   -mso   -Ofast   -OPT:malloc_alg=1   -HP:bdt=2m 

C++ benchmarks:

 -march=barcelona   -mso   -Ofast   -static   -INLINE:aggressive=on   -OPT:malloc_alg=1   -HP:bdt=2m 

Fortran benchmarks:

 -march=barcelona   -mso   -Ofast   -HP 

Benchmarks using both Fortran and C:

 -march=barcelona   -mso   -Ofast   -OPT:malloc_alg=1   -HP:bdt=2m   -HP 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=barcelona   -mso   -Ofast   -CG:movnti=1   -CG:local_sched_alg=1   -CG:locs_shallow_depth=1   -HP:bdt=2m:heap=2m   -LNO:prefetch=3 
470.lbm:  basepeak = yes 
482.sphinx3:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:malloc_alg=2   -CG:sse_cse_regs=0   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:local_sched_alg=1   -INLINE:aggressive=on 

C++ benchmarks:

444.namd:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -CG:compute_to=on   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=barcelona   -mso   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m 
453.povray:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on 

Fortran benchmarks:

410.bwaves:  -march=barcelona   -mso   -O3   -OPT:Ofast   -OPT:treeheight=on   -LNO:blocking=off   -LNO:prefetch_ahead=5   -LNO:ignore_feedback=off   -WOPT:aggstr=0   -HP:bdt=2m:heap=2m   -CG:cmp_peep=on 
416.gamess:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:prefetch=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -march=barcelona   -mso   -Ofast   -LNO:blocking=off   -LNO:interchange=off   -OPT:treeheight=on   -OPT:unroll_size=256   -CG:cmp_peep=on   -GRA:prioritize_by_density=on   -HP 
437.leslie3d:  -march=barcelona   -mso   -Ofast   -HP:bdt=2m:heap=2m 
459.GemsFDTD:  -march=barcelona   -mso   -Ofast   -LNO:fission=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:local_sched_alg=1   -HP 
465.tonto:  -march=barcelona   -mso   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -mso   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -apo   -LNO:prefetch_ahead=1   -HP:bdt=2m:heap=2m   -LANG:heap_allocation_threshold=100 
454.calculix:  -march=barcelona   -mso   -Ofast   -CG:load_exe=0   -CG:ptr_load_use=0   -CG:local_sched_alg=2   -CG:compute_to=on   -LNO:prefetch_ahead=30   -WOPT:unroll=2   -GRA:optimize_boundary=on   -HP:bdt=2m:heap=2m 
481.wrf:  -march=barcelona   -mso   -Ofast   -LNO:blocking=off   -LNO:prefetch_ahead=10   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -m3dnow   -HP 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-424-flags-rate-revC.20100901.html,
http://www.spec.org/cpu2006/flags/amd-platform-rate-revC.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-424-flags-rate-revC.20100901.xml,
http://www.spec.org/cpu2006/flags/amd-platform-rate-revC.xml.