SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Hewlett-Packard Company

ProLiant DL385 G6
(2.6 GHz AMD Opteron 2435)

CPU2006 license: 3 Test date: May-2009
Test sponsor: Hewlett-Packard Company Hardware Availability: Jun-2009
Tested by: Hewlett-Packard Company Software Availability: Apr-2009
Benchmark results graph
Hardware
CPU Name: AMD Opteron 2435
CPU Characteristics:
CPU MHz: 2600
FPU: Integrated
CPU(s) enabled: 12 cores, 2 chips, 6 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 32 GB (8x4 GB, PC2-6400P CL5)
Disk Subsystem: 1 x 120 GB SFF SATA, 5400 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 5.3,
Kernel 2.6.18-128.el5
Compiler: PGI Server Complete Version 8.0
x86 Open64 4.2.2 Compiler Suite
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 12 1561 104   1562 104   1561 104   12 1547 105   1546 105   1546 106  
416.gamess 12 1200 196   1200 196   1204 195   12 1113 211   1111 211   1112 211  
433.milc 12 1382 79.7 1382 79.7 1382 79.7 12 1382 79.7 1382 79.7 1382 79.7
434.zeusmp 12 748 146   745 147   742 147   12 739 148   737 148   735 149  
435.gromacs 12 502 171   501 171   502 171   12 411 209   411 208   412 208  
436.cactusADM 12 922 156   927 155   928 155   2 122 196   123 194   123 194  
437.leslie3d 12 1677 67.3 1679 67.2 1679 67.2 12 1579 71.4 1580 71.4 1578 71.5
444.namd 12 621 155   623 154   622 155   12 565 170   565 170   564 171  
447.dealII 12 663 207   656 209   656 209   12 480 286   484 283   479 286  
450.soplex 12 1218 82.2 1210 82.7 1207 82.9 12 1115 89.8 1109 90.3 1108 90.3
453.povray 12 322 198   322 198   323 198   12 267 239   267 239   267 240  
454.calculix 12 465 213   465 213   465 213   12 411 241   412 240   411 241  
459.GemsFDTD 12 1947 65.4 1945 65.5 1938 65.7 12 1914 66.5 1914 66.5 1914 66.5
465.tonto 12 710 166   715 165   711 166   12 588 201   597 198   592 200  
470.lbm 12 2653 62.2 2655 62.1 2646 62.3 12 2641 62.4 2633 62.6 2653 62.2
481.wrf 12 1099 122   1100 122   1098 122   12 1061 126   1059 127   1063 126  
482.sphinx3 12 1533 153   1533 153   1531 153   12 1452 161   1447 162   1448 162  

Submit Notes

The config file option 'submit' was used.
 'numactl' was used to bind copies to the cores
 See the configuration file for details.

Operating System Notes

 'ulimit -s unlimited' was used to set environment stack size
 'ulimit -l 2097152'  was used to set environment locked pages in memory limit

 Set vm/nr_hugepages=5400 in /etc/sysctl.conf
 mount -t hugetlbfs nodev /mnt/hugepages

Platform Notes

BIOS configuration:
  Power Regulator set to HP Static High Performance Mode
  Memory channel mode set to Independent

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "450"
LD_LIBRARY_PATH = "/cpu2006/amd0905is-libs/64:/cpu2006/amd0905is-libs/32"
NCPUS = "6"
PGI_HUGE_PAGES = "450"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64.

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

C++ benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

Fortran benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mvect=short   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Mvect=short   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -Mipa=jobs:4 

C++ benchmarks:

 -Mipa=jobs:4 

Fortran benchmarks:

 -Mipa=jobs:4 

Benchmarks using both Fortran and C:

 -Mipa=jobs:4 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks (except as noted below):

 openCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 openf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 
437.leslie3d:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
435.gromacs:  opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mloop32   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
482.sphinx3:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mfprelaxed   -Msmartalloc   -tp shanghai-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp shanghai-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -Wf,-fno-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

410.bwaves:  -fastsse   -Msmartalloc   -Mprefetch=nta   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -fastsse   -Mfprelaxed   -Mprefetch=distance:8   -Mprefetch=t0   -Msmartalloc=huge   -Msmartalloc=hugebss   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=fuse   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -HP 
465.tonto:  -march=barcelona   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -fastsse   -Mconcur   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=short   -Msmartalloc=huge   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 
481.wrf:  -fastsse   -Mvect=noaltcode   -Msmartalloc=huge   -Mprefetch=distance:8   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:4(pass 2) 

C++ benchmarks:

444.namd:  -Mipa=jobs:4(pass 2) 

Fortran benchmarks:

410.bwaves:  -Mipa=jobs:4 
434.zeusmp:  -Mipa=jobs:4 
437.leslie3d:  -Mipa=jobs:4(pass 2) 

Benchmarks using both Fortran and C:

436.cactusADM:  -Mipa=jobs:4 
454.calculix:  -Mipa=jobs:4(pass 2) 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/pgi80_linux_flags.html,
http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.20090710.00.html,
http://www.spec.org/cpu2006/flags/x86-open64-4.2.2-flags.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/pgi80_linux_flags.xml,
http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.20090710.00.xml,
http://www.spec.org/cpu2006/flags/x86-open64-4.2.2-flags.xml.