SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Tyan (Test Sponsor: Advanced Micro Devices)

Thunder n4250QE (S4985-SI), AMD Opteron 8425 HE

CPU2006 license: 49 Test date: May-2009
Test sponsor: Advanced Micro Devices Hardware Availability: Jul-2009
Tested by: Advanced Micro Devices Software Availability: Apr-2009
Benchmark results graph
Hardware
CPU Name: AMD Opteron 8425 HE
CPU Characteristics:
CPU MHz: 2100
FPU: Integrated
CPU(s) enabled: 24 cores, 4 chips, 6 cores/chip
CPU(s) orderable: 2,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 64 GB (16x4 GB, DDR2-800, CL5, Reg, Dual Rank)
Disk Subsystem: 1 x 250 GB SATA, 7200 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 5.3,
Advanced Platform, Kernel 2.6.18-128.el5
Compiler: PGI Server Complete Version 8.0
x86 Open64 4.2.2 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 24 1583 206 1583 206 1583 206 24 1547 211 1544 211 1546 211
416.gamess 24 1474 319 1474 319 1477 318 24 1375 342 1374 342 1375 342
433.milc 24 1386 159 1387 159 1386 159 24 1386 159 1387 159 1386 159
434.zeusmp 24 810 270 805 271 808 270 24 811 269 810 270 812 269
435.gromacs 24 630 272 625 274 624 274 24 515 333 511 335 518 331
436.cactusADM 24 995 288 996 288 997 288 4 144 332 143 334 143 335
437.leslie3d 24 1704 132 1704 132 1706 132 24 1611 140 1607 140 1607 140
444.namd 24 764 252 764 252 763 252 24 701 275 692 278 693 278
447.dealII 24 750 366 729 377 748 367 24 548 501 547 502 550 499
450.soplex 24 1237 162 1239 162 1243 161 24 1179 170 1151 174 1146 175
453.povray 24 418 306 411 311 426 300 24 371 344 329 388 379 337
454.calculix 24 563 352 564 351 565 351 24 498 398 500 396 497 398
459.GemsFDTD 24 1969 129 1970 129 1965 130 24 1902 134 1904 134 1904 134
465.tonto 24 851 277 847 279 848 279 24 703 336 705 335 701 337
470.lbm 24 2631 125 2632 125 2631 125 24 2627 126 2625 126 2627 126
481.wrf 24 1134 236 1137 236 1137 236 24 1099 244 1099 244 1100 244
482.sphinx3 24 1659 282 1655 283 1649 284 24 1537 304 1530 306 1538 304

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set vm/nr_hugepages=10800 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "450"
LD_LIBRARY_PATH = "/root/work/cpu2006v1.1/amd0905is-libs/64:/root/work/cpu2006v1.1/amd0905is-libs/32"
NCPUS = "6"
PGI_HUGE_PAGES = "450"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64.

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

C++ benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

Fortran benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mvect=short   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Mvect=short   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -Mipa=jobs:4 

C++ benchmarks:

 -Mipa=jobs:4 

Fortran benchmarks:

 -Mipa=jobs:4 

Benchmarks using both Fortran and C:

 -Mipa=jobs:4 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks (except as noted below):

 openCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 openf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 
437.leslie3d:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
435.gromacs:  opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mloop32   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
482.sphinx3:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mfprelaxed   -Msmartalloc   -tp shanghai-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp shanghai-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -Wf,-fno-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

410.bwaves:  -fastsse   -Msmartalloc   -Mprefetch=nta   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -fastsse   -Mfprelaxed   -Mprefetch=distance:8   -Mprefetch=t0   -Msmartalloc=huge   -Msmartalloc=hugebss   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=fuse   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -HP 
465.tonto:  -march=barcelona   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -fastsse   -Mconcur   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=short   -Msmartalloc=huge   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 
481.wrf:  -fastsse   -Mvect=noaltcode   -Msmartalloc=huge   -Mprefetch=distance:8   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:4(pass 2) 

C++ benchmarks:

444.namd:  -Mipa=jobs:4(pass 2) 

Fortran benchmarks:

410.bwaves:  -Mipa=jobs:4 
434.zeusmp:  -Mipa=jobs:4 
437.leslie3d:  -Mipa=jobs:4(pass 2) 

Benchmarks using both Fortran and C:

436.cactusADM:  -Mipa=jobs:4 
454.calculix:  -Mipa=jobs:4(pass 2) 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-4.2.2-flags-revA.20090710.html,
http://www.spec.org/cpu2006/flags/amd-platform.20090710.html,
http://www.spec.org/cpu2006/flags/pgi80_linux_flags.20090710.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-4.2.2-flags-revA.20090710.xml,
http://www.spec.org/cpu2006/flags/amd-platform.20090710.xml,
http://www.spec.org/cpu2006/flags/pgi80_linux_flags.20090710.xml.