                           SPEC(R) CFP2006 Summary
                    Tyan Tyan YR190B8228, AMD Opteron 4184
                     Test Sponsor: Advanced Micro Devices
                           Thu Nov 18 00:57:02 2010

CPU2006 License: 49                                      Test date: Nov-2010
Test sponsor: Advanced Micro Devices         Hardware availability: Aug-2010
Tested by:    Advanced Micro Devices         Software availability: May-2010

                 Base      Base      Base      Peak      Peak      Peak
Benchmarks       Ref.  Run Time     Ratio      Ref.  Run Time     Ratio
-------------- ------ --------- ---------    ------ --------- ---------
410.bwaves      13590       171      79.5 S   13590       167      81.3 S
410.bwaves      13590       170      80.1 S   13590       167      81.5 *
410.bwaves      13590       170      79.7 *   13590       166      81.6 S
416.gamess      19580      1084      18.1 S   19580       990      19.8 S
416.gamess      19580      1084      18.1 *   19580       992      19.7 S
416.gamess      19580      1090      18.0 S   19580       991      19.8 *
433.milc         9180       455      20.2 S    9180       329      27.9 *
433.milc         9180       457      20.1 *    9180       329      27.9 S
433.milc         9180       457      20.1 S    9180       329      27.9 S
434.zeusmp       9100       211      43.2 S    9100       204      44.6 S
434.zeusmp       9100       211      43.2 *    9100       204      44.7 *
434.zeusmp       9100       211      43.2 S    9100       203      44.8 S
435.gromacs      7140       472      15.1 S    7140       364      19.6 S
435.gromacs      7140       472      15.1 *    7140       366      19.5 S
435.gromacs      7140       471      15.2 S    7140       365      19.6 *
436.cactusADM   11950       129      92.4 S   11950      79.8       150 *
436.cactusADM   11950       127      94.2 *   11950      79.8       150 S
436.cactusADM   11950       126      95.0 S   11950      79.6       150 S
437.leslie3d     9400       432      21.7 S    9400       410      22.9 *
437.leslie3d     9400       431      21.8 S    9400       408      23.0 S
437.leslie3d     9400       432      21.8 *    9400       411      22.9 S
444.namd         8020       554      14.5 *    8020       504      15.9 *
444.namd         8020       554      14.5 S    8020       504      15.9 S
444.namd         8020       555      14.4 S    8020       505      15.9 S
447.dealII      11440       416      27.5 *   11440       358      31.9 *
447.dealII      11440       416      27.5 S   11440       359      31.9 S
447.dealII      11440       418      27.4 S   11440       358      32.0 S
450.soplex       8340       501      16.6 *    8340       428      19.5 S
450.soplex       8340       506      16.5 S    8340       431      19.4 S
450.soplex       8340       501      16.6 S    8340       430      19.4 *
453.povray       5320       256      20.8 S    5320       245      21.7 S
453.povray       5320       254      21.0 *    5320       246      21.6 S
453.povray       5320       253      21.0 S    5320       245      21.7 *
454.calculix     8250       324      25.4 S    8250       305      27.0 S
454.calculix     8250       325      25.4 S    8250       304      27.1 S
454.calculix     8250       324      25.4 *    8250       305      27.1 *
459.GemsFDTD    10610       294      36.0 S   10610       285      37.3 S
459.GemsFDTD    10610       294      36.0 *   10610       285      37.3 *
459.GemsFDTD    10610       295      36.0 S   10610       285      37.2 S
465.tonto        9840       422      23.3 S    9840       400      24.6 *
465.tonto        9840       422      23.3 *    9840       399      24.7 S
465.tonto        9840       425      23.2 S    9840       401      24.5 S
470.lbm         13740       430      31.9 S   13740       132       104 S
470.lbm         13740       432      31.8 S   13740       132       104 *
470.lbm         13740       430      31.9 *   13740       132       104 S
481.wrf         11170       271      41.2 S   11170       271      41.2 S
481.wrf         11170       270      41.4 S   11170       270      41.4 S
481.wrf         11170       271      41.3 *   11170       271      41.3 *
482.sphinx3     19490       696      28.0 S   19490       659      29.6 S
482.sphinx3     19490       698      27.9 *   19490       657      29.7 *
482.sphinx3     19490       700      27.9 S   19490       656      29.7 S
==============================================================================
410.bwaves      13590       170      79.7 *   13590       167      81.5 *
416.gamess      19580      1084      18.1 *   19580       991      19.8 *
433.milc         9180       457      20.1 *    9180       329      27.9 *
434.zeusmp       9100       211      43.2 *    9100       204      44.7 *
435.gromacs      7140       472      15.1 *    7140       365      19.6 *
436.cactusADM   11950       127      94.2 *   11950      79.8       150 *
437.leslie3d     9400       432      21.8 *    9400       410      22.9 *
444.namd         8020       554      14.5 *    8020       504      15.9 *
447.dealII      11440       416      27.5 *   11440       358      31.9 *
450.soplex       8340       501      16.6 *    8340       430      19.4 *
453.povray       5320       254      21.0 *    5320       245      21.7 *
454.calculix     8250       324      25.4 *    8250       305      27.1 *
459.GemsFDTD    10610       294      36.0 *   10610       285      37.3 *
465.tonto        9840       422      23.3 *    9840       400      24.6 *
470.lbm         13740       430      31.9 *   13740       132       104 *
481.wrf         11170       271      41.3 *   11170       271      41.3 *
482.sphinx3     19490       698      27.9 *   19490       657      29.7 *
 SPECfp(R)_base2006                  28.1
 SPECfp2006                                                        33.6


                                   HARDWARE
                                   --------
            CPU Name: AMD Opteron 4184
 CPU Characteristics:
             CPU MHz: 2800
                 FPU: Integrated
      CPU(s) enabled: 6 cores, 1 chip, 6 cores/chip
    CPU(s) orderable: 1,2 chips
       Primary Cache: 64 KB I + 64 KB D on chip per core
     Secondary Cache: 512 KB I+D on chip per core
            L3 Cache: 6 MB I+D on chip per chip
         Other Cache: None
              Memory: 16 GB (2 x 8 GB 2Rx4 PC3-10600R-9, ECC)
      Disk Subsystem: 1 x 128 GB SATA SSD Crucial RealSSD C300
                      CTFDDAC128MAG-1G1
      Other Hardware: None


                                   SOFTWARE
                                   --------
    Operating System: SUSE Linux Enterprise Server 11 (x86_64), Kernel
                      2.6.27.19-5-default
            Compiler: x86 Open64 4.2.3.2 Compiler Suite (from AMD)
       Auto Parallel: Yes
         File System: ext3
        System State: Run level 3 (Full multiuser with network)
       Base Pointers: 64-bit
       Peak Pointers: 32/64-bit
      Other Software: None


                                 Submit Notes
                                 ------------
    The config file option 'submit' was used.
    'numactl' was used to bind copies to the cores.
    See the configuration file for details.

                            Operating System Notes
                            ----------------------
    'ulimit -s unlimited' was used to set environment stack size
    'ulimit -l 2097152' was used to set environment locked pages in memory limit

    Set vm/nr_hugepages=1000 in /etc/sysctl.conf
    mount -t hugetlbfs nodev /mnt/hugepages

    powersave -f was used to set the CPU frequency to its maximum.
    Binaries were compiled on SLES10 SP2 with binutils 2.18

                                 General Notes
                                 -------------
    Environment variables set by runspec before the start of the run:
    LD_LIBRARY_PATH = "/root/work/cpu2006/amd1002-speed-libs-revA/64:/root/work/cpu2006/amd1002-speed-libs-revA/32"
    O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5"
    O64_OMP_SPIN_USER_LOCK = "true"

    The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
    http://developer.amd.com/cpu/open64

                           Base Compiler Invocation
                           ------------------------
C benchmarks:
     opencc

C++ benchmarks:
     openCC

Fortran benchmarks:
     openf95

Benchmarks using both Fortran and C:
     opencc openf95


                            Base Portability Flags
                            ----------------------
    410.bwaves: -DSPEC_CPU_LP64
    416.gamess: -DSPEC_CPU_LP64
      433.milc: -DSPEC_CPU_LP64
    434.zeusmp: -DSPEC_CPU_LP64
   435.gromacs: -DSPEC_CPU_LP64
 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore
  437.leslie3d: -DSPEC_CPU_LP64
      444.namd: -DSPEC_CPU_LP64
    447.dealII: -DSPEC_CPU_LP64
    450.soplex: -DSPEC_CPU_LP64
    453.povray: -DSPEC_CPU_LP64
  454.calculix: -DSPEC_CPU_LP64
  459.GemsFDTD: -DSPEC_CPU_LP64
     465.tonto: -DSPEC_CPU_LP64
       470.lbm: -DSPEC_CPU_LP64
       481.wrf: -DSPEC_CPU_LP64
                -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore
   482.sphinx3: -DSPEC_CPU_LP64


                            Base Optimization Flags
                            -----------------------
C benchmarks:
     -march=barcelona -Ofast -HP:bdt=2m:heap=2m

C++ benchmarks:
     -march=barcelona -Ofast -static -INLINE:aggressive=on -HP:bdt=2m:heap=2m

Fortran benchmarks:
     -march=barcelona -Ofast -apo -LNO:parallel_overhead=10000
     -LNO:fusion_peeling_limit=0 -HP:bdt=2m:heap=2m

Benchmarks using both Fortran and C:
     -march=barcelona -Ofast -HP:bdt=2m:heap=2m -apo
     -LNO:parallel_overhead=10000 -LNO:fusion_peeling_limit=0


                           Peak Compiler Invocation
                           ------------------------
C benchmarks:
     opencc

C++ benchmarks:
     openCC

Fortran benchmarks:
     openf95

Benchmarks using both Fortran and C:
     opencc openf95


                            Peak Portability Flags
                            ----------------------
    410.bwaves: -DSPEC_CPU_LP64
    416.gamess: -DSPEC_CPU_LP64
      433.milc: -DSPEC_CPU_LP64
    434.zeusmp: -DSPEC_CPU_LP64
   435.gromacs: -DSPEC_CPU_LP64
 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore
  437.leslie3d: -DSPEC_CPU_LP64
      444.namd: -DSPEC_CPU_LP64
    453.povray: -DSPEC_CPU_LP64
  454.calculix: -DSPEC_CPU_LP64
  459.GemsFDTD: -DSPEC_CPU_LP64
     465.tonto: -DSPEC_CPU_LP64
       470.lbm: -DSPEC_CPU_LP64
       481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG
                -fno-second-underscore
   482.sphinx3: -DSPEC_CPU_LP64


                            Peak Optimization Flags
                            -----------------------
C benchmarks:

      433.milc: -march=barcelona -Ofast -apo -CG:movnti=1
                -CG:local_sched_alg=1 -CG:locs_shallow_depth=1
                -CG:compute_to=on -HP:bdt=2m:heap=2m -LNO:prefetch=3

       470.lbm: -march=barcelona -Ofast -mso -apo -CG:sse_cse_regs=0
                -LNO:prefetch_ahead=4 -CG:locs_shallow_depth=1
                -CG:cmp_peep=on -CG:compute_to=on -OPT:unroll_times_max=8
                -OPT:unroll_size=256 -OPT:unroll_level=2 -OPT:keep_ext=on
                -OPT:alias=restricted -m3dnow -IPA:inline=off

   482.sphinx3: -march=barcelona -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -OPT:malloc_alg=2
                -CG:sse_cse_regs=0 -CG:locs_shallow_depth=1 -CG:cmp_peep=on
                -CG:local_sched_alg=1 -INLINE:aggressive=on

C++ benchmarks:

      444.namd:
                -march=barcelona -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -LNO:ignore_feedback=off
                -CG:local_sched_alg=2 -CG:load_exe=0 -CG:compute_to=on
                -OPT:unroll_size=256 -fno-exceptions -HP:bdt=2m:heap=2m

    447.dealII: -march=barcelona -Ofast -static -INLINE:aggressive=on
                -LNO:opt=0 -fno-emit-exceptions -m32
                -OPT:unroll_times_max=8 -OPT:unroll_size=256
                -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on
                -CG:cmp_peep=on -TENV:frame_pointer=off

    450.soplex: -march=barcelona -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -O3 -INLINE:aggressive=on
                -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off
                -OPT:fold_unsigned_relops=on -CG:load_exe=0 -fno-exceptions
                -m32 -HP:bdt=2m:heap=2m

    453.povray: -march=barcelona -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -INLINE:aggressive=on
                -HP:bdt=2m:heap=2m

Fortran benchmarks:

    410.bwaves: -march=barcelona -Ofast -apo -OPT:malloc_alg=2
                -CG:use_prefetchnta=on -CG:cmp_peep=on -LNO:blocking=off
                -LNO:prefetch=3 -LNO:prefetch_ahead=5
                -LNO:ignore_feedback=off -LNO:apo_use_feedback=on
                -WOPT:aggstr=0

    416.gamess: -march=barcelona -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -O3 -LNO:fu=6 -LNO:blocking=0
                -LNO:prefetch=0 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256
                -HP:bdt=2m:heap=2m

    434.zeusmp: -march=barcelona -Ofast -apo -LNO:blocking=off
                -LNO:interchange=off -LNO:fusion_peeling_limit=0
                -OPT:treeheight=on -OPT:unroll_size=256 -CG:cmp_peep=on
                -CG:compute_to=on -GRA:prioritize_by_density=on
                -HP:bdt=2m:heap=2m

  437.leslie3d: -march=barcelona -Ofast -apo -OPT:unroll_size=256
                -LNO:prefetch_ahead=4 -LNO:parallel_overhead=32768
                -GRA:prioritize_by_density=on -m3dnow -HP:bdt=2m:heap=2m

  459.GemsFDTD: -march=barcelona -Ofast -apo -LNO:fission=2
                -LNO:prefetch_ahead=1 -CG:load_exe=0 -CG:local_sched_alg=1
                -HP

     465.tonto: -march=barcelona -Ofast -apo
                -OPT:alias=no_f90_pointer_alias -LNO:blocking=off
                -CG:load_exe=1 -IPA:plimit=525 -HP

Benchmarks using both Fortran and C:

   435.gromacs: -march=barcelona -Ofast -apo -OPT:rsqrt=2
                -HP:bdt=2m:heap=2m

 436.cactusADM: -march=barcelona -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -apo
                -LANG:heap_allocation_threshold=1000 -LNO:prefetch_ahead=1
                -HP:bdt=2m:heap=2m

  454.calculix: -march=barcelona -Ofast -LNO:prefetch_ahead=30
                -CG:load_exe=0 -CG:ptr_load_use=0 -CG:local_sched_alg=2
                -CG:compute_to=on -WOPT:unroll=2 -GRA:optimize_boundary=on
                -HP:bdt=2m:heap=2m -apo

       481.wrf: basepeak = yes


The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.html
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.html

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.xml
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.xml

SPEC and SPECfp are registered trademarks of the Standard Performance
Evaluation Corporation.  All other brand and product names appearing
in this result are trademarks or registered trademarks of their
respective holders.
-----------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2006-2014 Standard Performance Evaluation Corporation
Tested with SPEC CPU2006 v1.1.
Report generated on Wed Jul 23 15:20:18 2014 by CPU2006 ASCII formatter v6932.
Originally published on 3 February 2011.
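
Cross-check of the composite scores: SPEC CPU2006 defines SPECfp(R)_base2006 and SPECfp2006 as the geometric mean of the selected (starred, median) ratio of each of the 17 benchmarks. The sketch below recomputes both values in Python from the ratios in the summary rows of this report. The names BASE_RATIOS, PEAK_RATIOS and geometric_mean are illustrative and are not part of the SPEC tools. Because the inputs are already rounded to three significant digits, the recomputed values are expected to agree with the reported 28.1 and 33.6 only approximately.

```python
import math

# Selected (starred) ratios copied from the summary rows of this report.
BASE_RATIOS = {
    "410.bwaves": 79.7, "416.gamess": 18.1, "433.milc": 20.1,
    "434.zeusmp": 43.2, "435.gromacs": 15.1, "436.cactusADM": 94.2,
    "437.leslie3d": 21.8, "444.namd": 14.5, "447.dealII": 27.5,
    "450.soplex": 16.6, "453.povray": 21.0, "454.calculix": 25.4,
    "459.GemsFDTD": 36.0, "465.tonto": 23.3, "470.lbm": 31.9,
    "481.wrf": 41.3, "482.sphinx3": 27.9,
}
PEAK_RATIOS = {
    "410.bwaves": 81.5, "416.gamess": 19.8, "433.milc": 27.9,
    "434.zeusmp": 44.7, "435.gromacs": 19.6, "436.cactusADM": 150.0,
    "437.leslie3d": 22.9, "444.namd": 15.9, "447.dealII": 31.9,
    "450.soplex": 19.4, "453.povray": 21.7, "454.calculix": 27.1,
    "459.GemsFDTD": 37.3, "465.tonto": 24.6, "470.lbm": 104.0,
    "481.wrf": 41.3, "482.sphinx3": 29.7,
}


def geometric_mean(values):
    """Return the geometric mean of an iterable of positive numbers."""
    values = list(values)
    # Average the logarithms, then exponentiate; this avoids overflow
    # from multiplying many ratios together directly.
    return math.exp(sum(math.log(v) for v in values) / len(values))


base_score = geometric_mean(BASE_RATIOS.values())
peak_score = geometric_mean(PEAK_RATIOS.values())

if __name__ == "__main__":
    print(f"recomputed base: {base_score:.2f} (reported 28.1)")
    print(f"recomputed peak: {peak_score:.2f} (reported 33.6)")
```

481.wrf carries the same value in both dictionaries because the report lists it with basepeak = yes, so its base result is reused as its peak result.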