SPEC(R) CFP2006 Summary Supermicro Supermicro A+ Server 1022G-NTF, AMD Opteron 6328 Test Sponsor: Advanced Micro Devices Sun Oct 7 22:46:04 2012 CPU2006 License: 49 Test date: Oct-2012 Test sponsor: Advanced Micro Devices Hardware availability: Nov-2012 Tested by: Advanced Micro Devices Software availability: Jun-2012 Base Base Base Peak Peak Peak Benchmarks Ref. Run Time Ratio Ref. Run Time Ratio -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 13590 53.9 252 S 13590 42.0 323 * 410.bwaves 13590 54.2 251 S 13590 42.2 322 S 410.bwaves 13590 54.2 251 * 13590 41.9 325 S 416.gamess 19580 813 24.1 S 19580 761 25.7 S 416.gamess 19580 818 23.9 S 19580 758 25.8 * 416.gamess 19580 817 24.0 * 19580 756 25.9 S 433.milc 9180 223 41.1 S 9180 197 46.5 * 433.milc 9180 224 41.1 S 9180 197 46.6 S 433.milc 9180 223 41.1 * 9180 198 46.4 S 434.zeusmp 9100 123 74.1 * 9100 112 81.4 S 434.zeusmp 9100 123 74.1 S 9100 111 81.7 S 434.zeusmp 9100 122 74.6 S 9100 111 81.7 * 435.gromacs 7140 284 25.2 S 7140 272 26.3 S 435.gromacs 7140 284 25.1 S 7140 272 26.3 S 435.gromacs 7140 284 25.2 * 7140 272 26.3 * 436.cactusADM 11950 70.9 169 * 11950 55.7 214 * 436.cactusADM 11950 70.4 170 S 11950 56.6 211 S 436.cactusADM 11950 73.5 163 S 11950 55.5 215 S 437.leslie3d 9400 313 30.1 S 9400 308 30.5 * 437.leslie3d 9400 313 30.0 S 9400 308 30.5 S 437.leslie3d 9400 313 30.0 * 9400 308 30.5 S 444.namd 8020 400 20.1 S 8020 388 20.7 * 444.namd 8020 400 20.1 S 8020 388 20.7 S 444.namd 8020 400 20.1 * 8020 388 20.7 S 447.dealII 11440 236 48.4 S 11440 216 53.0 S 447.dealII 11440 236 48.4 S 11440 216 53.0 S 447.dealII 11440 236 48.4 * 11440 216 53.0 * 450.soplex 8340 308 27.1 S 8340 288 29.0 S 450.soplex 8340 308 27.1 * 8340 288 28.9 * 450.soplex 8340 309 27.0 S 8340 289 28.9 S 453.povray 5320 195 27.3 S 5320 179 29.8 S 453.povray 5320 195 27.3 * 5320 178 29.9 S 453.povray 5320 195 27.3 S 5320 178 29.8 * 454.calculix 8250 221 37.3 S 8250 212 39.0 S 454.calculix 8250 222 37.1 S 8250 211 39.0 S 454.calculix 8250 221 37.3 * 8250 212 39.0 * 459.GemsFDTD 10610 264 40.2 S 10610 227 46.8 S 459.GemsFDTD 10610 264 40.3 * 10610 226 46.9 * 459.GemsFDTD 10610 264 40.3 S 10610 226 46.9 S 465.tonto 9840 355 27.7 S 9840 333 29.6 S 465.tonto 9840 355 27.7 * 9840 333 29.5 S 465.tonto 9840 355 27.7 S 9840 333 29.5 * 470.lbm 13740 146 94.4 * 13740 38.0 361 S 470.lbm 13740 145 94.7 S 13740 37.9 363 * 470.lbm 13740 146 94.1 S 13740 36.2 379 S 481.wrf 11170 227 49.1 S 11170 227 49.1 S 481.wrf 11170 228 49.1 * 11170 228 49.1 * 481.wrf 11170 228 49.0 S 11170 228 49.0 S 482.sphinx3 19490 702 27.8 * 19490 532 36.7 S 482.sphinx3 19490 702 27.8 S 19490 532 36.7 * 482.sphinx3 19490 701 27.8 S 19490 533 36.6 S ============================================================================== 410.bwaves 13590 54.2 251 * 13590 42.0 323 * 416.gamess 19580 817 24.0 * 19580 758 25.8 * 433.milc 9180 223 41.1 * 9180 197 46.5 * 434.zeusmp 9100 123 74.1 * 9100 111 81.7 * 435.gromacs 7140 284 25.2 * 7140 272 26.3 * 436.cactusADM 11950 70.9 169 * 11950 55.7 214 * 437.leslie3d 9400 313 30.0 * 9400 308 30.5 * 444.namd 8020 400 20.1 * 8020 388 20.7 * 447.dealII 11440 236 48.4 * 11440 216 53.0 * 450.soplex 8340 308 27.1 * 8340 288 28.9 * 453.povray 5320 195 27.3 * 5320 178 29.8 * 454.calculix 8250 221 37.3 * 8250 212 39.0 * 459.GemsFDTD 10610 264 40.3 * 10610 226 46.9 * 465.tonto 9840 355 27.7 * 9840 333 29.5 * 470.lbm 13740 146 94.4 * 13740 37.9 363 * 481.wrf 11170 228 49.1 * 11170 228 49.1 * 482.sphinx3 19490 702 27.8 * 19490 532 36.7 * SPECfp(R)_base2006 44.1 SPECfp2006 52.6 HARDWARE -------- CPU Name: AMD Opteron 6328 CPU Characteristics: AMD Turbo CORE technology up to 3.80 GHz CPU MHz: 3200 FPU: Integrated CPU(s) enabled: 16 cores, 2 chips, 8 cores/chip CPU(s) orderable: 1,2 chips Primary Cache: 256 KB I on chip per chip, 64 KB I shared / 2 cores; 16 KB D on chip per core Secondary Cache: 8 MB I+D on chip per chip, 2 MB shared / 2 cores L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 4 cores Other Cache: None Memory: 128 GB (16 x 8 GB 2Rx4 PC3-12800R-11, ECC) Disk Subsystem: 1 x 250 GB SATA, 7200 RPM Other Hardware: None SOFTWARE -------- Operating System: Red Hat Enterprise Linux Server release 6.3, Kernel 2.6.32-279.el6.x86_64 Compiler: C/C++/Fortran: Version 4.2.5.2 of x86 Open64 Compiler Suite (from AMD) Auto Parallel: Yes File System: ext3 System State: Run level 3 (Full multiuser with network) Base Pointers: 64-bit Peak Pointers: 32/64-bit Other Software: None Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst cpuspeed stop was used to set the CPU frequency to its maximum. Set vm/nr_hugepages=4000 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages General Notes ------------- Environment variables set by runspec before the start of the run: HUGETLB_LIMIT = "4000" LD_LIBRARY_PATH = "/root/work/cpu2006v1.2/amd1104-speed-libs-revA/32:/root/work/cpu2006v1.2/amd1104-speed-libs-revA/64" O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15" O64_OMP_SPIN_COUNT = "800000" O64_OMP_SPIN_USER_LOCK = "true" The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64 Binaries were compiled on a system with 2x AMD Opteron 6220 chips + 64GB Memory using RHEL 6.1 Base Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Base Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 450.soplex: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Base Optimization Flags ----------------------- C benchmarks: -march=bdver1 -Ofast -HP:bdt=2m:heap=2m -apo -mso -OPT:alias=restricted -OPT:malloc_alg=2 -LNO:parallel_overhead=10000 C++ benchmarks: -march=bdver1 -Ofast -static -CG:load_exe=0 -CG:p2align=0 -INLINE:aggressive=on -HP:bdt=2m:heap=2m -D__OPEN64_FAST_SET Fortran benchmarks: -march=bdver1 -Ofast -LNO:blocking=off -LNO:fusion_peeling_limit=0 -LNO:parallel_overhead=10000 -OPT:rsqrt=2 -OPT:unroll_size=256 -HP:bdt=2m:heap=2m -apo Benchmarks using both Fortran and C: -march=bdver1 -Ofast -HP:bdt=2m:heap=2m -apo -mso -OPT:alias=restricted -OPT:malloc_alg=2 -LNO:parallel_overhead=10000 -LNO:blocking=off -LNO:fusion_peeling_limit=0 -OPT:rsqrt=2 -OPT:unroll_size=256 Peak Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Peak Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Peak Optimization Flags ----------------------- C benchmarks: 433.milc: -march=bdver1 -Ofast -CG:movnti=1 -CG:locs_best=on -HP:bdt=2m:heap=2m -IPA:plimit=7000 -IPA:callee_limit=1200 -OPT:struct_array_copy=2 -OPT:alias=field_sensitive 470.lbm: -march=bdver1 -Ofast -mso -apo -CG:sse_cse_regs=0 -LNO:prefetch_ahead=4 -CG:locs_shallow_depth=1 -CG:cmp_peep=on -CG:compute_to=on -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -OPT:keep_ext=on -OPT:alias=restricted -m3dnow -IPA:inline=off 482.sphinx3: -march=bdver1 -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:loop_model_simd=on -LNO:simd_rm_unity_remainder=on -OPT:malloc_alg=2 -CG:cmp_peep=on -CG:local_sched_alg=2 -CG:use_incdec=off -INLINE:aggressive=on -WOPT:sib=on -HP C++ benchmarks: 444.namd: -march=bdver1 -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:ignore_feedback=off -CG:local_sched_alg=2 -CG:load_exe=0 -OPT:unroll_size=256 -fno-exceptions -HP:bdt=2m:heap=2m 447.dealII: -march=bdver1 -Ofast -LNO:simd=0 -D__OPEN64_FAST_SET -static -INLINE:aggressive=on -OPT:alias=disjoint -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m 450.soplex: -march=bdver1 -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -INLINE:aggressive=on -OPT:RO=1 -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off -OPT:fold_unsigned_relops=on -fno-exceptions -CG:p2align=0 -m32 -HP:bdt=2m:heap=2m -WOPT:sib=on 453.povray: -march=bdver1 -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -CG:pre_local_sched=off -INLINE:aggressive=on -HP:bdt=2m:heap=2m -OPT:transform=2 -OPT:alias=disjoint -WOPT:aggcm=0 Fortran benchmarks: 410.bwaves: -march=bdver1 -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -apo -OPT:Ofast -OPT:treeheight=on -LNO:blocking=off -LNO:prefetch=2 -LNO:pf2=0 -LNO:prefetch_ahead=3 -LNO:ignore_feedback=off -LNO:fu=4 -LNO:loop_model_simd=on -LNO:simd_rm_unity_remainder=on -WOPT:aggstr=0 -HP:bdt=2m:heap=2m -CG:cmp_peep=on -CG:p2align=0 416.gamess: -march=bdver1 -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -LNO:fu=6 -LNO:blocking=0 -LNO:simd=0 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 -OPT:unroll_times_max=2 -CG:local_sched_alg=1 -HP:bdt=2m:heap=2m -WOPT:sib=on 434.zeusmp: -march=bdver1 -Ofast -apo -LNO:blocking=off -LNO:interchange=off -LNO:fusion_peeling_limit=0 -OPT:treeheight=on -OPT:unroll_size=256 -CG:cmp_peep=on -CG:compute_to=on -GRA:prioritize_by_density=on -HP:bdt=2m:heap=2m 437.leslie3d: -march=bdver1 -Ofast -LNO:prefetch=2 -LNO:blocking=off -CG:interior_ptrs=on -OPT:unroll_size=256 -GRA:prioritize_by_density=on -HP:bdt=2m:heap=2m 459.GemsFDTD: -march=bdver1 -Ofast -OPT:unroll_size=0 -LNO:fission=2 -CG:load_exe=0 -CG:local_sched_alg=2 -HP -apo 465.tonto: -march=bdver1 -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -CG:local_sched_alg=1 -IPA:plimit=525 -HP Benchmarks using both Fortran and C: 435.gromacs: -march=bdver1 -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m 436.cactusADM: -march=bdver1 -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off -LNO:prefetch=2 -HP:bdt=2m:heap=2m -CG:locs_shallow_depth=1 -CG:load_exe=0 -WOPT:sib=on -apo 454.calculix: -march=bdver1 -Ofast -OPT:unroll_size=256 -GRA:optimize_boundary=on -HP:bdt=2m:heap=2m 481.wrf: basepeak = yes The flags files that were used to format this result can be browsed at http://www.spec.org/cpu2006/flags/x86-open64-425-flags-speed-revA-I.html http://www.spec.org/cpu2006/flags/amd-platform-speed-revA-I.html You can also download the XML flags sources by saving the following links: http://www.spec.org/cpu2006/flags/x86-open64-425-flags-speed-revA-I.xml http://www.spec.org/cpu2006/flags/amd-platform-speed-revA-I.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.2. Report generated on Thu Jul 24 14:58:42 2014 by CPU2006 ASCII formatter v6932. Originally published on 8 January 2013.