SPEC(R) MPIM2007 Summary IBM Corporation IBM BladeCenter JS22 Express (4 GHz, 8x4 core) Sat Oct 25 03:18:54 2008 MPI2007 License: 0005 Test date: Oct-2008 Test sponsor: IBM Corporation Hardware availability: Nov-2008 Tested by: IBM Corporation Software availability: Nov-2008 Base Base Base Peak Peak Peak Benchmarks Ranks Run Time Ratio Ranks Run Time Ratio -------------- ------ --------- --------- ------ --------- --------- 104.milc 64 426 3.68 S 64 426 3.68 S 104.milc 64 419 3.74 S 64 419 3.74 S 104.milc 64 423 3.70 * 64 423 3.70 * 107.leslie3d 64 880 5.93 S 64 877 5.95 * 107.leslie3d 64 904 5.78 S 64 882 5.92 S 107.leslie3d 64 903 5.78 * 64 874 5.97 S 113.GemsFDTD 64 818 7.72 * 64 818 7.72 * 113.GemsFDTD 64 814 7.75 S 64 814 7.75 S 113.GemsFDTD 64 823 7.66 S 64 823 7.66 S 115.fds4 64 499 3.91 S 64 486 4.01 * 115.fds4 64 506 3.86 S 64 490 3.98 S 115.fds4 64 500 3.90 * 64 470 4.15 S 121.pop2 64 957 4.31 S 64 957 4.31 S 121.pop2 64 968 4.26 S 64 968 4.26 S 121.pop2 64 963 4.28 * 64 963 4.28 * 122.tachyon 64 1034 2.71 S 64 1022 2.74 S 122.tachyon 64 1035 2.70 S 64 1012 2.76 * 122.tachyon 64 1034 2.70 * 64 1011 2.77 S 126.lammps 64 582 5.01 S 64 582 5.01 S 126.lammps 64 583 5.00 * 64 583 5.00 * 126.lammps 64 586 4.97 S 64 586 4.97 S 127.wrf2 64 1450 5.38 * 64 1018 7.66 * 127.wrf2 64 1453 5.36 S 64 1014 7.69 S 127.wrf2 64 1449 5.38 S 64 1019 7.65 S 128.GAPgeofem 64 353 5.85 S 64 353 5.85 S 128.GAPgeofem 64 338 6.10 S 64 338 6.10 S 128.GAPgeofem 64 349 5.91 * 64 349 5.91 * 129.tera_tf 64 1100 2.52 * 64 808 3.43 S 129.tera_tf 64 1101 2.51 S 64 807 3.43 * 129.tera_tf 64 1096 2.52 S 64 806 3.43 S 130.socorro 64 604 6.32 S 64 282 13.5 S 130.socorro 64 607 6.29 * 64 279 13.7 * 130.socorro 64 607 6.29 S 64 277 13.8 S 132.zeusmp2 64 641 4.84 S 64 641 4.84 S 132.zeusmp2 64 649 4.78 S 64 649 4.78 S 132.zeusmp2 64 645 4.81 * 64 645 4.81 * 137.lu 64 642 5.72 * 64 642 5.72 * 137.lu 64 627 5.86 S 64 627 5.86 S 137.lu 64 644 5.71 S 64 644 5.71 S ============================================================================== 104.milc 64 423 3.70 * 64 423 3.70 * 107.leslie3d 64 903 5.78 * 64 877 5.95 * 113.GemsFDTD 64 818 7.72 * 64 818 7.72 * 115.fds4 64 500 3.90 * 64 486 4.01 * 121.pop2 64 963 4.28 * 64 963 4.28 * 122.tachyon 64 1034 2.70 * 64 1012 2.76 * 126.lammps 64 583 5.00 * 64 583 5.00 * 127.wrf2 64 1450 5.38 * 64 1018 7.66 * 128.GAPgeofem 64 349 5.91 * 64 349 5.91 * 129.tera_tf 64 1100 2.52 * 64 807 3.43 * 130.socorro 64 607 6.29 * 64 279 13.7 * 132.zeusmp2 64 645 4.81 * 64 645 4.81 * 137.lu 64 642 5.72 * 64 642 5.72 * SPECmpiM_base2007 4.68 SPECmpiM_peak2007 5.26 BENCHMARK DETAILS ----------------- Type of System: Heterogeneous Total Compute Nodes: 8 Total Chips: 16 Total Cores: 32 Total Threads: 64 Total Memory: 144 GB Base Ranks Run: 64 Minimum Peak Ranks: 64 Maximum Peak Ranks: 64 C Compiler: IBM XL C/C++ Enterprise Edition V9 for AIX Updated with the September 2008 Fix level C++ Compiler: IBM XL C/C++ Enterprise Edition V9 for AIX Updated with the September 2008 Fix level Fortran Compiler: IBM XL Fortran Enterprise Edition V11.1 for AIX Updated with the September 2008 Fix level Base Pointers: 32-bit Peak Pointers: 32/64-bit MPI Library: IBM Parallel Environment for AIX, Version 5 Release 1 Other MPI Info: None Pre-processors: None Other Software: IBM Engineering and Scientific Subroutine Library (ESSL) for AIX Version 4 Release 3 Updated with PTF Set 3 Node Description: IBM System JS22 ================================= HARDWARE -------- Number of nodes: 1 Uses of the node: compute, head, fileserver Vendor: IBM Corporation Model: IBM System JS22 CPU Name: POWER6 CPU(s) orderable: 4 cores per blade Chips enabled: 2 Cores enabled: 4 Cores per chip: 2 Threads per core: 2 CPU Characteristics: CPU MHz: 4000 Primary Cache: 64 KB I + 64 KB D on chip per core Secondary Cache: 4 MB I+D on chip per core L3 Cache: None Other Cache: None Memory: 32 GB (4x8 GB) DDR2 500 MHz Disk Subsystem: 1x146 GB SAS 15K RPM Other Hardware: BladeCenter-H chassis Voltaire 4X InfiniBand Pass-thru Module (P/N 43W4419) Adapter: 4X InfiniBand DDR Expansion Card (CFFh) for IBM BladeCenter (P/N 43W4423) Number of Adapters: 1 Slot Type: PCIe x8 Gen2 Data Rate: 4x DDR 20Gbps Ports Used: 1 Interconnect Type: InfiniBand SOFTWARE -------- Adapter: 4X InfiniBand DDR Expansion Card (CFFh) for IBM BladeCenter (P/N 43W4423) Adapter Driver: devices.pciex.b3157862.rte 6.1.2.0 Adapter Firmware: 2.3.0 Operating System: IBM AIX V6.1 with the 6100-02 Technology Level Local File System: AIX/JFS2 Shared File System: NFSv3 System State: Multi-user Other Software: None General Notes ------------- Blade[1] runs the following commands to compose the cluster: mkdev -c management -s infiniband -t icm /usr/sbin/mkiba -a 192.1.10.1 -m 255.255.255.0 -i ib0 -A iba0 -p 1 -P 0xFFFF -M 65532 -q 4000 -k off -Q 0x1E -S up startsrc -s ctcas preprpnode mpiblade1 mkrpdomain mpiblades mpiblade1 mpiblade2 mpiblade3 mpiblade4 mpiblade5 mpiblade6 mpiblade7 mpiblade8 startrpdomain mpiblades cd /usr/lpp/ppe.poe/samples/nrt make chmod 4755 nrt_api shutdown -rF su spec cd mpiblades.64ranks.load ../nrt_api -l Node Description: IBM System JS22 ================================= HARDWARE -------- Number of nodes: 7 Uses of the node: compute Vendor: IBM Corporation Model: IBM System JS22 CPU Name: POWER6 CPU(s) orderable: 4 cores per blade Chips enabled: 2 Cores enabled: 4 Cores per chip: 2 Threads per core: 2 CPU Characteristics: CPU MHz: 4000 Primary Cache: 64 KB I + 64 KB D on chip per core Secondary Cache: 4 MB I+D on chip per core L3 Cache: None Other Cache: None Memory: 16 GB (4x4 GB) DDR2 667 MHz Disk Subsystem: 1x146 GB SAS 15K RPM Other Hardware: BladeCenter-H chassis Voltaire 4X InfiniBand Pass-thru Module (P/N 43W4419) Adapter: 4X InfiniBand DDR Expansion Card (CFFh) for IBM BladeCenter (P/N 43W4423) Number of Adapters: 1 Slot Type: PCIe x8 Gen2 Data Rate: 4x DDR 20Gbps Ports Used: 1 Interconnect Type: InfiniBand SOFTWARE -------- Adapter: 4X InfiniBand DDR Expansion Card (CFFh) for IBM BladeCenter (P/N 43W4423) Adapter Driver: devices.pciex.b3157862.rte 6.1.2.0 Adapter Firmware: 2.3.0 Operating System: IBM AIX V6.1 with the 6100-02 Technology Level Local File System: AIX/JFS2 Shared File System: NFSv3 System State: Multi-user Other Software: None General Notes ------------- Each blade runs the following commands to compose the cluster, where $CLUSTER_INDEX is 2-8 for Blade[2]-Blade[8]: mkdev -c management -s infiniband -t icm /usr/sbin/mkiba -a 192.1.10.$CLUSTER_INDEX -m 255.255.255.0 -i ib0 -A iba0 -p 1 -P 0xFFFF -M 65532 -q 4000 -k off -Q 0x1E -S up startsrc -s ctcas preprpnode mpiblade1 cd /usr/lpp/ppe.poe/samples/nrt make chmod 4755 nrt_api shutdown -rF su spec cd mpiblades.64ranks.load ../nrt_api -l Interconnect Description: InfiniBand ==================================== HARDWARE -------- Vendor: IBM Corporation Model: 4x DDR InfiniBand Switch Model: QLogic SilverStorm 9024 Number of Switches: 1 Number of Ports: 24 Data Rate: 4x DDR 20Gbps Firmware: 4.2.1.1.1 Topology: single switch Primary Use: MPI Communication Interconnect Description: Ethernet ================================== HARDWARE -------- Vendor: IBM Corporation Model: 4-port Gigabit Ethernet Switch Model: IBM BladeCenter 4-port Gigabit Ethernet switch module (P/N 26K6483) Number of Switches: 1 Number of Ports: 18 Data Rate: 1Gbps Firmware: 1.08 Topology: single switch Primary Use: File system Compiler Invocation Notes ------------------------- Blade[1], with 32GB of memory and 32GB of paging space, was used to compile the benchmarks. Submit Notes ------------ The config file option 'submit' was used. submit = poe task_stride.2level.32+64rank 4 2 8 $ranks $command -procs $ranks -hostfile /spec/MapFiles/ib0hosts.8x.1-8 General Notes ------------- Environment settings: All ulimits set to unlimited ranks = 64 CWD = /spec/mpi2007 MEMORY_AFFINITY = MCM XLFRTEOPTS = intrinthds=1 MP_PGMMODEL = spmd MP_MSG_API = mpi MP_DEVTYPE = ib MP_CLOCK_SOURCE = AIX MP_STDINMODE = none MP_SHARED_MEMORY = yes MP_SINGLE_THREAD = yes MP_EUILIB = us NRT_WINDOW_COUNT = 1 MP_RESD = no MP_PULSE = 0 ADAPTER_USE = shared EUIDEVICE = sn_single MP_CSS_INTERRUPT = no MP_BUFFER_MEM = 67108864 MP_USE_BULK_XFER = yes MP_BULK_MIN_MSG_SIZE = 8192 MP_EAGER_LIMIT = 65536 MP_WAIT_MODE = yield MP_INFOLEVEL = 0 MP_LABELIO = no MP_STDOUTMODE = unordered MP_PMDLOG = no NRT_JOB_KEY = 64 Compiler Invocation ------------------- C benchmarks: /usr/bin/mpcc_r C++ benchmarks: 126.lammps: /usr/bin/mpCC_r Fortran benchmarks: /usr/bin/mpxlf95_r Benchmarks using both Fortran and C: /usr/bin/mpcc_r /usr/bin/mpxlf95_r Portability Flags ----------------- 107.leslie3d: -qfixed 115.fds4: -DSPEC_MPI_LC_NO_TRAILING_UNDERSCORE -qfixed 121.pop2: -DSPEC_MPI_AIX 127.wrf2: -DNOUNDERSCORE -DSPEC_MPI_AIX 130.socorro: -DSPEC_NO_UNDERSCORE -qcpluscmt 132.zeusmp2: -qfixed -DSPEC_SINGLE_UNDERSCORE 137.lu: -qfixed Base Optimization Flags ----------------------- C benchmarks: -bmaxdata:0x80000000 -O5 -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K -btextpsize:64K C++ benchmarks: 126.lammps: -bmaxdata:0x80000000 -O5 Fortran benchmarks: -bmaxdata:0x80000000 -O4 -qstrict -qalias=nostd -qhot=level=0 -qsave -bdatapsize:64K -bstackpsize:64K -btextpsize:64K Benchmarks using both Fortran and C: -bmaxdata:0x80000000 -O5 -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K -btextpsize:64K -O4 -qstrict -qalias=nostd -qhot=level=0 -qsave Peak Optimization Flags ----------------------- C benchmarks: 104.milc: basepeak = yes 122.tachyon: -O5 -lessl -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K -btextpsize:64K -q64 C++ benchmarks: 126.lammps: basepeak = yes Fortran benchmarks: 107.leslie3d: -O5 -bdatapsize:64K -bstackpsize:64K -btextpsize:64K -bmaxdata:0x70000000 113.GemsFDTD: basepeak = yes 129.tera_tf: -O5 -qessl -lessl -bdatapsize:64K -bstackpsize:64K -btextpsize:64K 137.lu: basepeak = yes Benchmarks using both Fortran and C: 115.fds4: -O5 -lessl -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K -btextpsize:64K -qstrict -qalias=nostd -qhot=level=0 -qsave -q64 121.pop2: basepeak = yes 127.wrf2: -O5 -bmaxdata:0x80000000 128.GAPgeofem: basepeak = yes 130.socorro: -O5 -lessl -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K -btextpsize:64K -qessl -bmaxdata:0x80000000 132.zeusmp2: basepeak = yes Other Flags ----------- C benchmarks: -w -qsuppress=1500-036 -qipa=noobject -qipa=threads C++ benchmarks: 126.lammps: -w -qsuppress=1500-036 -qipa=noobject -qipa=threads Fortran benchmarks: -w -qsuppress=1500-036 -qsuppress=cmpmsg -qspillsize=32648 Benchmarks using both Fortran and C: -w -qsuppress=1500-036 -qipa=noobject -qipa=threads -qsuppress=cmpmsg -qspillsize=32648 The flags files that were used to format this result can be browsed at http://www.spec.org/mpi2007/flags/MPI2007_flags.20081105.html http://www.spec.org/mpi2007/flags/IBM-XL.html http://www.spec.org/mpi2007/flags/IBM-AIX.html You can also download the XML flags sources by saving the following links: http://www.spec.org/mpi2007/flags/MPI2007_flags.20081105.xml http://www.spec.org/mpi2007/flags/IBM-XL.xml http://www.spec.org/mpi2007/flags/IBM-AIX.xml SPEC and SPEC MPI are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2010 Standard Performance Evaluation Corporation Tested with SPEC MPI2007 v1.1. Report generated on Tue Jul 22 13:34:54 2014 by MPI2007 ASCII formatter v1463. Originally published on 5 November 2008.