SPEC(R) ACCEL_OCL Summary ASUS NVIDIA Tesla K40c ASUS P9X79 Motherboard Test Sponsor: NVIDIA Corporation Fri Feb 21 18:59:05 2014 ACCEL License: 019 Test date: Feb-2014 Test sponsor: NVIDIA Corporation Hardware availability: Nov-2013 Tested by: NVIDIA Corporation Software availability: Feb-2014 Base Base Base Base Base Base Base Peak Peak Peak Peak Peak Peak Peak Benchmarks Ref. RunTime Ratio Energy MaxPwr AvgPwr ERatio Ref. RunTime Ratio Energy MaxPwr AvgPwr ERatio -------------- ------ ------- ------ ------ ------ ------ ------ ------ ------- ------ ------ ------ ------ ------ 101.tpacf 107 77.7 1.38 16.7 239 215 1.95 S 107 60.8 1.76 13.8 240 227 2.37 * 101.tpacf 107 67.7 1.58 15.3 241 225 2.14 * 107 63.7 1.68 14.9 262 234 2.19 S 101.tpacf 107 67.2 1.59 15.2 241 226 2.14 S 107 60.7 1.76 13.8 241 228 2.36 S 103.stencil 125 61.7 2.03 17.6 296 285 2.58 S 125 61.7 2.03 17.6 296 285 2.58 S 103.stencil 125 61.6 2.03 17.5 296 284 2.59 * 125 61.6 2.03 17.5 296 284 2.59 * 103.stencil 125 61.4 2.04 17.6 297 286 2.57 S 125 61.4 2.04 17.6 297 286 2.57 S 104.lbm 112 43.4 2.58 12.2 289 280 3.16 * 112 35.0 3.20 9.56 288 273 4.03 * 104.lbm 112 43.3 2.59 12.0 290 276 3.22 S 112 34.9 3.21 9.62 289 275 4.00 S 104.lbm 112 43.6 2.57 12.1 291 277 3.19 S 112 35.1 3.19 9.54 288 272 4.04 S 110.fft 111 76.3 1.45 22.8 309 299 1.80 S 111 76.3 1.45 22.8 309 299 1.80 S 110.fft 111 76.0 1.46 22.7 309 299 1.81 S 111 76.0 1.46 22.7 309 299 1.81 S 110.fft 111 76.0 1.46 22.9 316 302 1.79 * 111 76.0 1.46 22.9 316 302 1.79 * 112.spmv 147 78.8 1.87 21.5 293 273 2.44 S 147 76.4 1.92 21.1 303 276 2.50 * 112.spmv 147 79.0 1.86 21.8 293 276 2.41 * 147 76.3 1.93 21.1 294 277 2.49 S 112.spmv 147 79.4 1.85 21.8 289 275 2.42 S 147 76.7 1.92 21.1 292 275 2.50 S 114.mriq 109 33.1 3.30 8.62 271 261 4.19 S 109 33.1 3.30 8.62 271 261 4.19 S 114.mriq 109 33.2 3.28 8.49 271 256 4.25 * 109 33.2 3.28 8.49 271 256 4.25 * 114.mriq 109 33.3 3.28 8.63 271 259 4.18 S 109 33.3 3.28 8.63 271 259 4.18 S 116.histo 114 81.2 1.40 16.0 215 198 1.94 S 114 81.2 1.40 16.0 215 198 1.94 S 116.histo 114 80.8 1.41 16.0 216 198 1.95 * 114 80.8 1.41 16.0 216 198 1.95 * 116.histo 114 75.3 1.51 15.0 217 199 2.08 S 114 75.3 1.51 15.0 217 199 2.08 S 117.bfs 117 59.8 1.96 14.8 266 248 2.57 S 117 42.6 2.75 10.3 262 242 3.69 S 117.bfs 117 59.2 1.98 14.7 266 248 2.59 * 117 42.5 2.75 10.2 264 241 3.71 * 117.bfs 117 59.1 1.98 14.6 266 247 2.60 S 117 42.5 2.75 10.3 263 242 3.70 S 118.cutcp 99 34.3 2.89 8.93 274 260 3.71 S 99 34.3 2.89 8.93 274 260 3.71 S 118.cutcp 99 34.4 2.88 9.01 273 262 3.68 * 99 34.4 2.88 9.01 273 262 3.68 * 118.cutcp 99 34.4 2.88 9.02 274 262 3.68 S 99 34.4 2.88 9.02 274 262 3.68 S 120.kmeans 100 90.1 1.11 18.0 211 199 1.50 * 100 86.1 1.16 17.1 206 199 1.58 S 120.kmeans 100 89.0 1.12 17.7 206 199 1.52 S 100 84.7 1.18 16.9 208 200 1.60 S 120.kmeans 100 91.0 1.10 18.1 205 199 1.49 S 100 85.9 1.16 17.1 207 199 1.58 * 121.lavamd 109 60.2 1.81 17.3 307 288 2.28 * 109 60.2 1.81 17.3 307 288 2.28 * 121.lavamd 109 60.3 1.81 17.4 308 289 2.27 S 109 60.3 1.81 17.4 308 289 2.27 S 121.lavamd 109 59.9 1.82 17.5 309 292 2.25 S 109 59.9 1.82 17.5 309 292 2.25 S 122.cfd 126 73.5 1.71 19.2 268 261 2.25 S 126 72.3 1.74 18.6 267 258 2.31 S 122.cfd 126 73.3 1.72 18.9 268 259 2.27 S 126 72.2 1.75 18.6 268 257 2.32 * 122.cfd 126 73.3 1.72 19.1 273 260 2.26 * 126 72.1 1.75 18.8 269 261 2.29 S 123.nw 115 69.8 1.65 16.0 237 229 2.26 * 115 69.8 1.65 16.0 237 229 2.26 * 123.nw 115 69.7 1.65 15.9 237 229 2.26 S 115 69.7 1.65 15.9 237 229 2.26 S 123.nw 115 70.2 1.64 16.1 237 230 2.24 S 115 70.2 1.64 16.1 237 230 2.24 S 124.hotspot 114 38.8 2.94 10.9 302 282 3.45 S 114 38.8 2.94 10.9 302 282 3.45 S 124.hotspot 114 38.7 2.95 10.9 303 281 3.48 * 114 38.7 2.95 10.9 303 281 3.48 * 124.hotspot 114 38.6 2.95 10.9 302 282 3.47 S 114 38.6 2.95 10.9 302 282 3.47 S 125.lud 119 80.9 1.47 22.8 295 282 1.93 * 119 73.8 1.61 19.1 271 259 2.30 * 125.lud 119 81.1 1.47 22.8 294 281 1.93 S 119 73.8 1.61 19.1 271 259 2.30 S 125.lud 119 80.9 1.47 22.6 293 280 1.94 S 119 73.7 1.62 19.2 271 260 2.29 S 126.ge 155 54.1 2.86 14.3 280 265 3.74 * 155 8.98 17.3 2.04 284 227 26.3 S 126.ge 155 53.9 2.88 14.4 281 268 3.71 S 155 9.03 17.2 2.26 283 250 23.7 S 126.ge 155 54.2 2.86 14.5 281 267 3.71 S 155 9.02 17.2 2.28 285 253 23.5 * 127.srad 114 60.3 1.89 16.9 293 280 2.36 S 114 60.3 1.89 16.9 293 280 2.36 S 127.srad 114 60.8 1.87 16.9 293 278 2.36 S 114 60.8 1.87 16.9 293 278 2.36 S 127.srad 114 60.7 1.88 16.9 292 278 2.36 * 114 60.7 1.88 16.9 292 278 2.36 * 128.heartwall 106 88.0 1.20 21.7 255 247 1.66 * 106 88.0 1.20 21.7 255 247 1.66 * 128.heartwall 106 88.1 1.20 21.9 256 249 1.64 S 106 88.1 1.20 21.9 256 249 1.64 S 128.heartwall 106 87.7 1.21 21.7 254 248 1.66 S 106 87.7 1.21 21.7 254 248 1.66 S 140.bplustree 108 70.0 1.54 17.3 257 247 2.05 * 108 70.0 1.54 17.3 257 247 2.05 * 140.bplustree 108 70.1 1.54 17.3 265 246 2.05 S 108 70.1 1.54 17.3 265 246 2.05 S 140.bplustree 108 69.8 1.55 17.2 259 247 2.05 S 108 69.8 1.55 17.2 259 247 2.05 S ======================================================================================================================== 101.tpacf 107 67.7 1.58 15.3 241 225 2.14 * 107 60.8 1.76 13.8 240 227 2.37 * 103.stencil 125 61.6 2.03 17.5 296 284 2.59 * 125 61.6 2.03 17.5 296 284 2.59 * 104.lbm 112 43.4 2.58 12.2 289 280 3.16 * 112 35.0 3.20 9.56 288 273 4.03 * 110.fft 111 76.0 1.46 22.9 316 302 1.79 * 111 76.0 1.46 22.9 316 302 1.79 * 112.spmv 147 79.0 1.86 21.8 293 276 2.41 * 147 76.4 1.92 21.1 303 276 2.50 * 114.mriq 109 33.2 3.28 8.49 271 256 4.25 * 109 33.2 3.28 8.49 271 256 4.25 * 116.histo 114 80.8 1.41 16.0 216 198 1.95 * 114 80.8 1.41 16.0 216 198 1.95 * 117.bfs 117 59.2 1.98 14.7 266 248 2.59 * 117 42.5 2.75 10.2 264 241 3.71 * 118.cutcp 99 34.4 2.88 9.01 273 262 3.68 * 99 34.4 2.88 9.01 273 262 3.68 * 120.kmeans 100 90.1 1.11 18.0 211 199 1.50 * 100 85.9 1.16 17.1 207 199 1.58 * 121.lavamd 109 60.2 1.81 17.3 307 288 2.28 * 109 60.2 1.81 17.3 307 288 2.28 * 122.cfd 126 73.3 1.72 19.1 273 260 2.26 * 126 72.2 1.75 18.6 268 257 2.32 * 123.nw 115 69.8 1.65 16.0 237 229 2.26 * 115 69.8 1.65 16.0 237 229 2.26 * 124.hotspot 114 38.7 2.95 10.9 303 281 3.48 * 114 38.7 2.95 10.9 303 281 3.48 * 125.lud 119 80.9 1.47 22.8 295 282 1.93 * 119 73.8 1.61 19.1 271 259 2.30 * 126.ge 155 54.1 2.86 14.3 280 265 3.74 * 155 9.02 17.2 2.28 285 253 23.5 * 127.srad 114 60.7 1.88 16.9 292 278 2.36 * 114 60.7 1.88 16.9 292 278 2.36 * 128.heartwall 106 88.0 1.20 21.7 255 247 1.66 * 106 88.0 1.20 21.7 255 247 1.66 * 140.bplustree 108 70.0 1.54 17.3 257 247 2.05 * 108 70.0 1.54 17.3 257 247 2.05 * SPECaccel_ocl_energy_base 2.43 SPECaccel_ocl_base 1.87 SPECaccel_ocl_energy_peak 2.82 SPECaccel_ocl_peak 2.15 HARDWARE -------- CPU Name: Intel Core i7-3930K CPU Characteristics: CPU MHz: 3200 CPU MHz Maximum: 3800 FPU: Integrated CPU(s) enabled: 6 cores, 1 chip, 6 cores/chip, 2 threads/core CPU(s) orderable: 1 chip Primary Cache: 32 KB I + 32 KB D on chip per core Secondary Cache: 256 KB I+D on chip per core L3 Cache: 12 MB I+D on chip per chip Other Cache: None Memory: 8 GB (2 x 4 GB 2Rx4 PC3-14900R-9, running at 1600 MHz) Disk Subsystem: 1000 GB Seagate ST1000DM003 7200 RPM SATA Other Hardware: None ACCELERATOR ----------- Accel Model Name: Tesla K40c Accel Vendor: NVIDIA Accel Name: NVIDIA Tesla K40c Type of Accel: GPU Accel Connection: PCIe 3.0 16x Does Accel Use ECC: Yes Accel Description: See Notes Accel Driver: NVIDIA UNIX x86_64 Kernel Module 319.60 SOFTWARE -------- Operating System: Red Hat Enterprise Linux Server release 6.4 (Santiago) 2.6.32-358.el6.x86_64 Compiler: PGI Accelerator Server Complete, Release 14.2 File System: ext4 System State: Run level 3 (multi-user) Other Software: CUDA 5.5 SDK POWER ----- Power Supply: 1200 W Power Supply Details: Thermaltake SMART M1200W Max. Power (W): 315.92 Idle Power (W): 110.6 Min. Temperature (C): 25.81 POWER ANALYZER -------------- Power Analyzer: Power Analyzer Hardware Vendor: Xitron Technologies, Inc. Model: 2801 Serial Number: 28011109005 Input Connection: RS232 via USB-adapter Metrology Institute: NIST Calibration By: Micro Precision Calibration, Inc. Calibration Label: 220081222038459 Calibration Date: 02.20.2014 PTDaemon Version: 1.6.2 (372e138a; 2013-12-04) Setup Description: connected to the single power supply that powers the system Current Ranges Used: 2.0A Voltage Range Used: 135V TEMPERATURE METER ----------------- Temperature Meter: Temperature Meter Hardware Vendor: Digi Model: DigiWATCHPORT_H Serial Number: WS34682143 Input Connection: USB PTDaemon Version: 1.6.2 (372e138a; 2013-12-04) Setup Description: Position 5mm above intake fan Platform Notes -------------- Sysinfo program /local/home/SPECACCEL/Docs/sysinfo $Rev: 6874 $ $Date:: 2013-11-20 #$ 0953404ef7e75a5f9bbb534c6de3f831 running on sbe02 Fri Feb 21 15:59:07 2014 This section contains SUT (System Under Test) info as seen by some common utilities. To remove or add to this section, see: http://www.spec.org/accel/Docs/config.html#sysinfo From /proc/cpuinfo model name : Intel(R) Core(TM) i7-3930K CPU @ 3.20GHz 1 "physical id"s (chips) 12 "processors" cores, siblings (Caution: counting these is hw and system dependent. The following excerpts from /proc/cpuinfo might not be reliable. Use with caution.) cpu cores : 6 siblings : 12 physical 0: cores 0 1 2 3 4 5 cache size : 12288 KB From /proc/meminfo MemTotal: 8130700 kB HugePages_Total: 0 Hugepagesize: 2048 kB /usr/bin/lsb_release -d Red Hat Enterprise Linux Server release 6.4 (Santiago) From /etc/*release* /etc/*version* redhat-release: Red Hat Enterprise Linux Server release 6.4 (Santiago) system-release: Red Hat Enterprise Linux Server release 6.4 (Santiago) system-release-cpe: cpe:/o:redhat:enterprise_linux:6server:ga:server uname -a: Linux sbe02 2.6.32-358.el6.x86_64 #1 SMP Tue Jan 29 11:47:41 EST 2013 x86_64 x86_64 x86_64 GNU/Linux run-level 3 Feb 21 13:04 SPEC is set to: /local/home/SPECACCEL Filesystem Type Size Used Avail Use% Mounted on /dev/mapper/VolGroup-lv_home ext4 860G 48G 769G 6% /local Additional information from dmidecode: Warning: Use caution when you interpret this section. The 'dmidecode' program reads system data which is "intended to allow hardware to be accurately determined", but the intent may not be met, as there are frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard. (End of data from sysinfo program) Information from pgaccelinfo CUDA Driver Version: 5050 NVRM version: NVIDIA UNIX x86_64 Kernel Module 319.60 Wed Sep 25 14:28:26 PDT 2013 Device Number: 0 Device Name: Tesla K40c Device Revision Number: 3.5 Global Memory Size: 12079136768 Number of Multiprocessors: 15 Number of SP Cores: 2880 Number of DP Cores: 960 Concurrent Copy and Execution: Yes Total Constant Memory: 65536 Total Shared Memory per Block: 49152 Registers per Block: 65536 Warp Size: 32 Maximum Threads per Block: 1024 Maximum Block Dimensions: 1024, 1024, 64 Maximum Grid Dimensions: 2147483647 x 65535 x 65535 Maximum Memory Pitch: 2147483647B Texture Alignment: 512B Clock Rate: 745 MHz Max. Clock Rate: 875 MHz Execution Timeout: No Integrated Device: No Can Map Host Memory: Yes Compute Mode: default Concurrent Kernels: Yes ECC Enabled: Yes Memory Clock Rate: 3004 MHz Memory Bus Width: 384 bits L2 Cache Size: 1572864 bytes Max Threads Per SMP: 2048 Async Engines: 2 Unified Addressing: Yes General Notes ------------- Kit built system using a CoolMaster HAF X case Base Runtime Environment ------------------------ C benchmarks: OpenCL Platform: NVIDIA CUDA, OpenCL 1.1 CUDA 4.2.1 OpenCL Device #0: Tesla K40c, v 319.60 C++ benchmarks: OpenCL Platform: NVIDIA CUDA, OpenCL 1.1 CUDA 4.2.1 OpenCL Device #0: Tesla K40c, v 319.60 Base Compiler Invocation ------------------------ C benchmarks: pgcc C++ benchmarks: pgc++ Base Portability Flags ---------------------- 118.cutcp: -D__GNUC__ Base Optimization Flags ----------------------- C benchmarks: -fast -Mfprelaxed C++ benchmarks: -fast -Mfprelaxed Base Other Flags ---------------- C benchmarks: -I/opt/cuda-5.5/include/ -lOpenCL C++ benchmarks: -I/opt/cuda-5.5/include/ -lOpenCL Peak Runtime Environment ------------------------ C benchmarks: OpenCL Platform: NVIDIA CUDA, OpenCL 1.1 CUDA 4.2.1 OpenCL Device #0: Tesla K40c, v 319.60 C++ benchmarks: OpenCL Platform: NVIDIA CUDA, OpenCL 1.1 CUDA 4.2.1 OpenCL Device #0: Tesla K40c, v 319.60 Peak Compiler Invocation ------------------------ C benchmarks: pgcc C++ benchmarks: pgc++ Peak Portability Flags ---------------------- 118.cutcp: -D__GNUC__ Peak Optimization Flags ----------------------- C benchmarks: 110.fft: basepeak = yes 114.mriq: basepeak = yes 116.histo: basepeak = yes 117.bfs: -fast -Mfprelaxed -DSPEC_ACCEL_WG_SIZE_0_0=64 -DSPEC_ACCEL_WG_SIZE_1_0=64 118.cutcp: basepeak = yes 121.lavamd: basepeak = yes 124.hotspot: basepeak = yes 127.srad: basepeak = yes 128.heartwall: basepeak = yes 140.bplustree: basepeak = yes C++ benchmarks: 101.tpacf: -fast -Mfprelaxed -DSPEC_ACCEL_WG_SIZE_0_0=1024 103.stencil: basepeak = yes 104.lbm: -fast -Mfprelaxed -DSPEC_ACCEL_WG_SIZE_0_0=32 -DSPEC_ACCEL_WG_SIZE_0_1=1 -DSPEC_ACCEL_WG_SIZE_0_2=1 112.spmv: -fast -Mfprelaxed -DSPEC_ACCEL_WG_SIZE_0_0=96 120.kmeans: -fast -Mfprelaxed -DSPEC_ACCEL_WG_SIZE_0_0=288 122.cfd: -fast -Mfprelaxed -DSPEC_ACCEL_WG_SIZE_3_0=288 123.nw: basepeak = yes 125.lud: -fast -Mfprelaxed -DSPEC_ACCEL_WG_SIZE_0_0=32 126.ge: -fast -Mfprelaxed -DSPEC_ACCEL_WG_SIZE_0_0=512 -DSPEC_ACCEL_WG_SIZE_1_0=1 -DSPEC_ACCEL_WG_SIZE_1_1=512 Peak Other Flags ---------------- C benchmarks: -I/opt/cuda-5.5/include/ -lOpenCL C++ benchmarks: -I/opt/cuda-5.5/include/ -lOpenCL The flags file that was used to format this result can be browsed at http://www.spec.org/accel/flags/pgi2014_flags.20150303.html You can also download the XML flags source by saving the following link: http://www.spec.org/accel/flags/pgi2014_flags.20150303.xml SPEC is a registered trademark of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ------------------------------------------------------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2014-2015 Standard Performance Evaluation Corporation Tested with SPEC ACCEL v39. Report generated on Tue Mar 3 14:21:19 2015 by ACCEL ASCII formatter v1212. Originally published on 17 March 2014.