OS Images |
os_Image_1(1)
|
Hardware Description |
hw_1
|
Number of Systems |
1
|
SW Environment |
Non-virtual
|
Tuning |
- Workload Profile=High Performance Compute(HPC)
- Thermal Configuration=Maximum Cooling
- Performance Determinism=Power Deterministic
- Memory Patrol Scrubbing=Disabled
- NUMA memory domains per socket=Four memory domains per socket
- Last-Level Cache(LLC) As NUMA Node= Enabled
- Processor Power and Utilization Monitoring = Disabled
- Memory Pre-Failure Notification = Disabled
|
Notes |
None
|
|
JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(16), jvm_TxInjector_1(16)
|
OS Image Description |
os_1
|
Tuning |
- tuned-adm profile=throughput-performance
- ulimit -n 655350
- echo 15000000 > /proc/sys/kernel/sched_wakeup_granularity_ns
- echo 80000000 > /proc/sys/kernel/sched_min_granularity_ns
- echo 1000 > /proc/sys/kernel/sched_migration_cost_ns
- echo 16 > /proc/sys/kernel/sched_nr_migrate
- echo 950000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 10000000 > /proc/sys/kernel/sched_latency_ns
- echo 10000 > /proc/sys/vm/dirty_expire_centisecs
- echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
|
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms3g -Xmx3g -Xmn2g -XX:+UseParallelOldGC -XX:ParallelGCThreads=1 -XX:CICompilerCount=2
|
Tuning |
None
|
Notes |
numactl --interleave=all
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-showversion -server -XX:AllocatePrefetchInstr=2 -XX:LargePageSizeInBytes=2m -XX:-UsePerfData -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:-UseBiasedLocking -XX:+UseLargePages -XX:+UseParallelOldGC -Xms29g -Xmx29g -Xmn27g -XX:SurvivorRatio=28 -XX:TargetSurvivorRatio=98 -XX:ParallelGCThreads=16 -XX:MaxTenuringThreshold=15 -Xnoclassgc -XX:+PrintGCDetails -XX:InlineSmallCode=10k -XX:MaxGCPauseMillis=300 -XX:ThreadStackSize=1m
|
Tuning |
None
|
Notes |
Used numactl to affinitize each Backend JVM to two NUMA nodes. - numactl --cpunodebind=0,1 --localalloc
- numactl --cpunodebind=2,3 --localalloc
- numactl --cpunodebind=4,5 --localalloc
- numactl --cpunodebind=6,7 --localalloc
- numactl --cpunodebind=8,9 --localalloc
- numactl --cpunodebind=10,11 --localalloc
- numactl --cpunodebind=12,13 --localalloc
- numactl --cpunodebind=14,15 --localalloc
- numactl --cpunodebind=16,17 --localalloc
- numactl --cpunodebind=18,19 --localalloc
- numactl --cpunodebind=20,21 --localalloc
- numactl --cpunodebind=22,23 --localalloc
- numactl --cpunodebind=24,25 --localalloc
- numactl --cpunodebind=26,27 --localalloc
- numactl --cpunodebind=28,29 --localalloc
- numactl --cpunodebind=30,31 --localalloc
|
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms4g -Xmx4g -Xmn3g -XX:+UseParallelOldGC -XX:ParallelGCThreads=3 -XX:CICompilerCount=2
|
Tuning |
None
|
Notes |
Used numactl to affinitize each Transaction Injector JVM to a Numa Node. - numactl --cpunodebind=1 --localalloc
- numactl --cpunodebind=3 --localalloc
- numactl --cpunodebind=5 --localalloc
- numactl --cpunodebind=7 --localalloc
- numactl --cpunodebind=9 --localalloc
- numactl --cpunodebind=11 --localalloc
- numactl --cpunodebind=13 --localalloc
- numactl --cpunodebind=15 --localalloc
- numactl --cpunodebind=17 --localalloc
- numactl --cpunodebind=19 --localalloc
- numactl --cpunodebind=21 --localalloc
- numactl --cpunodebind=23 --localalloc
- numactl --cpunodebind=25 --localalloc
- numactl --cpunodebind=27 --localalloc
- numactl --cpunodebind=29 --localalloc
- numactl --cpunodebind=31 --localalloc
|
|