OS Images |
os_Image_1(1)
|
Hardware Description |
hw_1
|
Number of Systems |
1
|
SW Environment |
non-virtual
|
Tuning |
BIOS Settings:
- Engine Boost = Enabled
- L1 Stream HW Prefetcher = Disable
- L2 Stream HW Prefetcher = Disable
- Enhanced REP MOVSB/STOSB = Disabled
- Fast Short REP MOVSB = Disabled
- DLWM Support = Disabled
- NUMA nodes per socket = NPS4
- Memory Interleaving = Disabled
- ACPI SRAT L3 Cache as NUMA Domain = Enabled
- APBDIS = 1
- Fix SOC P-state = P0
|
Notes |
None
|
|
JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(16), jvm_TxInjector_1(16)
|
OS Image Description |
os_1
|
Tuning |
- cpupower frequency-set -g performance
- tuned-adm profile throughput-performance
- ulimit -n 655360
- echo 960000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 40000000 > /proc/sys/kernel/sched_latency_ns
- echo 40000 > /proc/sys/kernel/sched_migration_cost_ns
- echo 800000000 > /proc/sys/kernel/sched_min_granularity_ns
- echo 200000000 > /proc/sys/kernel/sched_wakeup_granularity_ns
- echo 9000 > /proc/sys/kernel/sched_nr_migrate
- echo 10000 > /proc/sys/vm/dirty_expire_centisecs
- echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- echo 0 > /proc/sys/vm/numa_stat
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
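
The writes above require root, and not every kernel exposes every path listed. A minimal sketch of a guarded way to apply such settings follows; it uses a subset of the values from the list. The `DRY_RUN` variable and the function names `apply_setting` and `apply_all` are hypothetical helpers introduced for illustration and are not part of the reported configuration.

```shell
#!/bin/sh
# Sketch only: apply "value path" pairs, skipping paths that are absent.
# DRY_RUN is a hypothetical switch; it defaults to 1 (print, do not write).

apply_setting() {
    as_value="$1"
    as_path="$2"
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "would write ${as_value} to ${as_path}"
    elif [ -w "$as_path" ]; then
        echo "$as_value" > "$as_path"
    else
        echo "skipped ${as_path} (missing or not writable)" >&2
    fi
}

apply_all() {
    # Subset of the settings listed in the Tuning cell above.
    while read -r aa_value aa_path; do
        if [ -n "$aa_value" ]; then
            apply_setting "$aa_value" "$aa_path"
        fi
    done <<'EOF'
0 /proc/sys/kernel/numa_balancing
10 /proc/sys/vm/swappiness
40 /proc/sys/vm/dirty_ratio
10 /proc/sys/vm/dirty_background_ratio
always /sys/kernel/mm/transparent_hugepage/defrag
always /sys/kernel/mm/transparent_hugepage/enabled
EOF
}

apply_all
```

With `DRY_RUN=0` and root privileges the same function performs the writes; a missing path is reported on stderr instead of aborting the run.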
|
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2
|
Tuning |
Used numactl to interleave memory on all CPUs
|
Notes |
None
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms30g -Xmx30g -Xmn27g -XX:AllocatePrefetchInstr=2 -XX:+UseParallelGC -XX:ParallelGCThreads=16 -XX:LargePageSizeInBytes=2m -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:+UseLargePages -XX:SurvivorRatio=28 -XX:TargetSurvivorRatio=95 -XX:MaxTenuringThreshold=15 -XX:InlineSmallCode=11k -XX:MaxGCPauseMillis=300 -XX:LoopUnrollLimit=200 -XX:AdaptiveSizeMajorGCDecayTimeScale=12 -XX:AdaptiveSizeDecrementScaleFactor=2 -XX:+UseTransparentHugePages -XX:+UseUnalignedLoadStores -XX:-UseFastStosb -XX:+UseXMMForArrayCopy -XX:+UseXMMForObjInit -XX:+UseFPUForSpilling -XX:TLABAllocationWeight=55 -XX:ThreadStackSize=512 -XX:CompileThresholdScaling=50
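
The heap layout implied by `-Xmx30g -Xmn27g -XX:SurvivorRatio=28` can be worked out with simple arithmetic. `SurvivorRatio` is the ratio of eden to one survivor space, so each survivor space is roughly young / (ratio + 2). The sketch below uses integer MiB arithmetic; the JVM aligns region sizes internally, so the real values may differ slightly. The function name `heap_layout` is a hypothetical helper.

```shell
#!/bin/sh
# Sketch only: approximate generation sizes from -Xmx, -Xmn and SurvivorRatio.
# Arguments: heap size in GiB, young generation size in GiB, SurvivorRatio.

heap_layout() {
    hl_heap_mib=$(( $1 * 1024 ))
    hl_young_mib=$(( $2 * 1024 ))
    hl_ratio=$3
    # Old generation is whatever the young generation does not take.
    hl_old_mib=$(( hl_heap_mib - hl_young_mib ))
    # Young = eden + 2 survivors, with eden = ratio * survivor.
    hl_survivor_mib=$(( hl_young_mib / (hl_ratio + 2) ))
    hl_eden_mib=$(( hl_young_mib - 2 * hl_survivor_mib ))
    echo "old=${hl_old_mib}MiB eden=${hl_eden_mib}MiB survivor=${hl_survivor_mib}MiB"
}

# Backend values from the command line above.
heap_layout 30 27 28
```

Under these assumptions the old generation is about 3 GiB and eden about 25 GiB per Backend JVM.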
|
Tuning |
Used numactl to affinitize each Backend JVM to 8 Cores / 16 Threads:
- numactl --physcpubind=0-7,128-135 --localalloc
- numactl --physcpubind=8-15,136-143 --localalloc
- numactl --physcpubind=16-23,144-151 --localalloc
- numactl --physcpubind=24-31,152-159 --localalloc
- numactl --physcpubind=32-39,160-167 --localalloc
- numactl --physcpubind=40-47,168-175 --localalloc
- numactl --physcpubind=48-55,176-183 --localalloc
- numactl --physcpubind=56-63,184-191 --localalloc
- numactl --physcpubind=64-71,192-199 --localalloc
- numactl --physcpubind=72-79,200-207 --localalloc
- numactl --physcpubind=80-87,208-215 --localalloc
- numactl --physcpubind=88-95,216-223 --localalloc
- numactl --physcpubind=96-103,224-231 --localalloc
- numactl --physcpubind=104-111,232-239 --localalloc
- numactl --physcpubind=112-119,240-247 --localalloc
- numactl --physcpubind=120-127,248-255 --localalloc
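
The sixteen bindings above follow a regular pattern and can be generated rather than typed by hand. The sketch below assumes, as the list suggests, 128 physical cores with the sibling hardware thread of core c numbered c + 128; this numbering is inferred from the list and should be checked against the actual topology before reuse. The function name `gen_bindings` is a hypothetical helper.

```shell
#!/bin/sh
# Sketch only: print one numactl prefix per group of 8 cores / 16 threads.
# Assumes sibling thread of core c is CPU c + total_cores (inferred from the list).

gen_bindings() {
    gb_cores_per_jvm=8
    gb_total_cores=128
    gb_start=0
    while [ "$gb_start" -lt "$gb_total_cores" ]; do
        gb_end=$(( gb_start + gb_cores_per_jvm - 1 ))
        gb_sib_start=$(( gb_start + gb_total_cores ))
        gb_sib_end=$(( gb_end + gb_total_cores ))
        echo "numactl --physcpubind=${gb_start}-${gb_end},${gb_sib_start}-${gb_sib_end} --localalloc"
        gb_start=$(( gb_start + gb_cores_per_jvm ))
    done
}

gen_bindings
```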
|
Notes |
None
|
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2
|
Tuning |
Used numactl to affinitize each TxInjector JVM to 8 Cores / 16 Threads:
- numactl --physcpubind=0-7,128-135 --localalloc
- numactl --physcpubind=8-15,136-143 --localalloc
- numactl --physcpubind=16-23,144-151 --localalloc
- numactl --physcpubind=24-31,152-159 --localalloc
- numactl --physcpubind=32-39,160-167 --localalloc
- numactl --physcpubind=40-47,168-175 --localalloc
- numactl --physcpubind=48-55,176-183 --localalloc
- numactl --physcpubind=56-63,184-191 --localalloc
- numactl --physcpubind=64-71,192-199 --localalloc
- numactl --physcpubind=72-79,200-207 --localalloc
- numactl --physcpubind=80-87,208-215 --localalloc
- numactl --physcpubind=88-95,216-223 --localalloc
- numactl --physcpubind=96-103,224-231 --localalloc
- numactl --physcpubind=104-111,232-239 --localalloc
- numactl --physcpubind=112-119,240-247 --localalloc
- numactl --physcpubind=120-127,248-255 --localalloc
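
A binding list like the one above can be sanity-checked by expanding each `--physcpubind` argument into individual CPU ids and looking for ids that appear in more than one group. The following is a minimal sketch; `expand_cpus` and `find_duplicates` are hypothetical helper names, and the input format (one CPU list per line) is an assumption made for illustration.

```shell
#!/bin/sh
# Sketch only: expand CPU lists such as "0-7,128-135" and report overlaps.

expand_cpus() {
    # Print one CPU id per line for a comma-separated list of ids or ranges.
    printf '%s\n' "$1" | tr ',' '\n' | while IFS=- read -r ec_lo ec_hi; do
        ec_hi=${ec_hi:-$ec_lo}
        ec_i=$ec_lo
        while [ "$ec_i" -le "$ec_hi" ]; do
            echo "$ec_i"
            ec_i=$(( ec_i + 1 ))
        done
    done
}

find_duplicates() {
    # stdin: one CPU list per line; stdout: CPU ids claimed by more than one line.
    while read -r fd_list; do
        expand_cpus "$fd_list"
    done | sort -n | uniq -d
}

# Two groups from the list above; no output means no overlap between them.
printf '%s\n' "0-7,128-135" "8-15,136-143" | find_duplicates
```

Run separately over the Backend list and the TxInjector list, an empty result indicates that no CPU is shared between JVMs of the same type.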
|
Notes |
None
|
|