| OS Images | 
            
              os_Image_1(1)
             | 
           
          
            | Hardware Description | 
            
              hw_1
             | 
           
          
            | Number of Systems | 
            
              1
             | 
           
          
            | SW Environment | 
            
              non-virtual
             | 
           
          
            | Tuning | 
            
               BIOS Settings:
 - SMT Control set to Enable
 - IOMMU set to Enabled
 - NUMA nodes per socket set to NPS4
 - Determinism Control set to Manual
 - Determinism Slider set to Power
 - cTDP Control set to Manual
 - cTDP set to 280
 - Package Power Limit Control set to Manual
 - Package Power Limit set to 280
 - L1 Stream HW Prefetcher set to Disable
 - L2 Stream HW Prefetcher set to Disable
 - ACPI SRAT L3 Cache as NUMA Domain set to Enable
  
             | 
           
          
            | Notes | 
            
              notes
             | 
           
         
       | 
      
        
        
             
          
            | JVM Instances | 
            
              jvm_Ctr_1(1), jvm_Backend_1(16), jvm_TxInjector_1(16)
             | 
           
          
            | OS Image Description | 
            
              os_1
             | 
           
          
            | Tuning | 
            
              - cpupower -c all frequency-set -g performance
 - tuned-adm profile throughput-performance
 
 - echo 400000000 > /proc/sys/kernel/sched_min_granularity_ns
 - echo 40000 > /proc/sys/kernel/sched_wakeup_granularity_ns
 - echo 40000 > /proc/sys/kernel/sched_migration_cost_ns
 - echo 128 > /proc/sys/kernel/sched_nr_migrate
 - echo 950000 > /proc/sys/kernel/sched_rt_runtime_us
 - echo 400000 > /proc/sys/kernel/sched_latency_ns
 
 - echo 10000 > /proc/sys/vm/dirty_expire_centisecs
 - echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
 - echo 40 > /proc/sys/vm/dirty_ratio
 - echo 10 > /proc/sys/vm/dirty_background_ratio
 - echo 10 > /proc/sys/vm/swappiness
 
 - echo 0 > /proc/sys/kernel/numa_balancing
 - echo always > /sys/kernel/mm/transparent_hugepage/defrag
 - echo always > /sys/kernel/mm/transparent_hugepage/enabled
 
 - Add cgroup_disable=memory,cpu,cpuacct,blkio,hugetlb,pids,cpuset,perf_event,freezer,devices,net_cls,net_prio to GRUB_CMDLINE_LINUX_DEFAULT
 - ulimit -n 1024000
 - UserTasksMax=970000
 - DefaultTasksMax=970000
  
             | 
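The `echo` lines in the OS tuning list write kernel tunables under `/proc/sys` and `/sys`. A minimal Python sketch for checking a subset of them on a running system might look like the following. The names `check_tunables` and `current_value` and the `root` parameter are hypothetical, introduced only so the check can be pointed at a test directory; the paths and values are copied from the list. Some of the scheduler files may not exist on newer kernels, so a missing file is reported as `None` rather than raised as an error.

```python
import re
from pathlib import Path

# Paths and values copied from the tuning list; only a subset is shown.
EXPECTED = {
    "proc/sys/kernel/sched_rt_runtime_us": "950000",
    "proc/sys/kernel/numa_balancing": "0",
    "proc/sys/vm/dirty_expire_centisecs": "10000",
    "proc/sys/vm/dirty_writeback_centisecs": "1500",
    "proc/sys/vm/dirty_ratio": "40",
    "proc/sys/vm/dirty_background_ratio": "10",
    "proc/sys/vm/swappiness": "10",
    "sys/kernel/mm/transparent_hugepage/enabled": "always",
    "sys/kernel/mm/transparent_hugepage/defrag": "always",
}


def current_value(text):
    """Return the active setting from a tunable file's contents."""
    # Files such as transparent_hugepage/enabled list every option and
    # mark the active one with brackets, e.g. "[always] madvise never".
    match = re.search(r"\[([^\]]+)\]", text)
    return match.group(1) if match else text.strip()


def check_tunables(expected=EXPECTED, root="/"):
    """Return {path: (expected, actual)} for entries that differ.

    `actual` is None when the file is missing or unreadable.
    """
    mismatches = {}
    for rel_path, want in expected.items():
        path = Path(root) / rel_path
        try:
            actual = current_value(path.read_text())
        except OSError:
            actual = None
        if actual != want:
            mismatches[rel_path] = (want, actual)
    return mismatches
```

Calling `check_tunables()` with no arguments reads the live system and returns an empty dictionary when every listed value matches.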
           
          
            | Notes | 
            
              None
             | 
           
               
          
            | Parts of Benchmark | 
            
              Controller
             | 
           
          
            | JVM Instance Description | 
            
              jvm_1
             | 
           
          
            | Command Line | 
            
               -Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2 
             | 
           
          
            | Tuning | 
            
              None
             | 
           
          
            | Notes | 
            
               Used numactl to run on all available nodes 
             | 
           
               
          
            | Parts of Benchmark | 
            
              Backend
             | 
           
          
            | JVM Instance Description | 
            
              jvm_1
             | 
           
          
            | Command Line | 
            
               -Xms31g -Xmx31g -Xmn30g -XX:AllocatePrefetchInstr=2 -XX:+UseParallelGC -XX:LargePageSizeInBytes=2m -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:+UseLargePages -XX:SurvivorRatio=26 -XX:TargetSurvivorRatio=95 -XX:MaxTenuringThreshold=15 -XX:ParallelGCThreads=32 -XX:-UseDynamicNumberOfCompilerThreads -XX:-UseDynamicNumberOfGCThreads -XX:+UseTransparentHugePages -XX:MaxInlineSize=900 -XX:FreqInlineSize=900 -XX:LoopUnrollLimit=30 -XX:InlineSmallCode=3k -XX:AutoBoxCacheMax=5000 -XX:AllocatePrefetchDistance=128 -XX:AllocatePrefetchLines=9 -XX:AllocateInstancePrefetchLines=3 -XX:TLABAllocationWeight=5 -XX:ThreadStackSize=512 -XX:TLABSize=4m -XX:CompileThresholdScaling=50 
             | 
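With `-Xms` equal to `-Xmx`, a fixed `-Xmn`, and adaptive sizing disabled, the Backend generation sizes should stay constant for the whole run. As a rough worked example in Python, assuming the usual HotSpot meaning of `SurvivorRatio` (eden size divided by the size of one survivor space) and ignoring alignment rounding, the layout of one Backend heap and the total heap requested by all JVM instances can be estimated as follows. The function names are hypothetical.

```python
MIB_PER_GIB = 1024


def heap_layout(xmx_mib, xmn_mib, survivor_ratio):
    """Approximate generation sizes in MiB, ignoring alignment rounding."""
    # Young gen = eden + 2 survivor spaces, with eden = ratio * survivor.
    survivor = xmn_mib / (survivor_ratio + 2)
    eden = xmn_mib - 2 * survivor
    old = xmx_mib - xmn_mib
    return {"eden": eden, "survivor": survivor, "old": old}


def total_heap_gib(groups):
    """Sum -Xmx over (instance_count, xmx_gib) pairs."""
    return sum(count * xmx_gib for count, xmx_gib in groups)


# Backend: -Xmx31g -Xmn30g -XX:SurvivorRatio=26
backend = heap_layout(31 * MIB_PER_GIB, 30 * MIB_PER_GIB, 26)

# Instance counts from the JVM Instances row: 1 Controller, 16 Backends,
# 16 TxInjectors, with -Xmx of 2g, 31g and 2g respectively.
total = total_heap_gib([(1, 2), (16, 31), (16, 2)])

print(backend)
print(f"total requested heap: {total} GiB")
```

Under these assumptions each Backend has roughly 27.9 GiB of eden, two survivor spaces of about 1.07 GiB each, and a 1 GiB old generation; the heap requested across all 33 instances is 530 GiB.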
           
          
            | Tuning | 
            
              None
             | 
           
          
            | Notes | 
            
               Used numactl to affinitize each Backend JVM to 8 Cores / 16 Threads:
 - numactl --physcpubind=0-7,128-135 --localalloc
 - numactl --physcpubind=8-15,136-143 --localalloc
 - numactl --physcpubind=16-23,144-151 --localalloc
 - numactl --physcpubind=24-31,152-159 --localalloc
 - numactl --physcpubind=32-39,160-167 --localalloc
 - numactl --physcpubind=40-47,168-175 --localalloc
 - numactl --physcpubind=48-55,176-183 --localalloc
 - numactl --physcpubind=56-63,184-191 --localalloc
 - numactl --physcpubind=64-71,192-199 --localalloc
 - numactl --physcpubind=72-79,200-207 --localalloc
 - numactl --physcpubind=80-87,208-215 --localalloc
 - numactl --physcpubind=88-95,216-223 --localalloc
 - numactl --physcpubind=96-103,224-231 --localalloc
 - numactl --physcpubind=104-111,232-239 --localalloc
 - numactl --physcpubind=112-119,240-247 --localalloc
 - numactl --physcpubind=120-127,248-255 --localalloc
  
             | 
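The sixteen Backend bindings follow a regular pattern: instance i gets CPUs 8i to 8i+7 plus the logical CPUs numbered 128 higher. A short Python sketch that regenerates the list is shown below. It assumes, as the "8 Cores / 16 Threads" wording suggests, that logical CPUs 128-255 are the SMT siblings of CPUs 0-127; the function name `physcpubind_args` and its parameters are hypothetical.

```python
def physcpubind_args(instances=16, cores_per_instance=8, sibling_offset=128):
    """Build one numactl argument string per JVM instance."""
    args = []
    for i in range(instances):
        first = i * cores_per_instance
        last = first + cores_per_instance - 1
        # Physical cores first, then the assumed SMT sibling range.
        cpus = f"{first}-{last},{first + sibling_offset}-{last + sibling_offset}"
        args.append(f"numactl --physcpubind={cpus} --localalloc")
    return args


for line in physcpubind_args():
    print(line)
```

Generating the bindings from the pattern rather than typing them by hand makes it harder to introduce an off-by-one range when the core count per instance changes.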
           
               
          
            | Parts of Benchmark | 
            
              TxInjector
             | 
           
          
            | JVM Instance Description | 
            
              jvm_1
             | 
           
          
            | Command Line | 
            
               -Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2 
             | 
           
          
            | Tuning | 
            
              None
             | 
           
          
            | Notes | 
            
               Used numactl to affinitize each TxInjector JVM to 8 Cores / 16 Threads:
 - numactl --physcpubind=0-7,128-135 --localalloc
 - numactl --physcpubind=8-15,136-143 --localalloc
 - numactl --physcpubind=16-23,144-151 --localalloc
 - numactl --physcpubind=24-31,152-159 --localalloc
 - numactl --physcpubind=32-39,160-167 --localalloc
 - numactl --physcpubind=40-47,168-175 --localalloc
 - numactl --physcpubind=48-55,176-183 --localalloc
 - numactl --physcpubind=56-63,184-191 --localalloc
 - numactl --physcpubind=64-71,192-199 --localalloc
 - numactl --physcpubind=72-79,200-207 --localalloc
 - numactl --physcpubind=80-87,208-215 --localalloc
 - numactl --physcpubind=88-95,216-223 --localalloc
 - numactl --physcpubind=96-103,224-231 --localalloc
 - numactl --physcpubind=104-111,232-239 --localalloc
 - numactl --physcpubind=112-119,240-247 --localalloc
 - numactl --physcpubind=120-127,248-255 --localalloc
  
             | 
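The TxInjector bindings reuse the same sixteen CPU sets as the Backends, so a quick consistency check is that the sets do not overlap and together cover every logical CPU. The following Python sketch parses the `--physcpubind` lists and performs that check. The helper names are hypothetical, and the total of 256 logical CPUs is an assumption inferred from the highest CPU id in the list (255).

```python
def parse_cpu_list(spec):
    """Expand a list such as "0-7,128-135" into a set of CPU ids."""
    cpus = set()
    for part in spec.split(","):
        lo, _, hi = part.partition("-")
        # A bare id such as "5" has no upper bound, so reuse the lower one.
        cpus.update(range(int(lo), int(hi or lo) + 1))
    return cpus


def check_partition(specs, total_cpus):
    """Return True if the CPU sets are disjoint and cover 0..total_cpus-1."""
    seen = set()
    for spec in specs:
        cpus = parse_cpu_list(spec)
        if seen & cpus:
            return False
        seen |= cpus
    return seen == set(range(total_cpus))


# The sixteen CPU lists from the notes, rebuilt from their regular pattern.
SPECS = [f"{8 * i}-{8 * i + 7},{8 * i + 128}-{8 * i + 135}" for i in range(16)]

print(check_partition(SPECS, total_cpus=256))
```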
           
         
       |