Hewlett Packard Enterprise ProLiant DL385 Gen11 681410 SPECjbb2015-MultiJVM max-jOPS
630390 SPECjbb2015-MultiJVM critical-jOPS
Tested by: Hewlett Packard Enterprise
Test Sponsor: Hewlett Packard Enterprise
Test location: Houston, TX
Test date: May 15, 2023
SPEC license #: 3
Hardware Availability: Jun-2023
Software Availability: Apr-2023
Publication: Tue Jun 13 10:03:08 EDT 2023
Benchmark Results Summary
 
Overall Throughput RT curve
Overall SUT (System Under Test) Description
Vendor: Hewlett Packard Enterprise
Vendor URL: http://www.hpe.com/
System Source: Single Supplier
System Designation: Server Rack
Total Systems: 1
All SUT Systems Identical: YES
Total Nodes: 1
All Nodes Identical: YES
Nodes Per System: 1
Total Chips: 2
Total Cores: 256
Total Threads: 512
Total Memory Amount (GB): 1536
Total OS Images: 1
SW Environment: Non-virtual
 
Hardware hw_1
Name: ProLiant DL385 Gen11
Vendor: Hewlett Packard Enterprise
Vendor URL: http://hpe.com/
Available: Jun-2023
Model: ProLiant DL385 Gen11
Form Factor: 2U Rack
CPU Name: AMD EPYC 9754
CPU Characteristics: 128 Core, 2.25 GHz, 256 MB L3 Cache (Max. Boost Clock Up to 3.1 GHz)
Number of Systems: 1
Nodes Per System: 1
Chips Per System: 2
Cores Per System: 256
Cores Per Chip: 128
Threads Per System: 512
Threads Per Core: 2
Version: A55 v1.30 03/06/2023
CPU Frequency (MHz): 2250
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 1024 KB I+D on chip per core
Tertiary Cache: 256 MB (I+D) on chip per chip
Other Cache: None
Disk: 1 x 480 GB SATA SSD
File System: btrfs
Memory Amount (GB): 1536
# and size of DIMM(s): 24 x 64 GB
Memory Details: 64GB 2Rx4 PC5-38400R, running at 4800 MHz
# and type of Network Interface Cards (NICs): HPE Ethernet 1Gb 4-port
Power Supply Quantity and Rating (W): 2 x 1600
Other Hardware: None
Cabinet/Housing/Enclosure: None
Shared Description: None
Shared Comment: None
Notes
  • NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown) is mitigated in the system as tested and documented
  • Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1) is mitigated in the system as tested and documented
  • Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2) is mitigated in the system as tested and documented
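The core, thread, and memory totals reported for the SUT follow directly from the per-chip and per-DIMM figures in the hardware description. As a quick consistency check, the sketch below recomputes them; the variable names are illustrative and not part of the report.

```python
# Figures copied from the hardware description above.
chips = 2
cores_per_chip = 128
threads_per_core = 2
dimm_count = 24
dimm_size_gb = 64

# Derived totals, to compare with Total Cores, Total Threads, and Total Memory Amount (GB).
total_cores = chips * cores_per_chip
total_threads = total_cores * threads_per_core
total_memory_gb = dimm_count * dimm_size_gb

print(total_cores, total_threads, total_memory_gb)  # 256 512 1536
```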
Other Hardware network_1
Name: None
Vendor: None
Vendor URL: None
Version: None
Available: None
Bitness: None
Notes: None
Operating System os_1
Name: SUSE Linux Enterprise Server 15 SP4
Vendor: SUSE
Vendor URL: http://suse.com/
Version: 5.14.21-150400.22-default
Available: Jun-2022
Bitness: 64
Notes: None
Java Virtual Machine jvm_1
Name: Oracle Java SE 17.0.7
Vendor: Oracle
Vendor URL: http://www.oracle.com/
Version: Java HotSpot 64-bit Server VM, version 17.0.7
Available: Apr-2023
Bitness: 64
Notes: None
Other Software other_1
Name: None
Vendor: None
Vendor URL: None
Version: None
Available: None
Bitness: None
Notes: None
Hardware
OS Images os_Image_1(1)
Hardware Description hw_1
Number of Systems 1
SW Environment Non-virtual
Tuning
  • Workload Profile=High Performance Compute(HPC)
  • Thermal Configuration=Maximum Cooling
  • Determinism Control=Manual
  • Performance Determinism=Power Deterministic
  • Memory Patrol Scrubbing=Disabled
  • Numa Memory Domains Per Socket(NPS)=Two Memory Domains Per Socket
  • Last-Level Cache(LLC) As NUMA Node=Disabled
  • L1 Stream HW Prefetcher=Disabled
  • L2 Stream HW Prefetcher=Disabled
  • Minimum Processor Idle Power Core C-State=No C-states
  • xGMI Link Bandwidth=32Gbps
  • Package Power Limit=400
Notes None
OS Image os_Image_1
JVM Instances jvm_Ctr_1(1), jvm_Backend_1(12), jvm_TxInjector_1(12)
OS Image Description os_1
Tuning
  • ulimit -n 1024000
  • UserTasksMax=970000
  • DefaultTasksMax=970000
  • tuned-adm profile throughput-performance
  • echo 950000 > /proc/sys/kernel/sched_rt_runtime_us
  • echo 100000000 > /proc/sys/kernel/sched_latency_ns
  • echo 80000 > /proc/sys/kernel/sched_migration_cost_ns
  • echo 300000 > /proc/sys/kernel/sched_min_granularity_ns
  • echo 300000 > /proc/sys/kernel/sched_wakeup_granularity_ns
  • echo 32 > /proc/sys/kernel/sched_nr_migrate
  • echo 10000 > /proc/sys/vm/dirty_expire_centisecs
  • echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
  • echo 40 > /proc/sys/vm/dirty_ratio
  • echo 10 > /proc/sys/vm/dirty_background_ratio
  • echo 10 > /proc/sys/vm/swappiness
  • echo 0 > /proc/sys/kernel/numa_balancing
  • echo 0 > /proc/sys/vm/numa_stat
  • echo always > /sys/kernel/mm/transparent_hugepage/enabled
  • echo always > /sys/kernel/mm/transparent_hugepage/defrag
Notes None
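When reproducing this OS configuration, it can help to confirm that the kernel actually holds the values written by the echo commands above. The following is a minimal sketch, not part of the published result: the helper names `active_value` and `find_mismatches` are hypothetical, only a subset of the listed settings is checked, and it assumes the files exist at the same paths on the kernel being inspected. The sysfs transparent hugepage files report all choices with the active one in brackets (for example `[always] madvise never`), so the helper extracts the bracketed token.

```python
from pathlib import Path

# Subset of the settings listed above: path relative to the filesystem root -> expected value.
EXPECTED = {
    "proc/sys/kernel/sched_rt_runtime_us": "950000",
    "proc/sys/kernel/sched_nr_migrate": "32",
    "proc/sys/vm/dirty_ratio": "40",
    "proc/sys/vm/dirty_background_ratio": "10",
    "proc/sys/vm/swappiness": "10",
    "proc/sys/kernel/numa_balancing": "0",
    "sys/kernel/mm/transparent_hugepage/enabled": "always",
    "sys/kernel/mm/transparent_hugepage/defrag": "always",
}


def active_value(raw):
    """Return the active setting; sysfs choice files mark it as [value]."""
    for token in raw.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    return raw.strip()


def find_mismatches(expected, root="/"):
    """Return {path: (expected, actual)} for every setting that differs."""
    mismatches = {}
    for rel_path, want in expected.items():
        path = Path(root) / rel_path
        try:
            got = active_value(path.read_text())
        except OSError:
            got = None  # file missing or unreadable on this kernel
        if got != want:
            mismatches[rel_path] = (want, got)
    return mismatches


if __name__ == "__main__":
    for rel_path, (want, got) in find_mismatches(EXPECTED).items():
        print(f"/{rel_path}: expected {want}, found {got}")
```

The `root` parameter exists only so the check can be pointed at a copy of the files instead of the live system.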
JVM Instance jvm_Ctr_1
Parts of Benchmark Controller
JVM Instance Description jvm_1
Command Line

-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2 -XX:CICompilerCount=4

Tuning

Used numactl to interleave memory across all NUMA nodes

  • numactl --interleave=all
Notes None
JVM Instance jvm_Backend_1
Parts of Benchmark Backend
JVM Instance Description jvm_1
Command Line

-Xms120g -Xmx120g -Xmn118g -server -XX:MetaspaceSize=256m -XX:AllocatePrefetchInstr=2 -XX:LargePageSizeInBytes=2m -XX:-UsePerfData -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:+UseLargePages -XX:+UseParallelGC -XX:SurvivorRatio=100 -XX:TargetSurvivorRatio=99 -XX:ParallelGCThreads=42 -XX:MaxTenuringThreshold=15 -XX:InitialCodeCacheSize=25m -XX:InlineSmallCode=10k -XX:MaxGCPauseMillis=200 -XX:+UseCompressedOops -XX:ObjectAlignmentInBytes=32 -XX:+UseTransparentHugePages -XX:TLABAllocationWeight=55 -XX:ThreadStackSize=512 -XX:CompileThresholdScaling=120

Tuning

Used numactl to affinitize three Backend JVMs to each of the four NUMA nodes

  • numactl --cpunodebind=0 --localalloc
  • numactl --cpunodebind=0 --localalloc
  • numactl --cpunodebind=0 --localalloc
  • numactl --cpunodebind=1 --localalloc
  • numactl --cpunodebind=1 --localalloc
  • numactl --cpunodebind=1 --localalloc
  • numactl --cpunodebind=2 --localalloc
  • numactl --cpunodebind=2 --localalloc
  • numactl --cpunodebind=2 --localalloc
  • numactl --cpunodebind=3 --localalloc
  • numactl --cpunodebind=3 --localalloc
  • numactl --cpunodebind=3 --localalloc
Notes None
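The twelve bindings above follow a regular pattern: four NUMA nodes (two sockets with two memory domains per socket), three Backend JVMs per node. A minimal sketch of how such prefixes could be generated is shown here. The function name and the assumption that consecutive JVM indices share a node are hypothetical, since the report lists the bindings but does not state which group received which one.

```python
NUMA_NODES = 4       # node ids 0..3, as in the list above
JVMS_PER_NODE = 3    # three Backend JVMs bound to each node


def numactl_prefix(jvm_index):
    """Build the numactl prefix for the JVM with the given 0-based index (assumed ordering)."""
    node = jvm_index // JVMS_PER_NODE
    return f"numactl --cpunodebind={node} --localalloc"


prefixes = [numactl_prefix(i) for i in range(NUMA_NODES * JVMS_PER_NODE)]
for prefix in prefixes:
    print(prefix)
```

Binding both CPU and memory allocation to the same node keeps each Backend's heap local to the cores that run it.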
JVM Instance jvm_TxInjector_1
Parts of Benchmark TxInjector
JVM Instance Description jvm_1
Command Line

-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2 -XX:CICompilerCount=4

Tuning

Used numactl to affinitize three Transaction Injector JVMs to each of the four NUMA nodes

  • numactl --cpunodebind=0 --localalloc
  • numactl --cpunodebind=0 --localalloc
  • numactl --cpunodebind=0 --localalloc
  • numactl --cpunodebind=1 --localalloc
  • numactl --cpunodebind=1 --localalloc
  • numactl --cpunodebind=1 --localalloc
  • numactl --cpunodebind=2 --localalloc
  • numactl --cpunodebind=2 --localalloc
  • numactl --cpunodebind=2 --localalloc
  • numactl --cpunodebind=3 --localalloc
  • numactl --cpunodebind=3 --localalloc
  • numactl --cpunodebind=3 --localalloc
Notes None
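The JVM command lines in this report fix the heap size of every instance, so the total Java heap can be compared with the installed memory. The sketch below does that arithmetic using the instance counts and -Xms/-Xmx/-Xmn values from the report. It counts heap only; metaspace, code cache, thread stacks, and OS usage are not included, so the remaining figure is an upper bound on what is left for everything else.

```python
backend_jvms = 12
backend_heap_gb = 120     # -Xms120g -Xmx120g
backend_young_gb = 118    # -Xmn118g
small_jvms = 1 + 12       # one Controller plus twelve TxInjectors
small_heap_gb = 2         # -Xms2g -Xmx2g
total_memory_gb = 1536    # Total Memory Amount (GB)

total_heap_gb = backend_jvms * backend_heap_gb + small_jvms * small_heap_gb
remaining_gb = total_memory_gb - total_heap_gb
backend_old_gen_gb = backend_heap_gb - backend_young_gb

print(total_heap_gb)       # 1466
print(remaining_gb)        # 70
print(backend_old_gen_gb)  # 2
```

Almost the entire Backend heap is young generation, which leaves only about 2 GB per Backend for the old generation.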
max-jOPS = jOPS passed before the First Failure
Pass/Fail Pass Pass Fail Fail Fail
jOPS 672893 681410 689928 698446 706963
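Read literally, the definition above picks the last injection rate that passed before the first failing step. The sketch below applies that reading to the five steps shown; it is a simplified illustration with a hypothetical function name, not the benchmark's own implementation.

```python
# (jOPS, passed) pairs copied from the Pass/Fail and jOPS rows above.
steps = [
    (672893, True),
    (681410, True),
    (689928, False),
    (698446, False),
    (706963, False),
]


def max_jops(steps):
    """Return the last passing jOPS value before the first failure."""
    best = 0
    for jops, passed in steps:
        if not passed:
            break
        best = jops
    return best


print(max_jops(steps))  # 681410
```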
critical-jOPS = Geomean ( jOPS @ 10000; 25000; 50000; 75000; 100000; SLAs )
Response time percentile is 99-th
SLA (us) 10000 25000 50000 75000 100000 Geomean
jOPS 557905 609010 643081 668634 681410 630390
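The Geomean column can be reproduced from the five per-SLA values in the row above. The sketch below computes the geometric mean through logarithms; because the report shows whole numbers, the result should land very close to the published 630390 rather than match it digit for digit.

```python
import math

# jOPS at the 10000, 25000, 50000, 75000, and 100000 us SLAs, copied from the row above.
sla_jops = [557905, 609010, 643081, 668634, 681410]

# Geometric mean: exp of the arithmetic mean of the logarithms.
critical_jops = math.exp(sum(math.log(v) for v in sla_jops) / len(sla_jops))

print(round(critical_jops))
```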
  Percentile
  10-th 50-th 90-th 95-th 99-th 100-th
500us 8518 / 17035 - / 8518 - / 8518 - / 8518 - / 8518 - / 8518
1000us 562164 / 570681 59623 / 68141 25553 / 34071 17035 / 25553 - / 8518 - / 8518
5000us 681410 / - 681410 / - 596234 / 604752 570681 / 579199 434399 / 442917 42588 / 34071
10000us 681410 / - 681410 / - 681410 / - 664375 / 672893 553646 / 562164 85176 / 34071
25000us 681410 / - 681410 / - 681410 / - 681410 / - 604752 / 613269 85176 / 34071
50000us 681410 / - 681410 / - 681410 / - 681410 / - 638822 / 647340 85176 / 34071
75000us 681410 / - 681410 / - 681410 / - 681410 / - 664375 / 672893 340705 / 51106
100000us 681410 / - 681410 / - 681410 / - 681410 / - 681410 / - 485505 / 153317
200000us 681410 / - 681410 / - 681410 / - 681410 / - 681410 / - 655857 / 562164
500000us 681410 / - 681410 / - 681410 / - 681410 / - 681410 / - 681410 / -
1000000us 681410 / - 681410 / - 681410 / - 681410 / - 681410 / - 681410 / -
Probes jOPS / Total jOPS
Request Mix Accuracy
Note
(Actual % in the Mix - Expected % in the Mix) must be within:
'Main Tx' limit of +/-5.0% for the requests whose expected % in the mix is >= 10.0%
'Minor Tx' limit of +/-1.0% for the requests whose expected % in the mix is < 10.0%
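The two tolerance rules above can be written as a single check. This is a minimal sketch: the function name is hypothetical, the percentages in the usage lines are made-up illustrations and not values from this run, and treating the limit as inclusive is an assumption.

```python
def within_mix_limits(expected_pct, actual_pct):
    """Apply the 'Main Tx' (+/-5.0%) or 'Minor Tx' (+/-1.0%) limit from the note above."""
    limit = 5.0 if expected_pct >= 10.0 else 1.0
    return abs(actual_pct - expected_pct) <= limit  # inclusive bound is an assumption


# Hypothetical request types, for illustration only.
print(within_mix_limits(30.0, 33.5))  # main Tx, deviation 3.5  -> True
print(within_mix_limits(30.0, 36.0))  # main Tx, deviation 6.0  -> False
print(within_mix_limits(2.0, 2.5))    # minor Tx, deviation 0.5 -> True
print(within_mix_limits(2.0, 3.5))    # minor Tx, deviation 1.5 -> False
```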
There were no non-critical failures in Response Time curve building
Delay between status pings
IR/PR Accuracy
This section lists properties only set by user
Property Name Default Controller Group1.Backend.beJVM Group1.TxInjector.txiJVM1 Group10.Backend.beJVM Group10.TxInjector.txiJVM1 Group11.Backend.beJVM Group11.TxInjector.txiJVM1 Group12.Backend.beJVM Group12.TxInjector.txiJVM1 Group2.Backend.beJVM Group2.TxInjector.txiJVM1 Group3.Backend.beJVM Group3.TxInjector.txiJVM1 Group4.Backend.beJVM Group4.TxInjector.txiJVM1 Group5.Backend.beJVM Group5.TxInjector.txiJVM1 Group6.Backend.beJVM Group6.TxInjector.txiJVM1 Group7.Backend.beJVM Group7.TxInjector.txiJVM1 Group8.Backend.beJVM Group8.TxInjector.txiJVM1 Group9.Backend.beJVM Group9.TxInjector.txiJVM1
specjbb.comm.connect.client.pool.size 256 256 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120
specjbb.comm.connect.selector.runner.count 0 4
specjbb.comm.connect.worker.pool.max 256 256 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120 120
specjbb.comm.connect.worker.pool.min 1 1 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24 24
specjbb.controller.handshake.period 5000 20000
specjbb.controller.handshake.timeout 600000 900000
specjbb.forkjoin.workers 512 {Tier1=127, Tier2=1, Tier3=8}
specjbb.group.count 1 12
specjbb.heartbeat.period 10000 2000
specjbb.heartbeat.threshold 100000 9000000
specjbb.mapreducer.pool.size 512 6
specjbb.txi.pergroup.count 1 1
 
Level: COMPLIANCE
Check Agent Result
Check properties on compliance All PASSED
 
Level: CORRECTNESS
Check Agent Result
Compare SM and HQ Inventory All PASSED
High-bound (max attempted) is 851763 IR
High-bound (settled) is 710003 IR