CPU2006 license: | 9006 | Test date: | Mar-2016 |
---|---|---|---|
Test sponsor: | NEC Corporation | Hardware Availability: | Jun-2016 |
Tested by: | NEC Corporation | Software Availability: | Jan-2016 |
Hardware | |
---|---|
CPU Name: | Intel Xeon E5-2698 v4 |
CPU Characteristics: | Intel Turbo Boost Technology up to 3.60 GHz |
CPU MHz: | 2200 |
FPU: | Integrated |
CPU(s) enabled: | 40 cores, 2 chips, 20 cores/chip, 2 threads/core |
CPU(s) orderable: | 1,2 chips |
Primary Cache: | 32 KB I + 32 KB D on chip per core |
Secondary Cache: | 256 KB I+D on chip per core |
L3 Cache: | 50 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 256 GB (8 x 32 GB 2Rx4 PC4-2400T-R) |
Disk Subsystem: | 1 x 400 GB SATA, SSD |
Other Hardware: | None |
Software | |
---|---|
Operating System: | Red Hat Enterprise Linux Server release 7.2 (Maipo) Kernel 3.10.0-327.4.5.el7.x86_64 |
Compiler: | C/C++: Version 16.0.0.101 of Intel C++ Studio XE for Linux; Fortran: Version 16.0.0.101 of Intel Fortran Studio XE for Linux |
Auto Parallel: | No |
File System: | ext4 |
System State: | Run level 3 (multi-user) |
Base Pointers: | 32/64-bit |
Peak Pointers: | 32/64-bit |
Other Software: | None |

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

Benchmark | Base Copies | Base Run 1 Seconds | Base Run 1 Ratio | Base Run 2 Seconds | Base Run 2 Ratio | Base Run 3 Seconds | Base Run 3 Ratio | Peak Copies | Peak Run 1 Seconds | Peak Run 1 Ratio | Peak Run 2 Seconds | Peak Run 2 Ratio | Peak Run 3 Seconds | Peak Run 3 Ratio |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
410.bwaves | 80 | 1609 | 676 | 1615 | 673 | 1611 | 675 | 80 | 1609 | 676 | 1615 | 673 | 1611 | 675 |
416.gamess | 80 | 1168 | 1340 | 1164 | 1350 | 1167 | 1340 | 80 | 1131 | 1390 | 1269 | 1230 | 1127 | 1390 |
433.milc | 80 | 1141 | 644 | 1144 | 642 | 1143 | 643 | 80 | 1141 | 644 | 1144 | 642 | 1143 | 643 |
434.zeusmp | 80 | 676 | 1080 | 657 | 1110 | 647 | 1120 | 80 | 676 | 1080 | 657 | 1110 | 647 | 1120 |
435.gromacs | 80 | 350 | 1630 | 361 | 1580 | 353 | 1620 | 80 | 337 | 1700 | 337 | 1700 | 336 | 1700 |
436.cactusADM | 80 | 738 | 1300 | 739 | 1290 | 744 | 1290 | 80 | 738 | 1300 | 739 | 1290 | 744 | 1290 |
437.leslie3d | 80 | 1552 | 485 | 1557 | 483 | 1547 | 486 | 80 | 1552 | 485 | 1557 | 483 | 1547 | 486 |
444.namd | 80 | 567 | 1130 | 569 | 1130 | 628 | 1020 | 80 | 565 | 1140 | 649 | 989 | 565 | 1140 |
447.dealII | 80 | 438 | 2090 | 441 | 2070 | 439 | 2090 | 80 | 438 | 2090 | 441 | 2070 | 439 | 2090 |
450.soplex | 80 | 1301 | 513 | 1295 | 515 | 1300 | 513 | 40 | 555 | 601 | 556 | 600 | 555 | 601 |
453.povray | 80 | 248 | 1720 | 246 | 1730 | 245 | 1740 | 80 | 254 | 1680 | 212 | 2010 | 224 | 1900 |
454.calculix | 80 | 371 | 1780 | 357 | 1850 | 353 | 1870 | 80 | 371 | 1780 | 357 | 1850 | 353 | 1870 |
459.GemsFDTD | 80 | 1795 | 473 | 1792 | 474 | 1799 | 472 | 80 | 1795 | 473 | 1792 | 474 | 1799 | 472 |
465.tonto | 80 | 691 | 1140 | 689 | 1140 | 676 | 1160 | 80 | 642 | 1230 | 643 | 1230 | 634 | 1240 |
470.lbm | 80 | 1160 | 947 | 1161 | 947 | 1161 | 947 | 80 | 1160 | 947 | 1161 | 947 | 1161 | 947 |
481.wrf | 80 | 1111 | 805 | 1106 | 808 | 1118 | 799 | 80 | 1111 | 805 | 1106 | 808 | 1118 | 799 |
482.sphinx3 | 80 | 1677 | 930 | 1682 | 927 | 1697 | 919 | 80 | 1677 | 930 | 1682 | 927 | 1697 | 919 |
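
The ratios in the table can be cross-checked with a short script. The sketch below is illustrative only: it takes the median of the three base ratios for each benchmark and combines the medians with a geometric mean, which is how SPEC rate metrics are aggregated. Because it starts from the rounded ratios printed above, the result only approximates an official metric. The `rate_ratio` helper and the 8340-second reference time for 450.soplex are assumptions that are not stated in this report.

```python
import math
from statistics import median

# Base rate ratios copied from the results table, three runs per benchmark.
BASE_RATIOS = {
    "410.bwaves": (676, 673, 675),
    "416.gamess": (1340, 1350, 1340),
    "433.milc": (644, 642, 643),
    "434.zeusmp": (1080, 1110, 1120),
    "435.gromacs": (1630, 1580, 1620),
    "436.cactusADM": (1300, 1290, 1290),
    "437.leslie3d": (485, 483, 486),
    "444.namd": (1130, 1130, 1020),
    "447.dealII": (2090, 2070, 2090),
    "450.soplex": (513, 515, 513),
    "453.povray": (1720, 1730, 1740),
    "454.calculix": (1780, 1850, 1870),
    "459.GemsFDTD": (473, 474, 472),
    "465.tonto": (1140, 1140, 1160),
    "470.lbm": (947, 947, 947),
    "481.wrf": (805, 808, 799),
    "482.sphinx3": (930, 927, 919),
}


def median_ratios(runs):
    """Return the median of the measured ratios for each benchmark."""
    return {name: median(values) for name, values in runs.items()}


def geometric_mean(values):
    """Geometric mean, computed through logarithms for numerical stability."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


def rate_ratio(copies, reference_seconds, run_seconds):
    """Rate-style ratio: copies * reference time / measured time.

    The reference time is an input assumed by the caller; this report
    does not list reference times.
    """
    return copies * reference_seconds / run_seconds


medians = median_ratios(BASE_RATIOS)
overall = geometric_mean(medians.values())
print(f"approximate geometric mean of base medians: {overall:.0f}")
```

With the assumed reference time, `rate_ratio(80, 8340, 1301)` rounds to 513 and `rate_ratio(40, 8340, 555)` rounds to 601, matching the first base and peak runs of 450.soplex. This illustrates why the 40-copy peak run scores higher than the 80-copy base run even though it runs fewer copies.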
The numactl mechanism was used to bind copies to processors. The config file option 'submit' was used to generate numactl commands to bind each copy to a specific processor. For details, please see the config file.
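
The config file itself is not reproduced in this report, so the following is only a sketch of what per-copy binding could look like. It assumes that copy N is pinned to logical CPU N and that memory is allocated on the local node; `--physcpubind` and `--localalloc` are standard numactl options, but whether this run used exactly these options is an assumption. The function name and the `./benchmark_exe` command are hypothetical.

```python
def numactl_commands(copies, benchmark_command):
    """Build one numactl-wrapped command line per benchmark copy.

    Assumption: copy number N is bound to logical CPU N. The real
    mapping is defined by the 'submit' line in the config file.
    """
    return [
        f"numactl --localalloc --physcpubind={cpu} {benchmark_command}"
        for cpu in range(copies)
    ]


# 80 copies matches the 80 hardware threads of the tested system
# (2 chips x 20 cores x 2 threads).
commands = numactl_commands(80, "./benchmark_exe")
for line in commands[:3]:
    print(line)
```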
Stack size set to unlimited using "ulimit -s unlimited"
BIOS Settings:
Energy Performance: Performance
Patrol Scrub: Disabled
Cluster on Die: Enabled
Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/home/cpu2006/libs/32:/home/cpu2006/libs/64:/home/cpu2006/sh"

Binaries compiled on a system with 1x Intel Core i5-4670K CPU + 32GB memory using RedHat EL 7.1
Transparent Huge Pages enabled with:
echo always > /sys/kernel/mm/transparent_hugepage/enabled
Filesystem page cache cleared with:
echo 1 > /proc/sys/vm/drop_caches
runspec command invoked through numactl i.e.:
numactl --interleave=all runspec <etc>
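Base Compiler Invocation | |
---|---|
C benchmarks: | icc -m64 |
C++ benchmarks: | icpc -m64 |
Fortran benchmarks: | ifort -m64 |
Benchmarks using both Fortran and C: | icc -m64 ifort -m64 |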
Base Portability Flags | |
---|---|
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
Base Optimization Flags | |
---|---|
C benchmarks: | -xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
C++ benchmarks: | -xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
Fortran benchmarks: | -xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch |
Benchmarks using both Fortran and C: | -xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
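Peak Compiler Invocation | |
---|---|
C benchmarks: | icc -m64 |
C++ benchmarks (except as noted below): | icpc -m64 |
450.soplex: | icpc -m32 -L/opt/intel/compilers_and_libraries_2016/linux/compiler/lib/ia32_lin |
Fortran benchmarks: | ifort -m64 |
Benchmarks using both Fortran and C: | icc -m64 ifort -m64 |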
Peak Portability Flags | |
---|---|
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -D_FILE_OFFSET_BITS=64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
Peak Optimization Flags | |
---|---|
433.milc: | basepeak = yes |
470.lbm: | basepeak = yes |
482.sphinx3: | basepeak = yes |
444.namd: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -fno-alias -auto-ilp32 |
447.dealII: | basepeak = yes |
450.soplex: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -opt-malloc-options=3 |
453.povray: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -unroll4 -ansi-alias |
410.bwaves: | basepeak = yes |
416.gamess: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -prof-use(pass 2) -unroll2 -inline-level=0 -scalar-rep- |
434.zeusmp: | basepeak = yes |
437.leslie3d: | basepeak = yes |
459.GemsFDTD: | basepeak = yes |
465.tonto: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -prof-use(pass 2) -unroll4 -auto -inline-calloc -opt-malloc-options=3 |
435.gromacs: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -opt-prefetch -auto-ilp32 |
436.cactusADM: | basepeak = yes |
454.calculix: | basepeak = yes |
481.wrf: | basepeak = yes |
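
The `(pass 1)` and `(pass 2)` annotations in the flag lists above mark a two-pass, profile-guided build: the first pass compiles with `-prof-gen` to collect a profile and the second pass recompiles with `-prof-use`. As a reading aid, the sketch below splits one such flag string into the flags for each pass. It assumes that flags without an annotation apply to both passes; the function name and the regular expression are illustrative and not part of any SPEC tool.

```python
import re

# A flag is a run of non-space characters starting with "-", optionally
# followed directly by a "(pass N)" annotation.
FLAG_RE = re.compile(r"(-\S+?)(?:\(pass (\d+)\))?(?=\s|$)")

# Peak flags for 444.namd, copied from the list above.
NAMD_PEAK_FLAGS = (
    "-xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) "
    "-O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) "
    "-opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -fno-alias -auto-ilp32"
)


def split_by_pass(flag_string):
    """Return {1: [...], 2: [...]} with the flags used in each build pass.

    Assumption: a flag with no "(pass N)" annotation is used in both passes.
    """
    passes = {1: [], 2: []}
    for match in FLAG_RE.finditer(flag_string):
        flag, pass_number = match.group(1), match.group(2)
        targets = [int(pass_number)] if pass_number else [1, 2]
        for target in targets:
            passes[target].append(flag)
    return passes


namd_passes = split_by_pass(NAMD_PEAK_FLAGS)
print("pass 1:", " ".join(namd_passes[1]))
print("pass 2:", " ".join(namd_passes[2]))
```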