| CPU2006 license: | 3 | Test date: | May-2009 |
|---|---|---|---|
| Test sponsor: | Hewlett-Packard Company | Hardware Availability: | Jun-2009 |
| Tested by: | Hewlett-Packard Company | Software Availability: | Apr-2009 |
| Hardware | |
|---|---|
| CPU Name: | AMD Opteron 8435 |
| CPU Characteristics: | |
| CPU MHz: | 2600 |
| FPU: | Integrated |
| CPU(s) enabled: | 24 cores, 4 chips, 6 cores/chip |
| CPU(s) orderable: | 2,4 chips |
| Primary Cache: | 64 KB I + 64 KB D on chip per core |
| Secondary Cache: | 512 KB I+D on chip per core |
| L3 Cache: | 6 MB I+D on chip per chip |
| Other Cache: | None |
| Memory: | 64 GB (16x4 GB, PC2-6400P CL5) |
| Disk Subsystem: | 2x146 GB 10 K SAS |
| Other Hardware: | None |
| Software | |
|---|---|
| Operating System: | Red Hat Enterprise Linux Server release 5.3, Advanced Platform, Kernel 2.6.18-128.el5 |
| Compiler: | PGI Server Complete Version 8.0 x86 Open64 4.2.2 Compiler Suite |
| Auto Parallel: | Yes |
| File System: | ext3 |
| System State: | Run level 3 (multi-user) |
| Base Pointers: | 64-bit |
| Peak Pointers: | 32/64-bit |
| Other Software: | binutils 2.18 |

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

| Benchmark | Base Copies | Base Run 1 Seconds | Base Run 1 Ratio | Base Run 2 Seconds | Base Run 2 Ratio | Base Run 3 Seconds | Base Run 3 Ratio | Peak Copies | Peak Run 1 Seconds | Peak Run 1 Ratio | Peak Run 2 Seconds | Peak Run 2 Ratio | Peak Run 3 Seconds | Peak Run 3 Ratio |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 410.bwaves | 24 | 1591 | 205 | 1592 | 205 | 1592 | 205 | 24 | 1550 | 210 | 1554 | 210 | 1550 | 210 |
| 416.gamess | 24 | 1214 | 387 | 1214 | 387 | 1212 | 388 | 24 | 1131 | 416 | 1176 | 400 | 1145 | 410 |
| 433.milc | 24 | 1411 | 156 | 1411 | 156 | 1411 | 156 | 24 | 1411 | 156 | 1411 | 156 | 1411 | 156 |
| 434.zeusmp | 24 | 759 | 288 | 761 | 287 | 755 | 289 | 24 | 759 | 288 | 761 | 287 | 755 | 289 |
| 435.gromacs | 24 | 519 | 330 | 526 | 326 | 515 | 333 | 24 | 429 | 400 | 426 | 402 | 442 | 388 |
| 436.cactusADM | 24 | 962 | 298 | 954 | 301 | 957 | 300 | 4 | 134 | 357 | 134 | 356 | 135 | 355 |
| 437.leslie3d | 24 | 1744 | 129 | 1741 | 130 | 1741 | 130 | 24 | 1645 | 137 | 1642 | 137 | 1648 | 137 |
| 444.namd | 24 | 632 | 304 | 629 | 306 | 637 | 302 | 24 | 575 | 335 | 570 | 338 | 569 | 338 |
| 447.dealII | 24 | 651 | 422 | 663 | 414 | 652 | 421 | 24 | 492 | 558 | 477 | 576 | 474 | 579 |
| 450.soplex | 24 | 1279 | 156 | 1261 | 159 | 1242 | 161 | 24 | 1283 | 156 | 1144 | 175 | 1143 | 175 |
| 453.povray | 24 | 336 | 380 | 333 | 383 | 332 | 385 | 24 | 286 | 447 | 297 | 430 | 307 | 415 |
| 454.calculix | 24 | 476 | 416 | 476 | 416 | 474 | 417 | 24 | 421 | 470 | 421 | 470 | 421 | 471 |
| 459.GemsFDTD | 24 | 2028 | 126 | 2025 | 126 | 2023 | 126 | 24 | 1957 | 130 | 1960 | 130 | 1959 | 130 |
| 465.tonto | 24 | 754 | 313 | 750 | 315 | 752 | 314 | 24 | 626 | 377 | 634 | 373 | 638 | 370 |
| 470.lbm | 24 | 2725 | 121 | 2724 | 121 | 2722 | 121 | 24 | 2719 | 121 | 2717 | 121 | 2718 | 121 |
| 481.wrf | 24 | 1138 | 236 | 1141 | 235 | 1138 | 235 | 24 | 1101 | 244 | 1103 | 243 | 1105 | 243 |
| 482.sphinx3 | 24 | 1616 | 289 | 1627 | 288 | 1610 | 290 | 24 | 1502 | 311 | 1501 | 312 | 1511 | 310 |
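
The note about median measurements can be made concrete with a small sketch. The bold and underline formatting did not survive in the table, so the snippet below recomputes the median from the three reported ratios for a few benchmarks and derives the base-to-peak gain. The dictionary `RATIOS` and the helper names are illustrative only, and the published page picks its medians from unrounded measurements, so ties on the rounded values shown here may resolve differently.

```python
from statistics import median

# Ratios copied from the results table: (three base runs, three peak runs).
RATIOS = {
    "416.gamess": ((387, 387, 388), (416, 400, 410)),
    "436.cactusADM": ((298, 301, 300), (357, 356, 355)),
    "447.dealII": ((422, 414, 421), (558, 576, 579)),
    "470.lbm": ((121, 121, 121), (121, 121, 121)),
}


def median_ratio(runs):
    """Return the median of the reported ratios for one benchmark."""
    return median(runs)


def peak_uplift_percent(base_runs, peak_runs):
    """Percentage gain of the median peak ratio over the median base ratio."""
    base = median_ratio(base_runs)
    peak = median_ratio(peak_runs)
    return 100.0 * (peak - base) / base


if __name__ == "__main__":
    for name, (base_runs, peak_runs) in RATIOS.items():
        print(
            f"{name}: base {median_ratio(base_runs)}, "
            f"peak {median_ratio(peak_runs)}, "
            f"uplift {peak_uplift_percent(base_runs, peak_runs):.1f}%"
        )
```

When reading the output, note that 436.cactusADM peak runs 4 copies rather than 24 and is built with -Mconcur, so its gain reflects a different run configuration as well as different flags.
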
The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details.
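
As a rough illustration of what binding copies to cores can look like, the sketch below builds one `numactl` command line per copy. It is not the submit line from the actual configuration file, which is not reproduced here. It assumes that copy N is pinned to core N, that cores are numbered consecutively within each chip, and that each chip is one NUMA node; the function name `numactl_command` and the example binary name are hypothetical.

```python
CORES_PER_CHIP = 6  # from the hardware table: 6 cores/chip
CHIPS = 4           # from the hardware table: 4 chips


def numactl_command(copy_num, command):
    """Build an argv list that pins one copy to one core and its local node.

    Assumption: core numbering is consecutive per chip, one NUMA node per chip.
    """
    if not 0 <= copy_num < CORES_PER_CHIP * CHIPS:
        raise ValueError("copy number outside the 24 available cores")
    node = copy_num // CORES_PER_CHIP
    return [
        "numactl",
        f"--physcpubind={copy_num}",  # run only on this core
        f"--membind={node}",          # allocate memory only from this node
    ] + list(command)


if __name__ == "__main__":
    # "./benchmark_exe" is a placeholder, not a real SPEC binary name.
    for copy_num in (0, 7, 23):
        print(" ".join(numactl_command(copy_num, ["./benchmark_exe"])))
```

Keeping each copy's memory on the node that owns its core avoids remote memory accesses across chips, which is the usual motivation for this kind of binding on a four-socket system.
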
'ulimit -s unlimited' was used to set environment stack size.
'ulimit -l 2097152' was used to set environment locked pages in memory limit.
The libhugetlbfs libraries were installed using the installation rpms that came with the distribution.
Set vm/nr_hugepages=10800 in /etc/sysctl.conf.
Mounted hugetlbfs with 'mount -t hugetlbfs nodev /mnt/hugepages'.
BIOS configuration: Power Regulator set to Static High Performance Mode
Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "450"
LD_LIBRARY_PATH = "/cpu2006/amd0905is-libs/64:/cpu2006/amd0905is-libs/32"
NCPUS = "6"
PGI_HUGE_PAGES = "450"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64.
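
The huge page settings in the notes above can be cross-checked with a little arithmetic. The sketch below assumes a 2 MiB huge page size (the usual x86-64 default, not stated in this report) and assumes that HUGETLB_LIMIT acts as a per-copy cap on huge pages; both are labeled as assumptions in the code. Under those assumptions, 24 copies at 450 pages each account for exactly the 10800 pages reserved in vm/nr_hugepages.

```python
HUGE_PAGE_MIB = 2      # assumption: 2 MiB huge pages, not stated in the report
NR_HUGEPAGES = 10800   # vm/nr_hugepages from /etc/sysctl.conf
HUGETLB_LIMIT = 450    # environment variable set by runspec
COPIES = 24            # copies per benchmark in the results table


def pool_size_mib(pages, page_mib=HUGE_PAGE_MIB):
    """Total size of the reserved huge page pool in MiB."""
    return pages * page_mib


def total_limit_pages(limit_per_copy, copies):
    """Pages needed if every copy uses its full limit (assumed per-copy cap)."""
    return limit_per_copy * copies


if __name__ == "__main__":
    pool = pool_size_mib(NR_HUGEPAGES)
    print(f"reserved pool: {pool} MiB ({pool / 1024:.1f} GiB)")
    print(f"pages at full per-copy limit: {total_limit_pages(HUGETLB_LIMIT, COPIES)}")
```

With these assumptions the reserved pool comes to 21600 MiB, roughly a third of the 64 GB of installed memory.
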
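| Base Compiler Invocation | |
|---|---|
| C benchmarks: | pgcc |
| C++ benchmarks: | pgcpp |
| Fortran benchmarks: | pgf95 |
| Benchmarks using both Fortran and C: | pgcc pgf95 |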
| Base Portability Flags | |
|---|---|
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 -Mnomain |
| 436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
| 437.leslie3d: | -DSPEC_CPU_LP64 |
| 444.namd: | -DSPEC_CPU_LP64 |
| 447.dealII: | -DSPEC_CPU_LP64 |
| 450.soplex: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 482.sphinx3: | -DSPEC_CPU_LP64 |
| Base Optimization Flags | |
|---|---|
| C benchmarks: | -fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| C++ benchmarks: | -fastsse -Msmartalloc=huge -Mfprelaxed --zc_eh -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| Fortran benchmarks: | -fastsse -Msmartalloc=huge -Mfprelaxed -Mvect=short -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| Benchmarks using both Fortran and C: | -fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Mvect=short -Bstatic_pgi |
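| Base Other Flags | |
|---|---|
| C benchmarks: | -Mipa=jobs:4 |
| C++ benchmarks: | -Mipa=jobs:4 |
| Fortran benchmarks: | -Mipa=jobs:4 |
| Benchmarks using both Fortran and C: | -Mipa=jobs:4 |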
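| Peak Compiler Invocation | |
|---|---|
| C benchmarks: | pgcc |
| C++ benchmarks (except as noted below): | openCC |
| 444.namd: | pgcpp |
| Fortran benchmarks (except as noted below): | openf95 |
| 410.bwaves: | pgf95 |
| 434.zeusmp: | pgf95 |
| 437.leslie3d: | pgf95 |
| Benchmarks using both Fortran and C (except as noted below): | pgcc pgf95 |
| 435.gromacs: | opencc openf95 |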
| Peak Portability Flags | |
|---|---|
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 |
| 436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
| 437.leslie3d: | -DSPEC_CPU_LP64 |
| 444.namd: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 482.sphinx3: | -DSPEC_CPU_LP64 |
| Peak Optimization Flags | |
|---|---|
| 433.milc: | basepeak = yes |
| 470.lbm: | -fastsse -Msmartalloc=huge -Mprefetch=t0 -Mloop32 -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| 482.sphinx3: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mfprelaxed -Msmartalloc -tp shanghai-64 -Bstatic_pgi |
| 410.bwaves: | -fastsse -Msmartalloc -Mprefetch=nta -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| 416.gamess: | -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O2 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 -HP:bdt=2m:heap=2m |
| 434.zeusmp: | basepeak = yes |
| 437.leslie3d: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=fuse -Msmartalloc=huge -Mprefetch=distance:8 -Mprefetch=t0 -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
| 459.GemsFDTD: | -march=barcelona -Ofast -LNO:fission=2 -LNO:simd=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 -HP |
| 465.tonto: | -march=barcelona -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 -HP |
| 435.gromacs: | -march=barcelona -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m |
| 436.cactusADM: | -fastsse -Mconcur -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| 454.calculix: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=short -Msmartalloc=huge -Mprefetch=t0 -Mpre -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
| 481.wrf: | -fastsse -Mvect=noaltcode -Msmartalloc=huge -Mprefetch=distance:8 -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
| Peak Other Flags | |
|---|---|
| -Mipa=jobs:4(pass 2) |
| 444.namd: | -Mipa=jobs:4(pass 2) |
| 410.bwaves: | -Mipa=jobs:4 |
| 434.zeusmp: | -Mipa=jobs:4 |
| 437.leslie3d: | -Mipa=jobs:4(pass 2) |
| 436.cactusADM: | -Mipa=jobs:4 |
| 454.calculix: | -Mipa=jobs:4(pass 2) |