| CPU2006 license: | 55 | Test date: | Jun-2009 |
|---|---|---|---|
| Test sponsor: | Dell Inc. | Hardware Availability: | Jul-2009 |
| Tested by: | Dell Inc. | Software Availability: | Apr-2009 |
| Hardware | |
|---|---|
| CPU Name: | AMD Opteron 8431 |
| CPU Characteristics: | |
| CPU MHz: | 2400 |
| FPU: | Integrated |
| CPU(s) enabled: | 24 cores, 4 chips, 6 cores/chip |
| CPU(s) orderable: | 2,4 chips |
| Primary Cache: | 64 KB I + 64 KB D on chip per core |
| Secondary Cache: | 512 KB I+D on chip per core |
| L3 Cache: | 6 MB I+D on chip per chip |
| Other Cache: | None |
| Memory: | 64 GB (16 x 4 GB DDR2-800) |
| Disk Subsystem: | 1 x 73 GB 15000 RPM SAS |
| Other Hardware: | None |
| Software | |
|---|---|
| Operating System: | Red Hat Enterprise Linux Server release 5.3, Kernel 2.6.18-128.el5 |
| Compiler: | PGI Server Complete Version 8.0 x86 Open64 4.2.2 Compiler Suite (from AMD) |
| Auto Parallel: | Yes |
| File System: | ext3 |
| System State: | Run level 3 (Full multiuser with network) |
| Base Pointers: | 64-bit |
| Peak Pointers: | 32/64-bit |
| Other Software: | binutils 2.18 |
| Benchmark | Base | Peak | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
| Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
| 410.bwaves | 24 | 1614 | 202 | 1614 | 202 | 1613 | 202 | 24 | 1567 | 208 | 1570 | 208 | 1570 | 208 |
| 416.gamess | 24 | 1301 | 361 | 1299 | 362 | 1299 | 362 | 24 | 1210 | 388 | 1223 | 384 | 1273 | 369 |
| 433.milc | 24 | 1415 | 156 | 1416 | 156 | 1415 | 156 | 24 | 1415 | 156 | 1416 | 156 | 1415 | 156 |
| 434.zeusmp | 24 | 783 | 279 | 785 | 278 | 784 | 279 | 24 | 783 | 279 | 785 | 278 | 784 | 279 |
| 435.gromacs | 24 | 555 | 309 | 555 | 309 | 563 | 304 | 24 | 458 | 374 | 451 | 380 | 456 | 376 |
| 436.cactusADM | 24 | 992 | 289 | 994 | 289 | 995 | 288 | 4 | 140 | 341 | 140 | 342 | 139 | 343 |
| 437.leslie3d | 24 | 1746 | 129 | 1744 | 129 | 1746 | 129 | 24 | 1650 | 137 | 1650 | 137 | 1647 | 137 |
| 444.namd | 24 | 675 | 285 | 673 | 286 | 674 | 285 | 24 | 617 | 312 | 612 | 315 | 611 | 315 |
| 447.dealII | 24 | 677 | 406 | 679 | 404 | 678 | 405 | 24 | 501 | 548 | 503 | 546 | 505 | 544 |
| 450.soplex | 24 | 1251 | 160 | 1253 | 160 | 1254 | 160 | 24 | 1164 | 172 | 1167 | 171 | 1159 | 173 |
| 453.povray | 24 | 351 | 364 | 358 | 357 | 392 | 325 | 24 | 296 | 432 | 317 | 403 | 294 | 434 |
| 454.calculix | 24 | 506 | 391 | 504 | 393 | 506 | 391 | 24 | 449 | 441 | 447 | 443 | 446 | 444 |
| 459.GemsFDTD | 24 | 2025 | 126 | 2025 | 126 | 2029 | 125 | 24 | 1963 | 130 | 1965 | 130 | 1965 | 130 |
| 465.tonto | 24 | 788 | 300 | 787 | 300 | 788 | 300 | 24 | 665 | 355 | 673 | 351 | 669 | 353 |
| 470.lbm | 24 | 2718 | 121 | 2719 | 121 | 2718 | 121 | 24 | 2711 | 122 | 2712 | 122 | 2711 | 122 |
| 481.wrf | 24 | 1150 | 233 | 1146 | 234 | 1152 | 233 | 24 | 1114 | 241 | 1111 | 241 | 1109 | 242 |
| 482.sphinx3 | 24 | 1670 | 280 | 1630 | 287 | 1689 | 277 | 24 | 1536 | 304 | 1544 | 303 | 1536 | 305 |
The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details.
'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set vm/nr_hugepages=10800 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages
Environment variables set by runspec before the start of the run: HUGETLB_LIMIT = "450" LD_LIBRARY_PATH = "/root/cpu2006-1.1/amd0905is-libs/64:/root/cpu2006-1.1/amd0905is-libs/32" NCPUS = "6" PGI_HUGE_PAGES = "450" The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64
| pgcc |
| pgcpp |
| pgf95 |
| pgcc pgf95 |
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 -Mnomain |
| 436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
| 437.leslie3d: | -DSPEC_CPU_LP64 |
| 444.namd: | -DSPEC_CPU_LP64 |
| 447.dealII: | -DSPEC_CPU_LP64 |
| 450.soplex: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 482.sphinx3: | -DSPEC_CPU_LP64 |
| -fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| -fastsse -Msmartalloc=huge -Mfprelaxed --zc_eh -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| -fastsse -Msmartalloc=huge -Mfprelaxed -Mvect=short -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| -fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Mvect=short -Bstatic_pgi |
| -Mipa=jobs:4 |
| -Mipa=jobs:4 |
| -Mipa=jobs:4 |
| -Mipa=jobs:4 |
| pgcc |
| openCC | |
| 444.namd: | pgcpp |
| openf95 | |
| 410.bwaves: | pgf95 |
| 434.zeusmp: | pgf95 |
| 437.leslie3d: | pgf95 |
| pgcc pgf95 | |
| 435.gromacs: | opencc openf95 |
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 |
| 436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
| 437.leslie3d: | -DSPEC_CPU_LP64 |
| 444.namd: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 482.sphinx3: | -DSPEC_CPU_LP64 |
| 433.milc: | basepeak = yes |
| 470.lbm: | -fastsse -Msmartalloc=huge -Mprefetch=t0 -Mloop32 -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| 482.sphinx3: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mfprelaxed -Msmartalloc -tp shanghai-64 -Bstatic_pgi |
| 410.bwaves: | -fastsse -Msmartalloc -Mprefetch=nta -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| 416.gamess: | -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O2 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 -HP:bdt=2m:heap=2m |
| 434.zeusmp: | basepeak = yes |
| 437.leslie3d: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=fuse -Msmartalloc=huge -Mprefetch=distance:8 -Mprefetch=t0 -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
| 459.GemsFDTD: | -march=barcelona -Ofast -LNO:fission=2 -LNO:simd=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 -HP |
| 465.tonto: | -march=barcelona -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 -HP |
| 435.gromacs: | -march=barcelona -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m |
| 436.cactusADM: | -fastsse -Mconcur -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
| 454.calculix: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=short -Msmartalloc=huge -Mprefetch=t0 -Mpre -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
| 481.wrf: | -fastsse -Mvect=noaltcode -Msmartalloc=huge -Mprefetch=distance:8 -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
| -Mipa=jobs:4(pass 2) |
| 444.namd: | -Mipa=jobs:4(pass 2) |
| 410.bwaves: | -Mipa=jobs:4 |
| 434.zeusmp: | -Mipa=jobs:4 |
| 437.leslie3d: | -Mipa=jobs:4(pass 2) |
| 436.cactusADM: | -Mipa=jobs:4 |
| 454.calculix: | -Mipa=jobs:4(pass 2) |