SPEC OMPM2001 Summary SGI SGI Altix 3000 (1500MHz, Itanium 2) Thu Jun 3 17:10:45 2004 SPEC License #HPG0014 Tester: SGI Test date: Jun-2004 Test Site: SGI Hardware availability: Jun-2003 Software availability: May-2004 Base Base Base Peak Peak Peak Benchmarks Ref Time Run Time Ratio Ref Time Run Time Ratio ------------- -------- -------- -------- -------- -------- -------- 310.wupwise_m 6000 252 23839* 6000 252 23839* 310.wupwise_m 6000 251 23878 6000 251 23878 312.swim_m 6000 154 38873* 6000 154 38873* 312.swim_m 6000 154 39012 6000 154 39012 314.mgrid_m 7300 195 37350 7300 195 37350 314.mgrid_m 7300 197 37037* 7300 197 37037* 316.applu_m 4000 130 30758 4000 130 30758 316.applu_m 4000 130 30733* 4000 130 30733* 318.galgel_m 5100 413 12344 5100 413 12344 318.galgel_m 5100 416 12260* 5100 416 12260* 320.equake_m 2600 135 19240* 2600 112 23128* 320.equake_m 2600 134 19450 2600 112 23199 324.apsi_m 3400 231 14725* 3400 207 16453 324.apsi_m 3400 230 14759 3400 207 16452* 326.gafort_m 8700 719 12097 8700 648 13432 326.gafort_m 8700 720 12086* 8700 650 13391* 328.fma3d_m 4600 378 12176 4600 337 13642* 328.fma3d_m 4600 378 12175* 4600 337 13643 330.art_m 6400 156 41083* 6400 156 41083* 330.art_m 6400 153 41790 6400 153 41790 332.ammp_m 7000 755 9277* 7000 755 9277* 332.ammp_m 7000 754 9282 7000 754 9282 ======================================================================== 310.wupwise_m 6000 252 23839* 6000 252 23839* 312.swim_m 6000 154 38873* 6000 154 38873* 314.mgrid_m 7300 197 37037* 7300 197 37037* 316.applu_m 4000 130 30733* 4000 130 30733* 318.galgel_m 5100 416 12260* 5100 416 12260* 320.equake_m 2600 135 19240* 2600 112 23128* 324.apsi_m 3400 231 14725* 3400 207 16452* 326.gafort_m 8700 720 12086* 8700 650 13391* 328.fma3d_m 4600 378 12175* 4600 337 13642* 330.art_m 6400 156 41083* 6400 156 41083* 332.ammp_m 7000 755 9277* 7000 755 9277* SPECompMbase2001 20006 SPECompMpeak2001 20958 HARDWARE -------- Hardware Vendor: SGI Model Name: SGI Altix 3000 (1500MHz, Itanium 2) CPU: Intel Itanium 2 CPU MHz: 1500 FPU: Integrated CPU(s) enabled: 16 cores, 16 chips, 1 core/chip CPU(s) orderable: 4-256 Primary Cache: 16KBI + 16KBD (on chip) per core Secondary Cache: 256KB (on chip) per core L3 Cache: 6.0MB (on chip) per core Other Cache: N/A Memory: 64 GB (32*512MB PC2700 DIMMS per 4 core module) Disk Subsystem: 1 x 36 GB SCSI (Seagate Cheetah 15k rpm) Other Hardware: None SOFTWARE -------- OpenMP Threads: 16 Parallel: OpenMP Operating System: SGI ProPack(TM) 3 Compiler: Intel(R) Fortran Compiler for Linux 8.0 (Build 20040519) Intel(R) C++ Compiler for Linux 8.0 (Build 20040519) File System: xfs System State: Multi-user NOTES ----- Baseline optimization flags: C programs: -openmp -O3 -ipo -ansi -ansi_alias -auto_ilp32 (ONESTEP) Fortran programs: -openmp -O3 -ipo (ONESTEP) OpenMP runtime library libguide.a statically linked Portability Flags: 318.galgel_m: -FI -132 Extra Flags: 330.art_m: -DINTS_PER_CACHELINE=32 -DDBLS_PER_CACHELINE=16 Baseline user environment: OMP_NUM_THREADS 16 limit stacksize 64000 KMP_STACKSIZE 31M KMP_LIBRARY TURNAROUND OMP_DYNAMIC FALSE KMP_SCHEDULE static,balanced Peak optimization flags: 310.wupwise_m: basepeak=true 312.swim_m: basepeak=true 314.mgrid_m: basepeak=true 316.applu_m: basepeak=true 318.galgel_m: basepeak=true 320.equake_m: -openmp -O3 -ipo -ansi -ansi_alias -auto_ilp32 (ONESTEP) 324.apsi_m: -openmp -O3 -ipo (ONESTEP) 326.gafort_m: -openmp -O3 -ipo (ONESTEP) 328.fma3d_m: -openmp -O3 -ipo (ONESTEP) 330.art_m: basepeak=true 332.ammp_m: basepeak=true Alternate sources: Add critical region around update of linked list in parallel loop. Approved src.alt available as ompm-purdue1-20040324.tar.gz Used for 330.art_m, base and peak. Peak sources: SPEC OMPL2001 source for 64bit systems modified for SPEC OMPM2001. Available as ompl src.alt in SPEC OMP v3.0 Used for 320.equake_m, 324.apsi_m, 326.gafort_m, and 328.fma3d_m. For all benchmarks threads were bound to cores using the following submit command: dplace -x2 -cNTM1,0 $command, where NTM1 is the number of threads minus 1. This binds threads in order of creation, beginning with the master thread on core NTM1, the first slave thread on core NTM1-1, and so on. The -x2 flag instructs dplace to skip placement of the lightweight OpenMP monitor thread, which is created prior to the slave threads. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 1999-2002 Standard Performance Evaluation Corporation Generated on Wed Jun 23 11:07:12 2004 by SPEC OMP2001 ASCII formatter v2.1