Proceedings of the 1996 ACM/IEEE conference on Supercomputing

Supercomputing '96: Proceedings of the 1996 ACM/IEEE conference on Supercomputing

November 1996

1996 Proceeding

Chairman:
Beverly Clayton

Publisher:

IEEE Computer Society
1730 Massachusetts Ave., NW Washington, DC
United States

Conference:

SC '96: International Conference for High Performance Computing, Networking, Storage and Analysis Pittsburgh Pennsylvania USA 1 January 1996

ISBN:

978-0-89791-854-1

Published:

17 November 1996

Sponsors:

SIGARCH, IEEE-CS

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Next Conference

SC '25

Sponsor:
sighpc

The International Conference for High Performance Computing, Networking, Storage and Analysis

November 16 - 21, 2025

St Louis , MO , USA

SC '25 website

Reflects downloads up to 22 Jan 2025Bibliometrics

Citation Count

1,235

Downloads (6 weeks)

686

Downloads (12 months)

3,437

Downloads (cumulative)

20,617

Sections

Supercomputing '96: Proceedings of the 1996 ACM/IEEE conference on Supercomputing

1996

Previous Next

Abstract

No abstract available.

Proceeding Downloads

PDFFront matter material

Select All

Export Citations Save to Binder

Article

Free

Parallel hierarchical molecular structure estimation

Cheng Che Chen,
Jaswinder Pal Singh,
Russ B. Altman

Pages 1–eshttps://doi.org/10.1145/369028.369031

Determining the structure of biological macromolecules such as proteins and nucleic acids is an important element of molecular biology because of the intimate relation between form and function of these molecules. Individual sources of data about ...

- 4
- 170
Metrics
Total Citations4
Total Downloads170
Last 12 Months29
Last 6 weeks6

Abstract
View online with eReader
PDF

Article

Free

A data-parallel implementation of O(N) hierarchical N-body methods

Yu Hu,
S. Lennart Johnsson

Pages 2–eshttps://doi.org/10.1145/369028.369033

The O(N) hierarchical N-body algorithms and Massively Parallel Processors allow particle systems of 100 million particles or more to be simulated in acceptable time. We present a data-parallel implementation of Anderson's method and demonstrate both ...

- 3
- 312
Metrics
Total Citations3
Total Downloads312
Last 12 Months63
Last 6 weeks13

Abstract
View online with eReader
PDF

Article

Free

The design of a portable scientific tool: a case studying using SnB

Steven M. Gallo,
Russ Miller,
Charles M. Weeks

Pages 3–eshttps://doi.org/10.1145/369028.369035

Developing and maintaining a large software package is a complex task. Decisions are made early in the design process that affect i) the ability of a user to effectively exploit the package and ii) the ability of a software engineer to maintain it. This ...

- 0
- 195
Metrics
Total Citations0
Total Downloads195
Last 12 Months57
Last 6 weeks11

Abstract
View online with eReader
PDF

Article

Free

Runtime performance of parallel array assignment: an empirical study

Lei Wang,
James M. Stichnoth,
Siddhartha Chatterjee

Pages 4–eshttps://doi.org/10.1145/369028.369036

Compiling the array assignment statement of High Performance Fortran in the presence of block-cyclic distributions of data arrays is considered difficult, and several algorithms have been published to solve this problem. We present a comprehensive study ...

- 17
- 383
Metrics
Total Citations17
Total Downloads383
Last 12 Months246
Last 6 weeks52

Abstract
View online with eReader
PDF

Article

Free

ScaLAPACK: a portable linear algebra library for distributed memory computers - design issues and performance

Laura Susan Blackford,
J. Choi,
A. Cleary,
A. Petitet,
R. C. Whaley,
J. Demmel,
I. Dhillon,
K. Stanley,
J. Dongarra,
S. Hammarling,
G. Henry,
D. Walker

Pages 5–eshttps://doi.org/10.1145/369028.369038

This paper outlines the content and performance of ScaLAPACK, a collection of mathematical software for linear algebra computations on distributed memory computers. The importance of developing standards for computational and message passing interfaces ...

- 80
- 505
Metrics
Total Citations80
Total Downloads505
Last 12 Months91
Last 6 weeks21

Abstract
View online with eReader
PDF

Article

Free

Network performance modeling for PVM clusters

Mark J. Clement,
Michael R. Steed,
Phyllis E. Crandall

Pages 6–eshttps://doi.org/10.1145/369028.369040

The advantages of workstation clusters as a parallel computing platform include a superior price-performance ratio, availability, scalability, and ease of incremental growth. However, the performance of traditional LAN technologies such as Ethernet and ...

- 6
- 199
Metrics
Total Citations6
Total Downloads199
Last 12 Months43
Last 6 weeks9

Abstract
View online with eReader
PDF

Article

Free

Scalable parallel algorithms for interactive visualization of curved surfaces

Subodh Kumar,
Chun-Fa Chang,
Dinesh Manocha

Pages 7–eshttps://doi.org/10.1145/369028.369041

We present efficient parallel algorithms for interactive display of higher order surfaces on current graphics systems. At each frame, these algorithms approximate the surface by polygons and rasterize them over the graphics pipeline. The time for ...

- 1
- 184
Metrics
Total Citations1
Total Downloads184
Last 12 Months29
Last 6 weeks11

Abstract
View online with eReader
PDF

Article

Free

STERN: a highly scalable parallel stereo terrain renderer for planetary mission simulations

Ansel Teng,
Scott Whitman,
Meemong Lee

Pages 8–eshttps://doi.org/10.1145/369028.369043

In this paper, we describe STREN, a parallel stereo renderer for fixed-location terrain rendering tasks required for the simulation of planetary exploration missions. The renderer is based on a novel spatial data representation, called the TANPO map. ...

- 1
- 173
Metrics
Total Citations1
Total Downloads173
Last 12 Months36
Last 6 weeks10

Abstract
View online with eReader
PDF

Article

Free

Education in high performance computing via the WWW: designing and using technical materials effectively

Susan Mehringer

Pages 9–eshttps://doi.org/10.1145/369028.369045

Cornell Theory Center, a national center for high performance computing, has been designing and delivering education programs on parallel processing in traditional workshops for years. With the advent and growth of the World Wide Web, we have been able ...

- 1
- 146
Metrics
Total Citations1
Total Downloads146
Last 12 Months33
Last 6 weeks9

Abstract
View online with eReader
PDF

Article

Free

Compiler-directed shared-memory communication for iterative parallel applications

Guhan Viswanathan,
James R. Larus

Pages 10–eshttps://doi.org/10.1145/369028.369047

Many scientific applications are iterative and specify repetitive communication patterns. This paper shows how a parallel-language compiler and custom cache-coherence protocols in a distributed shared memory system together can implement shared-memory ...

- 7
- 190
Metrics
Total Citations7
Total Downloads190
Last 12 Months53
Last 6 weeks8

Abstract
View online with eReader
PDF

Article

Free

Dynamic data distribution with control flow analysis

Jordi Garcia,
Eduard Ayguade,
Jesus Labarta

Pages 11–eshttps://doi.org/10.1145/369028.369048

This paper describes the design of a data distribution tool which automatically derives the data mapping for the arrays and the parallelization strategy for the loops in a Fortran 77 program. The layout generated can be static or dynamic, and the ...

- 24
- 218
Metrics
Total Citations24
Total Downloads218
Last 12 Months41
Last 6 weeks8

Abstract
View this article in HTML format

Article

Free

Transformations for imperfectly nested loops

Induprakas Kodukula,
Keshav Pingali

Pages 12–eshttps://doi.org/10.1145/369028.369051

Loop transformations are critical for compiling high-performance code for modern computers. Existing work has focused on transformations for perfectly nested loops (that is, loops in which all assignment statements are contained within the innermost ...

- 21
- 268
Metrics
Total Citations21
Total Downloads268
Last 12 Months79
Last 6 weeks15

Abstract
View online with eReader
PDF

Article

Free

Earthquake ground motion modeling on parallel computers

Hesheng Bao,
Jacobo Bielak,
Omar Ghattas,
Loukas F. Kallivokas,
David R. O'Hallaron,
Jonathan R. Shewchuk,
Jifeng Xu

Pages 13–eshttps://doi.org/10.1145/369028.369053

We describe the design and discuss the performance of a parallel elastic wave propagation simulator that is being used to model and study earthquake-induced ground motion in large sedimentary basins. The components of the system include mesh generators, ...

- 38
- 405
Metrics
Total Citations38
Total Downloads405
Last 12 Months37
Last 6 weeks10

Abstract
View this article in HTML format

Article

Free

Performance analysis and optimization on the UCLA parallel atmospheric general circulation model code

John Lou,
John Farrara

Pages 14–eshttps://doi.org/10.1145/369028.369056

An analysis is presented of several factors influencing the performance of a parallel implementation of the UCLA atmospheric general circulation model(AGCM) on massively parallel computer systems. Several modifications to the parallel AGCM code aimed at ...

- 9
- 167
Metrics
Total Citations9
Total Downloads167
Last 12 Months36
Last 6 weeks13

Abstract
View online with eReader
PDF

Article

Free

Climate data assimilation on a massively parallel Supercomputer

Hong Q. Ding,
Robert D. Ferraro

Pages 15–eshttps://doi.org/10.1145/369028.369058

We have designed and implemented a set of highly efficient and highly scalable algorithms for an unstructured computational package, the PSAS data assimilation package, as demonstrated by detailed performance analysis of systematic runs on up to 512-...

- 2
- 145
Metrics
Total Citations2
Total Downloads145
Last 12 Months24
Last 6 weeks8

Abstract
View online with eReader
PDF

Article

Free

Performance analysis using the MIPS R10000 performance counters

Marco Zagha,
Brond Larson,
Steve Turner,
Marty Itzkowitz

Pages 16–eshttps://doi.org/10.1145/369028.369059

Tuning supercomputer application performance often requires analyzing the interaction of the application and the underlying architecture. In this paper, we describe support in the MIPS R10000 for non-intrusively monitoring a variety of processor events -...

- 126
- 860
Metrics
Total Citations126
Total Downloads860
Last 12 Months268
Last 6 weeks51

Abstract
View online with eReader
PDF

Article

Free

Profiling a parallel language based on fine-grained communication

Bjoern Haake,
Klaus E. Schauser,
Chris Scheiman

Pages 17–eshttps://doi.org/10.1145/369028.369063

Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this ...

- 2
- 172
Metrics
Total Citations2
Total Downloads172
Last 12 Months31
Last 6 weeks7

Abstract
View online with eReader
PDF

Article

Free

Modeling, evaluation, and testing of paradyn instrumentation system

Abdul Waheed,
Diane T. Rover,
Jeffrey K. Hollingsworth

Pages 18–eshttps://doi.org/10.1145/369028.369065

This paper presents a case study of modeling, evaluating, and testing the data collection services (called an instrumentation system) of the Paradyn parallel performance measurement tool using well-known performance evaluation and experiment design ...

- 2
- 258
Metrics
Total Citations2
Total Downloads258
Last 12 Months50
Last 6 weeks11

Abstract
View online with eReader
PDF

Article

Free

An analytical model of the HINT performance metric

Quinn O. Snell,
John L. Gustafson

Pages 19–eshttps://doi.org/10.1145/369028.369067

The HINT Benchmark was developed to provide a broad-spectrum metric for computers, and to measure performance over the full range of memory sizes and time scales. We have extended our understanding of why HINT performance curves look the way they do, ...

- 3
- 229
Metrics
Total Citations3
Total Downloads229
Last 12 Months29
Last 6 weeks8

Abstract
View online with eReader
PDF

Article

Free

Communication patterns and models in prism: a spectral element-Fourier parallel Navier-Stokes solver

Constantinos Evangelinos,
George Em Karniadakis

Pages 20–eshttps://doi.org/10.1145/369028.369068

In this paper we analyze communication patterns in the parallel three-dimensional Navier-Stokes solver Prism, and present performance results on the IBM SP2, the Cray T3D and the SGI Power Challenge XL. Prism is used for direct numerical simulation of ...

- 8
- 235
Metrics
Total Citations8
Total Downloads235
Last 12 Months43
Last 6 weeks9

Abstract
View online with eReader
PDF

Article

The C31 parallel benchmark suite - introduction and preliminary results

Rakesh Jha,
Richard C. Metzger,
Brian VanVoorst,
Luiz S. Pires,
Wing Au,
Minesh Amin,
David A. Castanon,
Vipin Kumar

Pages 21–eshttps://doi.org/10.1145/369028.369073

Current parallel benchmarks, while appropriate for scientific applications, lack the defense relevance and representativeness for developers who are considering parallel computers for their Command, Control, Communication, and Intelligence (C3I) ...

- 3
Metrics
Total Citations3

Abstract

Article

Free

Architectural and application: the performance of the NEC SX-4 on the NCAR benchmark suite

Steven W. Hammond,
Richard D. Loft,
Philip D. Tannenbaum

Pages 22–eshttps://doi.org/10.1145/369028.369076

In November 1994, the NEC Corporation announced the SX-4 supercomputer. It is the third in the SX series of supercomputers and is upward compatible from the SX-3R vector processor with enhancements for scalar processing, short vector processing, and ...

- 3
- 191
Metrics
Total Citations3
Total Downloads191
Last 12 Months55
Last 6 weeks12

Abstract
View online with eReader
PDF

Article

Free

Minimal adaptive routing with limited injection on Toroidal k-ary n-cubes

Fabrizio Petrini,
Marco Vanneschi

Pages 23–eshttps://doi.org/10.1145/369028.369078

Virtual channels can be used to implement deadlock free adaptive routing algorithms and increase network throughput. Unfortunately, they introduce asymmetries in the use of buffers of symmetric networks as the toroidal k-ary n-cubes. In this paper we ...

- 8
- 170
Metrics
Total Citations8
Total Downloads170
Last 12 Months42
Last 6 weeks6

Abstract
View this article in HTML format

Article

Free

Low-latency communication on the IBM RISC system/6000 SP

Chi-Chao Chang,
Grzegorz Czajkowski,
Chris Hawblitzel,
Thorsten von Eicken

Pages 24–eshttps://doi.org/10.1145/369028.369079

The IBM SP is one of the most powerful commercial MPPs, yet, in spite of its fast processors and high network bandwidth, the SP's communication latency is inferior to older machines such as the TMC CM-5 or Meiko CS-2. This paper investigates the use of ...

- 11
- 248
Metrics
Total Citations11
Total Downloads248
Last 12 Months80
Last 6 weeks18

Abstract
View online with eReader
PDF

Article

Free

Compiled communication for all-optical TDM networks

Xin Yuan,
R. Melhem,
R. Gupta

Pages 25–eshttps://doi.org/10.1145/369028.369081

While all-optical networks offer large bandwidth for transferring data, the control mechanisms to dynamically establish all-optical paths incur large overhead. In this paper, we consider the problem of adapting all-optical multiplexed networks in ...

- 15
- 250
Metrics
Total Citations15
Total Downloads250
Last 12 Months46
Last 6 weeks8

Abstract
View online with eReader
PDF

Article

Free

Increasing the effective bandwidth of complex memory systems in multivector processors

Anna M. del Corral,
Jose M. Llaberia

Pages 26–eshttps://doi.org/10.1145/369028.369084

In multivector processors, the lost cycles due to conflicts between concurrent vector streams make the effective throughput be lower than the peak throughput. When the request rate of all the concurrent vector streams to every memory module is less than ...

- 0
- 147
Metrics
Total Citations0
Total Downloads147
Last 12 Months29
Last 6 weeks8

Abstract
View online with eReader
PDF

Article

Free

A parallel cosmological hydrodynamics code

Paul W. Bode,
Guohong Xu,
Renyue Cen

Pages 27–eshttps://doi.org/10.1145/369028.369085

Formation by gravitational collapse of galaxies and the large-scale structure of the universe is a nonlinear, multi-scale, multi-component problem. This complex process involves dynamics of the gaseous baryons as well as of the gravitationally dominant ...

- 0
- 188
Metrics
Total Citations0
Total Downloads188
Last 12 Months27
Last 6 weeks9

Abstract
View online with eReader
PDF

Article

Free

Transient dynamics simulations: parallel algorithms for contact detection and smoothed particle hydrodynamics

Steve Plimpton,
Bruce Hendrickson,
Steve Attaway,
Jeff Swegle,
Courtenay Vaughan,
Dave Gardner

Pages 28–eshttps://doi.org/10.1145/369028.369087

Transient dynamics simulations are commonly used to model phenomena such as car crashes, underwater explosions, and the response of shipping containers to high-speed impacts. Physical objects in such a simulation are typically represented by Lagrangian ...

- 13
- 590
Metrics
Total Citations13
Total Downloads590
Last 12 Months57
Last 6 weeks13

Abstract
View online with eReader
PDF

Article

Free

Performance of a computational fluid dynamics code on NEC and Cray supercomputers: beyond 10 gigaflops

Ferhat F. Hatay

Pages 29–eshttps://doi.org/10.1145/369028.369089

The implementation and optimization of a production mode Computational Fluid Dynamics (CFD) software to NEC and Cray supercomputing platforms are discussed. It is intended to assess the impact of different computer architectures and High Power Computing ...

- 0
- 510
Metrics
Total Citations0
Total Downloads510
Last 12 Months35
Last 6 weeks5

Abstract
View this article in HTML format

Article

Free

Parallel preconditioners for elliptic PDEs

Vivek Sarin,
Ahmed Sameh

Pages 30–eshttps://doi.org/10.1145/369028.369090

Iterative schemes for solving sparse linear systems arising from elliptic PDEs are very suitable for efficient implementation on large scale multiprocessors. However, these methods rely heavily on effective preconditioners which must also be amenable to ...

- 1
- 116
Metrics
Total Citations1
Total Downloads116
Last 12 Months22
Last 6 weeks6

Abstract
View online with eReader
PDF

Save to Binder

Create a New Binder

Name

Contributors

Beverly Clayton
- Publication Years
- Publication counts0
- Citation count0
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article0
View Full Profile

Index Terms

Proceedings of the 1996 ACM/IEEE conference on Supercomputing

Comments

Recommendations

UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing
UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing
IHM '14: Proceedings of the 26th Conference on l'Interaction Homme-Machine

Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Year	Submitted	Accepted	Rate
SC '17	327	61	19%
SC '16	442	81	18%
SC '15	358	79	22%
SC '14	394	83	21%
SC '13	449	91	20%
SC '12	461	100	22%
SC '11	352	74	21%
SC '10	253	51	20%
SC '09	261	59	23%
SC '08	277	59	21%
SC '07	268	54	20%
SC '06	239	54	23%
SC '05	260	62	24%
SC '04	200	60	30%
SC '03	207	60	29%
SC '02	230	67	29%
SC '01	240	60	25%
SC '00	179	62	35%
Supercomputing '95	241	69	29%
Supercomputing '93	300	72	24%
Supercomputing '92	220	75	34%
Supercomputing '91	215	83	39%
Overall	6,373	1,516	24%

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Proceeding Downloads

Save to Binder

Index Terms

Recommendations

UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing

UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing

IHM '14: Proceedings of the 26th Conference on l'Interaction Homme-Machine

Acceptance Rates

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.