- Article, June 2005
Automatic generation and tuning of MPI collective communication routines
ICS '05: Proceedings of the 19th annual international conference on Supercomputing, Pages 393–402. https://doi.org/10.1145/1088149.1088202
In order for collective communication routines to achieve high performance on different platforms, they must be able to adapt to the system architecture and use different algorithms for different situations. Current Message Passing Interface (MPI) ...
- Article, June 2005
affinity-on-next-touch: increasing the performance of an industrial PDE solver on a cc-NUMA system
ICS '05: Proceedings of the 19th annual international conference on Supercomputing, Pages 387–392. https://doi.org/10.1145/1088149.1088201
The non-uniform memory access times of modern cc-NUMA systems often impair performance for shared memory applications. This is especially true for applications exhibiting complex access patterns. To improve performance, a mechanism for co-locating ...
- Article, June 2005
Disk layout optimization for reducing energy consumption
ICS '05: Proceedings of the 19th annual international conference on Supercomputing, Pages 274–283. https://doi.org/10.1145/1088149.1088186
Excessive power consumption is becoming a major barrier to extracting the maximum performance from high-performance parallel systems. Therefore, techniques oriented towards reducing power consumption of such systems are expected to become increasingly ...
- Article, June 2005
Optimization of MPI collective communication on BlueGene/L systems
- George Almási,
- Philip Heidelberger,
- Charles J. Archer,
- Xavier Martorell,
- C. Chris Erway,
- José E. Moreira,
- B. Steinmacher-Burow,
- Yili Zheng
ICS '05: Proceedings of the 19th annual international conference on Supercomputing, Pages 253–262. https://doi.org/10.1145/1088149.1088183
BlueGene/L is currently the world's fastest supercomputer. It consists of a large number of low power dual-processor compute nodes interconnected by high speed torus and collective networks. Because compute nodes do not have shared memory, MPI is the ...
- Article, June 2005
Generating new general compiler optimization settings
ICS '05: Proceedings of the 19th annual international conference on Supercomputing, Pages 161–168. https://doi.org/10.1145/1088149.1088171
Finding nearly optimal optimization settings for modern compilers which can utilize a large number of optimizations is a combinatorially exponential problem. In this paper, we investigate whether in the presence of many optimization choices random ...
- Article, June 2005
Lightweight reference affinity analysis
ICS '05: Proceedings of the 19th annual international conference on Supercomputing, Pages 131–140. https://doi.org/10.1145/1088149.1088167
Previous studies have shown that array regrouping and structure splitting significantly improve data locality. The most effective technique relies on profiling every access to every data element. The high overhead impedes its adoption in a general ...
- Article, June 2005
Automatic thread distribution for nested parallelism in OpenMP
ICS '05: Proceedings of the 19th annual international conference on Supercomputing, Pages 121–130. https://doi.org/10.1145/1088149.1088166
OpenMP is becoming the standard programming model for shared-memory parallel architectures. One of the most interesting features in the language is the support for nested parallelism. Previous research and parallelization experiences have shown the ...
- Article, June 2005
A hybrid hardware/software approach to efficiently determine cache coherence bottlenecks
ICS '05: Proceedings of the 19th annual international conference on Supercomputing, Pages 21–30. https://doi.org/10.1145/1088149.1088153
High-end computing increasingly relies on shared-memory multiprocessors (SMPs), such as clusters of SMPs, nodes of chip-multiprocessors (CMP) or large-scale single-system image (SSI) SMPs. In such systems, performance is often affected by the sharing ...