Award Abstract # 1919021
SPX: Collaborative Research: Parallel Algorithm by Blocks - A Data-centric Compiler/runtime System for Productive Programming of Scalable Parallel Systems

NSF Org: CCF
Division of Computing and Communication Foundations
Recipient: GEORGIA TECH RESEARCH CORP
Initial Amendment Date: July 29, 2019
Latest Amendment Date: August 29, 2023
Award Number: 1919021
Award Instrument: Standard Grant
Program Manager: Damian Dechev
ddechev@nsf.gov
 (703)292-8910
CCF
 Division of Computing and Communication Foundations
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2019
End Date: September 30, 2024 (Estimated)
Total Intended Award Amount: $400,000.00
Total Awarded Amount to Date: $400,000.00
Funds Obligated to Date: FY 2019 = $400,000.00
History of Investigator:
  • Umit Catalyurek (Principal Investigator)
    umit@gatech.edu
  • Umit Catalyurek (Former Principal Investigator)
  • Richard Vuduc (Former Principal Investigator)
Recipient Sponsored Research Office: Georgia Tech Research Corporation
926 DALNEY ST NW
ATLANTA
GA  US  30318-6395
(404)894-4819
Sponsor Congressional District: 05
Primary Place of Performance: Georgia Institute of Technology
225 North Ave
Atlanta
GA  US  30332-0420
Primary Place of Performance
Congressional District:
05
Unique Entity Identifier (UEI): EMW9FC8J3HN4
Parent UEI: EMW9FC8J3HN4
NSF Program(s): PPoSS-PP of Scalable Systems
Primary Program Source: 01001920DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 026Z
Program Element Code(s): 042Y00
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Achieving both high productivity and high performance on scalable parallel and heterogeneous computer systems is a challenging goal for application developers. Parallel programming with Message Passing Interface (MPI) is currently the most widely used and effective means of developing scalable parallel applications; however the productivity of application developers is lower than with programming models that offer a global shared view of data structures. In comparison, achieving high performance and scalability with global-address-space programming models is challenging. This project focuses on the development of a data-centric compiler/runtime framework, "Parallel Algorithms by Blocks" (PAbB), aimed at offering users the combined positive attributes of multiple parallel programming models without the disadvantages. The main novelty of this project is that it uses a combination of user insights, new compiler optimizations, and advanced runtime support to achieve both productivity and performance for an important class of computations that operate on matrices, tensors, and graphs. The main broader impact of the work is that it can significantly lower the barrier to entry for scientists from various domains who wish to develop new high-performance applications on large scale parallel systems, but presently find it too difficult with currently available parallel programming models.

This project brings together a team of investigators, with expertise across the software stack, to develop compiler tools and runtime systems for PAbB and demonstrate its use across a number of applications from computational science and data science. The PAbB model is intended to work in concert with MPI; that is, PAbB programs can execute in any standard MPI environment, interoperating with other native MPI code. The key idea behind the proposed approach is to offer the user a global-address view of the targeted data structures, requiring only (optionally in some cases) that they specify how data should be partitioned, but have the compiler/runtime handle the tedious aspects of the global-to-local re-indexing and inter-node data movement. In addition to the productivity benefit, a second significant benefit is in enabling system support for dynamic load balancing. The approach is being designed and demonstrated in the context of applications operating on dense and sparse matrices and tensors, and graphs.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Barik, Reet and Minutoli, Marco and Halappanavar, Mahantesh and Kalyanaraman, Ananth "IMpart: A Partitioning-based Parallel Approach to Accelerate Influence Maximization" Proceedings of the International Conference on High Performance Computing, Data, and Analytics (HiPC) , 2022 https://doi.org/10.1109/HiPC56025.2022.00028 Citation Details
An, Xiaojing and Ghosh, Priyanka and Keppler, Patrick and Kurt, Sureyya Emre and Krishnamoorthy, Sriram and Sadayappan, Ponnuswamy and Rajam, Aravind Sukumaran and Çatalyürek, Ümit V. and Kalyanaraman, Ananth "BOA: A partitioned view of genome assembly" iScience , v.25 , 2022 https://doi.org/10.1016/j.isci.2022.105273 Citation Details
Rabbi, Fazlay and Daley, Christopher S. and Çatalyürek, Ümit V. and Aktulga, Hasan Metin "A Portable Sparse Solver Framework for Large Matrices on Heterogeneous Architectures" International Conference on High Performance Computing, Data, & Analytics , 2022 https://doi.org/10.1109/HiPC56025.2022.00030 Citation Details
Balin, Muhammed Fatih and Sancak, Kaan and Catalyurek, Umit V. "MG-GCN: A Scalable multi-GPU GCN Training Framework" Proceedings of the 51st International Conference on Parallel Processing (ICPP) , 2022 https://doi.org/10.1145/3545008.3545082 Citation Details
Gawande, Nitin and Ghosh, Sayan and Halappanavar, Mahantesh and Tumeo, Antonino and Kalyanaraman, Ananth "Towards scaling community detection on distributed-memory heterogeneous systems" Parallel Computing , v.111 , 2022 https://doi.org/10.1016/j.parco.2022.102898 Citation Details
Fang, Bo and Ozkaya, M. Yusuf and Li, Ang and Catalyurek, Umit V. and Krishnamoorthy, Sriram "Efficient Hierarchical State Vector Simulation of Quantum Circuits via Acyclic Graph Partitioning" International Conference on Cluster Computing , 2022 https://doi.org/10.1109/CLUSTER51413.2022.00041 Citation Details
Choudhury, Dwaipayan and Xiang, Lizhi and Rajam, Aravind Sukumaran and Kalyanaraman, Ananth and Pande, Partha Pratim "Accelerating Graph Computations on 3D NoC-enabled PIM Architectures" ACM Transactions on Design Automation of Electronic Systems , 2022 https://doi.org/10.1145/3564290 Citation Details
Choudhury, Dwaipayan and Barik, Reet and Rajam, Aravind Sukumaran and Kalyanaraman, Ananth and Pande, Partha Pratim "Software/Hardware Co-design of 3D NoC-based GPU Architectures for Accelerated Graph Computations" ACM Transactions on Design Automation of Electronic Systems , v.27 , 2022 https://doi.org/10.1145/3514354 Citation Details
Çatalyürek, Ümit and Devine, Karen and Faraj, Marcelo and Gottesbüren, Lars and Heuer, Tobias and Meyerhenke, Henning and Sanders, Peter and Schlag, Sebastian and Schulz, Christian and Seemaier, Daniel and Wagner, Dorothea "More Recent Advances in (Hyper)Graph Partitioning" ACM Computing Surveys , v.55 , 2023 https://doi.org/10.1145/3571808 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page