Award Abstract # 1828187
MRI: Acquisition of an HPC System for Data-Driven Discovery in Computational Astrophysics, Biology, Chemistry, and Materials Science

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: GEORGIA TECH RESEARCH CORP
Initial Amendment Date: August 24, 2018
Latest Amendment Date: September 7, 2022
Award Number: 1828187
Award Instrument: Standard Grant
Program Manager: Alejandro Suarez
alsuarez@nsf.gov
 (703)292-7092
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer & Information Science & Engineering
Start Date: October 1, 2018
End Date: September 30, 2022 (Estimated)
Total Intended Award Amount: $3,699,317.00
Total Awarded Amount to Date: $3,699,317.00
Funds Obligated to Date: FY 2018 = $3,699,317.00
History of Investigator:
  • Srinivas Aluru (Principal Investigator)
    aluru@cc.gatech.edu
  • Surya Kalidindi (Co-Principal Investigator)
  • Charles Sherrill (Co-Principal Investigator)
  • Deirdre Shoemaker (Co-Principal Investigator)
  • Richard Vuduc (Co-Principal Investigator)
Recipient Sponsored Research Office: Georgia Tech Research Corporation
926 DALNEY ST NW
ATLANTA
GA  US  30318-6395
(404)894-4819
Sponsor Congressional District: 05
Primary Place of Performance: Georgia Institute of Technology
225 North Avenue
Atlanta
GA  US  30332-0002
Primary Place of Performance
Congressional District:
05
Unique Entity Identifier (UEI): EMW9FC8J3HN4
Parent UEI: EMW9FC8J3HN4
NSF Program(s): Major Research Instrumentation,
Information Technology Researc
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 1189, 026Z
Program Element Code(s): 118900, 164000
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

The project funds the purchase of a high-performance computing and storage system at the Georgia Institute of Technology. This computing instrument will support data-driven research in astrophysics, biosciences, computational chemistry, materials and manufacturing, and computational science. These projects contribute to national initiatives in big data, strategic computing, materials genome, and manufacturing partnership; and NSF supported observatories such as the gravitational wave observatory and the South Pole neutrino observatory. The system also serves as a springboard for developments of codes, software prototyping, and scalability studies prior to using national supercomputers. Advances made in computational methods and scientific software are disseminated in the form of open-source codes and data analysis portals. Over 33 faculty, 54 research scientists/postdocs, 195 graduate students, and 56 undergraduate students will immediately benefit from the instrument. In addition, the system provides training opportunity at all levels from undergraduate students to early career researchers, in important interdisciplinary areas of national need. A fifth of the system capacity is utilized to enable research activities of regional partners, researchers from minority serving institutions, and other users nationally through XSEDE participation. The project involves undergraduate student participation from historically black colleges from Atlanta metropolitan area. Public outreach efforts are planned through videos of public interest and local events such as the Atlanta Science Festival.

The cluster will combine regular compute nodes with others configured to emphasize one of the following: big memory, big local storage, solid state storage, Graphics Processing Units (GPU), and ARM processors. In doing so, the system can be employed by a diversity of projects. In astrophysics, the instrument bolsters data-driven research including detection of gravitational waves, astrophysical neutrinos, and gamma rays. It does it by leveraging data from leading astroparticle observatories and contributing to their mission. It also leads to improved insights into formation of supermassive black holes and large-scale structure of the universe. The computing system also aids the development of parallel software in computational genomics, systems biology, and health analytics. Important applications in assembly and network analysis of plant genomes, and environmental metagenomics are pursued. The instrument also enables next generation algorithms and software for computational chemistry and expands the boundaries of molecular simulation. The system enables advances in density function theory, enhances studies of crystal defects and nanostructures, and injects novel use of machine learning techniques in computational chemistry. It also fosters the development of data science methodologies to identify building blocks of materials at multiple scales, thus significantly reducing the development and deployments cycles for new materials.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 63)
Chang, Chaoyi and Medford, Andrew J. "Application of Density Functional Tight Binding and Machine Learning to Evaluate the Stability of Biomass Intermediates on the Rh(111) Surface" The Journal of Physical Chemistry C , v.125 , 2021 https://doi.org/10.1021/acs.jpcc.1c05715 Citation Details
Abbott, B. P. and Abbott, R. and Abbott, T. D. and Abraham, S. and Acernese, F. and Ackley, K. and Adams, A. and Adams, C. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. D. and Aiello, "Search for intermediate mass black hole binaries in the first and second observing runs of the Advanced LIGO and Virgo network" Physical Review D , v.100 , 2019 https://doi.org/10.1103/PhysRevD.100.064064 Citation Details
Abbott, B. P. and Abbott, R. and Abbott, T. D. and Abraham, S. and Acernese, F. and Ackley, K. and Adams, C. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. D. and Aiello, L. and Ain, A. "GWTC-1: A Gravitational-Wave Transient Catalog of Compact Binary Mergers Observed by LIGO and Virgo during the First and Second Observing Runs" Physical Review X , v.9 , 2019 https://doi.org/10.1103/PhysRevX.9.031040 Citation Details
Abbott, B. P. and Abbott, R. and Abbott, T. D. and Abraham, S. and Acernese, F. and Ackley, K. and Adams, C. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. D. and Aiello, L. and Ain, A. "Search for Eccentric Binary Black Hole Mergers with Advanced LIGO and Advanced Virgo during Their First and Second Observing Runs" The Astrophysical Journal , v.883 , 2019 https://doi.org/10.3847/1538-4357/ab3c2d Citation Details
Abbott, B. P. and Abbott, R. and Abbott, T. D. and Abraham, S. and Acernese, F. and Ackley, K. and Adams, C. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. D. and Aiello, L. and Ain, A. "Tests of general relativity with the binary black hole signals from the LIGO-Virgo catalog GWTC-1" Physical Review D , v.100 , 2019 https://doi.org/10.1103/PhysRevD.100.104036 Citation Details
Abbott, B. P. and Abbott, R. and Abbott, T. D. and Acernese, F. and Ackley, K. and Adams, C. and Adams, T. and Addesso, P. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agarwal, B. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. "Tests of General Relativity with GW170817" Physical Review Letters , v.123 , 2019 https://doi.org/10.1103/PhysRevLett.123.011102 Citation Details
Abbott, R. and Abbott, T. D. and Abraham, S. and Acernese, F. and Ackley, K. and Adams, C. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. D. and Aich, A. and Aiello, L. and Ain, A. and "GW190412: Observation of a binary-black-hole coalescence with asymmetric masses" Physical Review D , v.102 , 2020 https://doi.org/10.1103/PhysRevD.102.043015 Citation Details
Abbott, R. and Abbott, T. D. and Abraham, S. and Acernese, F. and Ackley, K. and Adams, C. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. D. and Aich, A. and Aiello, L. and Ain, A. and "GW190521: A Binary Black Hole Merger with a Total Mass of 150M" Physical Review Letters , v.125 , 2020 https://doi.org/10.1103/PhysRevLett.125.101102 Citation Details
Abbott, R. and Abbott, T. D. and Abraham, S. and Acernese, F. and Ackley, K. and Adams, C. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. D. and Aich, A. and Aiello, L. and Ain, A. and "GW190814: Gravitational Waves from the Coalescence of a 23 Solar Mass Black Hole with a 2.6 Solar Mass Compact Object" The Astrophysical Journal , v.896 , 2020 https://doi.org/10.3847/2041-8213/ab960f Citation Details
Abbott, R. and Abbott, T. D. and Abraham, S. and Acernese, F. and Ackley, K. and Adams, C. and Adhikari, R. X. and Adya, V. B. and Affeldt, C. and Agathos, M. and Agatsuma, K. and Aggarwal, N. and Aguiar, O. D. and Aich, A. and Aiello, L. and Ain, A. and "Properties and Astrophysical Implications of the 150 M Binary Black Hole Merger GW190521" The Astrophysical Journal , v.900 , 2020 https://doi.org/10.3847/2041-8213/aba493 Citation Details
Acharya, Atanu and Lynch, Diane L. and Pavlova, Anna and Pang, Yui Tik and Gumbart, James C. "ACE2 glycans preferentially interact with SARS-CoV-2 over SARS-CoV" Chemical Communications , v.57 , 2021 https://doi.org/10.1039/d1cc02305e Citation Details
(Showing: 1 - 10 of 63)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The project funded the acquisition of a high-performance computing cluster named Hive to support data-driven research in multiple fields including astrophysics, data-enabled chemistry, biology and health, and materials science and manufacturing. The Hive cluster consists of 484 Xeon compute nodes and 2.5 petabytes of storage, connected by 100 GB/s EDR Infiniband interconnection network. Some of the nodes are enhanced/specialized to meet different application requirements ? 16 nodes with quad V100 NVIDIA GPUs, 16 with large local disks, 16 with large local Solid State Disks (SSDs), and four with much larger main memory. In addition, a 16 node HPE/Cray NSP-1 system with Fujitsu 48-core A64FX ARM-based processors and 32 GB High Bandwidth Memory is procured. The Hive cluster is used as primary resource for computation and data-enabled research of 35 faculty members and their research groups totaling about 250 users during its span of operation. In addition, the system is made available via the NSF XSEDE network by deploying the Hive gateway using SciGaP?s Apache Airavata framework.

Intellectual merit: The project funded instrumentation that simultaneously impacted multiple disciplines. Research in astrophysics has contributed to our understanding of binary black holes, gravitational waves, and the first generations of stars and galaxies in the early Universe. Computational chemistry research created detailed machine learning models for numerous reactions and materials. The research has provided further insight into electrochemistry, filling in some details of fundamental processes by atomistic simulation. Hive allowed for the first systematic investigation of the range-dependence of intermolecular interactions in molecular crystals, which should help theorists who are designing improved algorithms for rapidly computing accurate lattice energies. Multi-scale datasets generated by materials science researchers on Hive are now allowing us to harvest a new class of materials knowledge systems that are focused on accelerating the rate of materials discovery, development, and deployment in practical advanced technology applications, directly in support of the national Materials Genome Initiative. Finally, research in high performance computing led to the development of parallel algorithms and software tools for learning graphical models, constructing tensor decompositions, and carrying out analytics on hypergraphs.

Broader impacts: The project led to multiple open-source software products that contributed to the high-performance computing software ecosystem that exists in several science and engineering disciplines. For example, the SPARC DFT software helps accelerate the speed of computational chemistry research. Research in materials science supported the continued development of www.pymks.org as an open-source repository for codes capable of establishing process-structure-property linkages needed to accelerate materials innovation efforts (in direct support of the federal Materials Genome Initiative). Researchers also contributed data and software to Materials Commons (https://materialscommons.org/), and continued the development of www.materialhub.org as an open-access sharing repository for material microstructure datasets. During Covid, some capacity of the cluster is redirected in support of scientific work to fight the pandemic. Research studies conducted on Hive of the main protease in the SARS-CoV-2 virus revealed that binding of inhibitors can shift the dominant protonation states, an important consideration for drug design efforts.

The project also created valuable datasets that are useful in the respective scientific communities. For example, the computational chemistry datasets generated are being utilized by theoretical chemistry methods developers and for machine learning applications in helping parameterize, train, or calibrate new methods. The knowledge created using the Hive cluster was critical in establishing the start-up company Multiscale Technologies, Inc., which provides software and hardware solutions aimed at accelerating materials innovation/design cycles for its customers using emergent AI/ML and high-throughput strategies.

In addition to supporting research by Georgia Tech investigators, up to 20% of the capacity of the Hive cluster is set aside for external users, with particular focus on HBCUs and MSIs.


Last Modified: 04/22/2023
Modified by: Srinivas Aluru

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page