| NSF Org: |
OAC Office of Advanced Cyberinfrastructure (OAC) |
| Recipient: |
|
| Initial Amendment Date: | August 24, 2018 |
| Latest Amendment Date: | September 7, 2022 |
| Award Number: | 1828187 |
| Award Instrument: | Standard Grant |
| Program Manager: |
Alejandro Suarez
alsuarez@nsf.gov (703)292-7092 OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer & Information Science & Engineering |
| Start Date: | October 1, 2018 |
| End Date: | September 30, 2022 (Estimated) |
| Total Intended Award Amount: | $3,699,317.00 |
| Total Awarded Amount to Date: | $3,699,317.00 |
| Funds Obligated to Date: |
|
| History of Investigator: |
|
| Recipient Sponsored Research Office: |
926 DALNEY ST NW ATLANTA GA US 30318-6395 (404)894-4819 |
| Sponsor Congressional District: |
|
| Primary Place of Performance: |
225 North Avenue Atlanta GA US 30332-0002 |
| Primary Place of
Performance Congressional District: |
|
| Unique Entity Identifier (UEI): |
|
| Parent UEI: |
|
| NSF Program(s): |
Major Research Instrumentation, Information Technology Researc |
| Primary Program Source: |
|
| Program Reference Code(s): |
|
| Program Element Code(s): |
|
| Award Agency Code: | 4900 |
| Fund Agency Code: | 4900 |
| Assistance Listing Number(s): | 47.070 |
ABSTRACT
![]()
The project funds the purchase of a high-performance computing and storage system at the Georgia Institute of Technology. This computing instrument will support data-driven research in astrophysics, biosciences, computational chemistry, materials and manufacturing, and computational science. These projects contribute to national initiatives in big data, strategic computing, materials genome, and manufacturing partnership; and NSF supported observatories such as the gravitational wave observatory and the South Pole neutrino observatory. The system also serves as a springboard for developments of codes, software prototyping, and scalability studies prior to using national supercomputers. Advances made in computational methods and scientific software are disseminated in the form of open-source codes and data analysis portals. Over 33 faculty, 54 research scientists/postdocs, 195 graduate students, and 56 undergraduate students will immediately benefit from the instrument. In addition, the system provides training opportunity at all levels from undergraduate students to early career researchers, in important interdisciplinary areas of national need. A fifth of the system capacity is utilized to enable research activities of regional partners, researchers from minority serving institutions, and other users nationally through XSEDE participation. The project involves undergraduate student participation from historically black colleges from Atlanta metropolitan area. Public outreach efforts are planned through videos of public interest and local events such as the Atlanta Science Festival.
The cluster will combine regular compute nodes with others configured to emphasize one of the following: big memory, big local storage, solid state storage, Graphics Processing Units (GPU), and ARM processors. In doing so, the system can be employed by a diversity of projects. In astrophysics, the instrument bolsters data-driven research including detection of gravitational waves, astrophysical neutrinos, and gamma rays. It does it by leveraging data from leading astroparticle observatories and contributing to their mission. It also leads to improved insights into formation of supermassive black holes and large-scale structure of the universe. The computing system also aids the development of parallel software in computational genomics, systems biology, and health analytics. Important applications in assembly and network analysis of plant genomes, and environmental metagenomics are pursued. The instrument also enables next generation algorithms and software for computational chemistry and expands the boundaries of molecular simulation. The system enables advances in density function theory, enhances studies of crystal defects and nanostructures, and injects novel use of machine learning techniques in computational chemistry. It also fosters the development of data science methodologies to identify building blocks of materials at multiple scales, thus significantly reducing the development and deployments cycles for new materials.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
![]()
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
![]()
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
The project funded the acquisition of a high-performance computing cluster named Hive to support data-driven research in multiple fields including astrophysics, data-enabled chemistry, biology and health, and materials science and manufacturing. The Hive cluster consists of 484 Xeon compute nodes and 2.5 petabytes of storage, connected by 100 GB/s EDR Infiniband interconnection network. Some of the nodes are enhanced/specialized to meet different application requirements ? 16 nodes with quad V100 NVIDIA GPUs, 16 with large local disks, 16 with large local Solid State Disks (SSDs), and four with much larger main memory. In addition, a 16 node HPE/Cray NSP-1 system with Fujitsu 48-core A64FX ARM-based processors and 32 GB High Bandwidth Memory is procured. The Hive cluster is used as primary resource for computation and data-enabled research of 35 faculty members and their research groups totaling about 250 users during its span of operation. In addition, the system is made available via the NSF XSEDE network by deploying the Hive gateway using SciGaP?s Apache Airavata framework.
Intellectual merit: The project funded instrumentation that simultaneously impacted multiple disciplines. Research in astrophysics has contributed to our understanding of binary black holes, gravitational waves, and the first generations of stars and galaxies in the early Universe. Computational chemistry research created detailed machine learning models for numerous reactions and materials. The research has provided further insight into electrochemistry, filling in some details of fundamental processes by atomistic simulation. Hive allowed for the first systematic investigation of the range-dependence of intermolecular interactions in molecular crystals, which should help theorists who are designing improved algorithms for rapidly computing accurate lattice energies. Multi-scale datasets generated by materials science researchers on Hive are now allowing us to harvest a new class of materials knowledge systems that are focused on accelerating the rate of materials discovery, development, and deployment in practical advanced technology applications, directly in support of the national Materials Genome Initiative. Finally, research in high performance computing led to the development of parallel algorithms and software tools for learning graphical models, constructing tensor decompositions, and carrying out analytics on hypergraphs.
Broader impacts: The project led to multiple open-source software products that contributed to the high-performance computing software ecosystem that exists in several science and engineering disciplines. For example, the SPARC DFT software helps accelerate the speed of computational chemistry research. Research in materials science supported the continued development of www.pymks.org as an open-source repository for codes capable of establishing process-structure-property linkages needed to accelerate materials innovation efforts (in direct support of the federal Materials Genome Initiative). Researchers also contributed data and software to Materials Commons (https://materialscommons.org/), and continued the development of www.materialhub.org as an open-access sharing repository for material microstructure datasets. During Covid, some capacity of the cluster is redirected in support of scientific work to fight the pandemic. Research studies conducted on Hive of the main protease in the SARS-CoV-2 virus revealed that binding of inhibitors can shift the dominant protonation states, an important consideration for drug design efforts.
The project also created valuable datasets that are useful in the respective scientific communities. For example, the computational chemistry datasets generated are being utilized by theoretical chemistry methods developers and for machine learning applications in helping parameterize, train, or calibrate new methods. The knowledge created using the Hive cluster was critical in establishing the start-up company Multiscale Technologies, Inc., which provides software and hardware solutions aimed at accelerating materials innovation/design cycles for its customers using emergent AI/ML and high-throughput strategies.
In addition to supporting research by Georgia Tech investigators, up to 20% of the capacity of the Hive cluster is set aside for external users, with particular focus on HBCUs and MSIs.
Last Modified: 04/22/2023
Modified by: Srinivas Aluru
Please report errors in award information by writing to: awardsearch@nsf.gov.


