| NSF Org: |
DBI Division of Biological Infrastructure |
| Recipient: |
|
| Initial Amendment Date: | June 2, 2020 |
| Latest Amendment Date: | June 2, 2020 |
| Award Number: | 2019771 |
| Award Instrument: | Standard Grant |
| Program Manager: |
Jennifer Weller
jweller@nsf.gov (703)292-2224 DBI Division of Biological Infrastructure BIO Directorate for Biological Sciences |
| Start Date: | July 1, 2020 |
| End Date: | June 30, 2024 (Estimated) |
| Total Intended Award Amount: | $400,000.00 |
| Total Awarded Amount to Date: | $400,000.00 |
| Funds Obligated to Date: |
|
| History of Investigator: |
|
| Recipient Sponsored Research Office: |
926 DALNEY ST NW ATLANTA GA US 30318-6395 (404)894-4819 |
| Sponsor Congressional District: |
|
| Primary Place of Performance: |
225 North Avenue, NW Atlanta GA US 30332-0002 |
| Primary Place of
Performance Congressional District: |
|
| Unique Entity Identifier (UEI): |
|
| Parent UEI: |
|
| NSF Program(s): | Infrastructure Innovation for |
| Primary Program Source: |
|
| Program Reference Code(s): |
|
| Program Element Code(s): |
|
| Award Agency Code: | 4900 |
| Fund Agency Code: | 4900 |
| Assistance Listing Number(s): | 47.074 |
ABSTRACT
![]()
Cells are the fundamental units that provide functions needed to sustain life in living organisms. Cellular functions are carried out by proteins, products of genes, and the process of producing proteins from genes (i.e., gene expression) is mediated by complex regulation systems. Much remains unknown about the mechanisms of gene regulations. Given all genes in a cell, the regulatory relationships among genes can be represented by networks, called gene regulatory networks. It has been a long-standing challenge to reconstruct these networks experimentally and computationally. A gene can express multiple isoforms (mRNA molecules), and hence produces multiple different proteins, which makes the underlying gene regulatory networks more complicated. Recent advances in single cell RNA-Sequencing (scRNA-Seq) technology has brought new opportunities in resolving high-quality regulatory networks, but also posed new computational challenges. The project aims to computationally reconstruct accurate regulatory networks at the isoform-level from large-scale sequencing data. Educational and outreach activities, such as courses on topics in computational biology and inclusion of minority students, will be carried out.
The project will develop efficient approaches to identify expressed isoforms and to determine expression abundances, and then develop a network-reconstruction method which improves current state-of-art. The new computational methods will be validated and applied to the field of immunology--to study cellular mechanisms in steroid-producing cells. The project will make contribution in improvements over existing methods. First, the proposed methods for developing a scalable transcript assembler will enable accurate determination and quantification of the expressed isoforms, and make it possible to build regulatory networks at the level of isoforms to reflect the possible difference in regulatory mechanisms for different isoforms. Second, many recently developed methods for network inference require cells to be pre-ordered with trajectory inference or RNA-velocity to mimic time-series data. Errors in the cell ordering can mislead network inference and lead to false predictions. The project proposes to perform cell ordering and network inference simultaneously, which is expected to provide better results for both cell ordering and network inference. The project will reconstruct transcript-level regulatory networks for different types of steroid-producing cells from both published and newly generated single-cell data. The results of the project can be found at the PI?s website: https://www.cc.gatech.edu/~xzhang954/.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
![]()
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
![]()
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Cells in an organism undergo significant transformations during development and disease progression. Unraveling the mechanisms that regulate these changes is crucial for identifying the key molecules responsible for ensuring proper cellular processes, as well as those that drive disease. The major goal of this project is to study cell regulatory mechanisms. Under this theme, we studied the following problems: 1. Information on cell dynamics, that is, how cells change their identity or cell types in a temporal process, is important to study the regulatory mechanism. We therefore developed methods to better learn cell dynamics. We developed a method to reconstruct high-resolution cell trajectories from RNA velocity, as well as a method to infer cell trajectories using integrated single cell multi-modality data. 2. Information on cell types and cell identity is also key for understanding regulatory mechanisms, as the regulatory mechanisms primarily lead to different cell types. We developed methods to integrate multi-modality data including spatial coordinates of cells, to learn cell identity. 3. Biological networks are natural representations of regulatory mechanisms that indicate interactions between molecules in the cells. We studied the evolution of protein protein interaction networks where we learned interactions that are preserved during evolution which are likely to play key roles. 4. Temporal dynamics of cells can also be studied using a specific technology named lineage tracing. We developed methods to reconstruct cell lineage trees from lineage tracing barcodes, as well as simulation tools to benchmark these methods. 5. Perturbation studies, that is, considering gene expression profiles under different conditions, can also help to learn the regulatory mechanisms. We developed a method to process single cell gene expression data from patients under different conditions, and to predict data under new conditions. These developed tools have been applied for analysis of real biological datasets and have led to collaborations with biologists. Research products on learning cell dynamics and cell representations have been incorporated into the PI’s course, Machine Learning in Computational Biology, and a summer short course, Computational Problems for Single Cell Transcriptomics and Multi-omics. All developed computational tools are available through public repositories, with links provided in the manuscripts. This project has provided opportunities to include undergraduate researchers, and three undergraduate students trained in the lab have gained authorship in associated publications. The group also has been providing research opportunities for master student researchers and students from under-represented groups.
Last Modified: 11/26/2024
Modified by: Xiuwei Zhang
Please report errors in award information by writing to: awardsearch@nsf.gov.
