Award Abstract # 2019771
BBSRC-NSF/BIO: IIBR Informatics: Collaborative Research: Inference of isoform-level regulatory infrastructures with studies in steroid-producing cells

NSF Org: DBI
Division of Biological Infrastructure
Recipient: GEORGIA TECH RESEARCH CORP
Initial Amendment Date: June 2, 2020
Latest Amendment Date: June 2, 2020
Award Number: 2019771
Award Instrument: Standard Grant
Program Manager: Jennifer Weller
jweller@nsf.gov
 (703)292-2224
DBI
 Division of Biological Infrastructure
BIO
 Directorate for Biological Sciences
Start Date: July 1, 2020
End Date: June 30, 2024 (Estimated)
Total Intended Award Amount: $400,000.00
Total Awarded Amount to Date: $400,000.00
Funds Obligated to Date: FY 2020 = $400,000.00
History of Investigator:
  • Xiuwei Zhang (Principal Investigator)
    xiuwei.zhang@gatech.edu
Recipient Sponsored Research Office: Georgia Tech Research Corporation
926 DALNEY ST NW
ATLANTA
GA  US  30318-6395
(404)894-4819
Sponsor Congressional District: 05
Primary Place of Performance: Georgia Institute of Technology
225 North Avenue, NW
Atlanta
GA  US  30332-0002
Primary Place of Performance
Congressional District:
05
Unique Entity Identifier (UEI): EMW9FC8J3HN4
Parent UEI: EMW9FC8J3HN4
NSF Program(s): Infrastructure Innovation for
Primary Program Source: 01002021DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 1165
Program Element Code(s): 084Y00
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.074

ABSTRACT

Cells are the fundamental units that provide functions needed to sustain life in living organisms. Cellular functions are carried out by proteins, products of genes, and the process of producing proteins from genes (i.e., gene expression) is mediated by complex regulation systems. Much remains unknown about the mechanisms of gene regulations. Given all genes in a cell, the regulatory relationships among genes can be represented by networks, called gene regulatory networks. It has been a long-standing challenge to reconstruct these networks experimentally and computationally. A gene can express multiple isoforms (mRNA molecules), and hence produces multiple different proteins, which makes the underlying gene regulatory networks more complicated. Recent advances in single cell RNA-Sequencing (scRNA-Seq) technology has brought new opportunities in resolving high-quality regulatory networks, but also posed new computational challenges. The project aims to computationally reconstruct accurate regulatory networks at the isoform-level from large-scale sequencing data. Educational and outreach activities, such as courses on topics in computational biology and inclusion of minority students, will be carried out.

The project will develop efficient approaches to identify expressed isoforms and to determine expression abundances, and then develop a network-reconstruction method which improves current state-of-art. The new computational methods will be validated and applied to the field of immunology--to study cellular mechanisms in steroid-producing cells. The project will make contribution in improvements over existing methods. First, the proposed methods for developing a scalable transcript assembler will enable accurate determination and quantification of the expressed isoforms, and make it possible to build regulatory networks at the level of isoforms to reflect the possible difference in regulatory mechanisms for different isoforms. Second, many recently developed methods for network inference require cells to be pre-ordered with trajectory inference or RNA-velocity to mimic time-series data. Errors in the cell ordering can mislead network inference and lead to false predictions. The project proposes to perform cell ordering and network inference simultaneously, which is expected to provide better results for both cell ordering and network inference. The project will reconstruct transcript-level regulatory networks for different types of steroid-producing cells from both published and newly generated single-cell data. The results of the project can be found at the PI?s website: https://www.cc.gatech.edu/~xzhang954/.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Pan, Xinhai and Li, Hechen and Putta, Pranav and Zhang, Xiuwei "LinRace: cell division history reconstruction of single cells using paired lineage barcode and gene expression data" Nature Communications , v.14 , 2023 https://doi.org/10.1038/s41467-023-44173-3 Citation Details
Oh, Sooyoun and Park, Haesun and Zhang, Xiuwei "Hybrid Clustering of Single-Cell Gene Expression and Spatial Information via Integrated NMF and K-Means" Frontiers in Genetics , v.12 , 2021 https://doi.org/10.3389/fgene.2021.763263 Citation Details
Rajan, Vaibhav and Zhang, Ziqi and Kingsford, Carl and Zhang, Xiuwei "Maximum likelihood reconstruction of ancestral networks by integer linear programming" Bioinformatics , v.37 , 2020 https://doi.org/10.1093/bioinformatics/btaa931 Citation Details
Zhang, Ziqi and Sun, Haoran and Mariappan, Ragunathan and Chen, Xi and Chen, Xinyu and Jain, Mika S. and Efremova, Mirjana and Teichmann, Sarah A. and Rajan, Vaibhav and Zhang, Xiuwei "scMoMaT jointly performs single cell mosaic integration and multi-modal bio-marker detection" Nature Communications , v.14 , 2023 https://doi.org/10.1038/s41467-023-36066-2 Citation Details
Zhang, Ziqi and Yang, Chengkai and Zhang, Xiuwei "scDART: integrating unmatched scRNA-seq and scATAC-seq data and learning cross-modality relationship simultaneously" Genome Biology , v.23 , 2022 https://doi.org/10.1186/s13059-022-02706-x Citation Details
Pan, Xinhai and Li, Hechen and Zhang, Xiuwei "TedSim: temporal dynamics simulation of single-cell RNA sequencing data and cell division history" Nucleic Acids Research , v.50 , 2022 https://doi.org/10.1093/nar/gkac235 Citation Details
Zhang, Ziqi and Zhang, Xiuwei "Inference of high-resolution trajectories in single-cell RNA-seq data by using RNA velocity" Cell Reports Methods , v.1 , 2021 https://doi.org/10.1016/j.crmeth.2021.100095 Citation Details
Pan, Xinhai and Zhang, Xiuwei "Studying temporal dynamics of single cells: expression, lineage and regulatory networks" Biophysical Reviews , 2023 https://doi.org/10.1007/s12551-023-01090-5 Citation Details
Zhang, Ziqi and Zhao, Xinye and Bindra, Mehak and Qiu, Peng and Zhang, Xiuwei "scDisInFact: disentangled learning for integration and prediction of multi-batch multi-condition single-cell RNA-sequencing data" Nature Communications , v.15 , 2024 https://doi.org/10.1038/s41467-024-45227-w Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Cells in an organism undergo significant transformations during development and disease progression. Unraveling the mechanisms that regulate these changes is crucial for identifying the key molecules responsible for ensuring proper cellular processes, as well as those that drive disease.

The major goal of this project is to study cell regulatory mechanisms. Under this theme, we studied the following problems: 1. Information on cell dynamics, that is, how cells change their identity or cell types in a temporal process, is important to study the regulatory mechanism. We therefore developed methods to better learn cell dynamics. We developed a method to reconstruct high-resolution cell trajectories from RNA velocity, as well as a method to infer cell trajectories using integrated single cell multi-modality data. 2. Information on cell types and cell identity is also key for understanding regulatory mechanisms, as the regulatory mechanisms primarily lead to different cell types. We developed methods to integrate multi-modality data including spatial coordinates of cells, to learn cell identity. 3. Biological networks are natural representations of regulatory mechanisms that indicate interactions between molecules in the cells. We studied the evolution of protein protein interaction networks where we learned interactions that are preserved during evolution which are likely to play key roles. 4. Temporal dynamics of cells can also be studied using a specific technology named lineage tracing. We developed methods to reconstruct cell lineage trees from lineage tracing barcodes, as well as simulation tools to benchmark these methods. 5. Perturbation studies, that is, considering gene expression profiles under different conditions, can also help to learn the regulatory mechanisms. We developed a method to process single cell gene expression data from patients under different conditions, and to predict data under new conditions. 

These developed tools have been applied for analysis of real biological datasets and have led to collaborations with biologists. Research products on learning cell dynamics and cell representations have been incorporated into the PI’s course, Machine Learning in Computational Biology, and a summer short course, Computational Problems for Single Cell Transcriptomics and Multi-omics. All developed computational tools are available through public repositories, with links provided in the manuscripts. This project has provided opportunities to include undergraduate researchers, and three undergraduate students trained in the lab have gained authorship in associated publications. The group also has been providing research opportunities for master student researchers and students from under-represented groups. 


 


Last Modified: 11/26/2024
Modified by: Xiuwei Zhang

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page